Gene Cagg_3687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3687 
Symbol 
ID7268222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4481727 
End bp4483478 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content55% 
IMG OID643568493 
Producthypothetical protein 
Protein accessionYP_002464959 
Protein GI219850526 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACAC CCGATCCGTT CTCGTTACCG CCCGTTTCGT CTACCGAACC ACGACGCAGT 
CCATTGTTCC TGATCATCGG CGGTTTAGTC ATCGGTGGAT TACTGATCAT CGTCGGTGGT
GGTGCGTTGG TATGGAGCAT GATCAACCAG CGTGGTAGTG CCATTCCTGA ATTATTACCG
GCTGAAACCC AAATCTACGC TGCGATCACG CCCAATCTGA GCGATCTGCC GAATATTGAC
CGTTTACGAC GAGCCTTTCC CGAAACCTTC GACTACCAGA ACACCGACCA AACGAGCGAT
TTTTTGCAAG AACGCTTTGG TGTAACGTTT GCCGATGACA TCGCGCCGTG GATAGGTGCT
GAAGTAGCGG TCGCCGTCTA CGGCTTGCCG ATCGAGCAGC TAAGTGCAGT CGTCGGCGAA
TTGTCCAATC CATTCAATCC GCCGGCAACA CTCAACCCGC TAGAAGATGC TGATTTACGC
AACACCAATG TGCTGTTGAT CGTAGCAGTT CGTGATCAAC GGGCCGCCCA GGCTTTTCTC
GACAAACAGC GCACGTTTCG AGAGGCGCAG GGTGAGCGTT TTACGAACAG CACAACGAAT
GGGGTGACGA TCTACGAAAG TGAGAGTGAT GAAACGGCGT TTGCTGCCTT CGCACTGGCC
CGCAATATGG TCGTCTTTGC CAACAATGCC ACGAGCATCT CTACGCTGAT CGAGCAACGT
AGCGAGACCG CACTGGCCCG TAGCGCACAA TTTCAAGCCG TGAGCCAGCG CTTGCCGACT
GACCGGATTG GCACGATCTA TCTTGCCGGA GATGGATTGG CTCGTTTTAT TGACAGCCTC
TTTGCATCAG GCTCACTCGA TGAGACCGTG CCAATGCTGG CCGATATGCA ATCGGCAGCC
CAAGCTATGC AAGGAGTCGG CTTCACAATG GCCGTTATCG AGAGCGGTCT GCGCTTCGAT
GCAGTGACCG TCTTTGATCG CAACCGGCTG AGTAATGCAC TGCGCGAGCA ACTCGGTAGC
CTGCGCCCAA CCGTCTCGCC CGAACGAGCC GGTGATGTCA GCAGCACCGC GATCGGTGTA
TTCAGTTTTG GCATACCTGC CGATTGGGGG CAGCGTCTCC GTGATCAGTT AGAGGCCGAA
CCTGAAACTG CCAATGCGCT GCGTGATCTC GAAGACAGTC TCAACATCAG TCTCGACCGC
GACTTGTTTA GCTGGTTTCA CGGTGAAGGG GTGATCGCGC TGTTGCCTAT CGATAGTGTC
GAATTGCCGG TAGGAGGCTA CTTCGCGCTG CGTGTTGCCG ATCGGTCGGC TGCCGAGCGA
GGTATGCAAC GGCTCATTGA ATTGGCCGAA GACCTTACCG GTATCCGGAC CGGTACAACC
TCGCTGGGGC GCACGCAAGT GCAAGCGTTT GAAGAGGGCG ATCTCTTCTT TGGGTACGGC
TTCAACGGCA ACGATCTGGT GATTGCAGTG GGTCGACCGG CGATGGAAGC TGCCTTTGGC
GTCGAACAAA AACTGTCAAG TGTGGCGACC TATGCGAATG CGTTGAAGGC GATGCCCTCT
CCCAATGGTG GTGTCCTGTA TATCAACCTT ACCGCAGCCC GCAGGTGGTT TAACCAGACA
AATGATCCGA TTGACCCCGA ACTTGAGCAG CGGTTGGCTC CATTCACTGC TATCACGAGC
AGTGGCACGG TCGGGATCGA TGATCGTGGG GTAATGCGTG GTACGCTGCT GTTAAGTATT
GAACCGCAAT GA
 
Protein sequence
MTTPDPFSLP PVSSTEPRRS PLFLIIGGLV IGGLLIIVGG GALVWSMINQ RGSAIPELLP 
AETQIYAAIT PNLSDLPNID RLRRAFPETF DYQNTDQTSD FLQERFGVTF ADDIAPWIGA
EVAVAVYGLP IEQLSAVVGE LSNPFNPPAT LNPLEDADLR NTNVLLIVAV RDQRAAQAFL
DKQRTFREAQ GERFTNSTTN GVTIYESESD ETAFAAFALA RNMVVFANNA TSISTLIEQR
SETALARSAQ FQAVSQRLPT DRIGTIYLAG DGLARFIDSL FASGSLDETV PMLADMQSAA
QAMQGVGFTM AVIESGLRFD AVTVFDRNRL SNALREQLGS LRPTVSPERA GDVSSTAIGV
FSFGIPADWG QRLRDQLEAE PETANALRDL EDSLNISLDR DLFSWFHGEG VIALLPIDSV
ELPVGGYFAL RVADRSAAER GMQRLIELAE DLTGIRTGTT SLGRTQVQAF EEGDLFFGYG
FNGNDLVIAV GRPAMEAAFG VEQKLSSVAT YANALKAMPS PNGGVLYINL TAARRWFNQT
NDPIDPELEQ RLAPFTAITS SGTVGIDDRG VMRGTLLLSI EPQ