Gene Cagg_3152 details


Gene Information

Locus tag: Cagg_3152
Symbol:
ID: 7269901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3825794
End bp: 3830944
Gene Length: 5151 bp
Protein Length: 1716 aa
Translation table: 11
GC content: 56%
IMG OID: 643567973
Product: conserved repeat domain protein
Protein accession: YP_002464446
Protein GI: 219850013
COG category: [S] Function unknown
COG ID: [COG1470] Predicted membrane protein
TIGRFAM ID: [TIGR01451] conserved repeat domain
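
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 3825794
END_BP = 3830944
GENE_LENGTH_BP = 5151
PROTEIN_LENGTH_AA = 1716


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 5151
    print(expected_protein_length(GENE_LENGTH_BP))  # 1716
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.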


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCGTC ATCTGCGCAT TGCCATCCTG ATTATTACCT TGATCGGCTT GTTGCCTACA 
ACCAAACCGG TCTACGCCGC TGCATTTGTA GTCAACTCAC TGGCCGACAC CGACGACGGG
GCTTGTACCA CCAGTGCCAA CGGATGTACA CTGCGCGAAG CTATCAATGC CGCGAACAGT
AATGGTATCC CCGACACGAT CACCTTTAGC GTGAGCGGAA CGATCTACGT CCAAAACGCC
GGGCTACCAC CACTAACCGA AGGCAATACC ACGATTGACG GTGGCAATGG GCATACGGTG
GTCATCAGCG GCGAACAGTT GCGTGACGCT AATGATACCC CGATCCCAGC GCATGGTTTG
GGCATCGCAT CCAGTAACAA CGTTATTCGC GGACTGGTGA TTATCCGCTT TAGTCGCGGT
ACCAGTGTCG GCGGTGCCGG GATTTACCTG TATAACAATG CGCAGAACAA TCTGATCGCC
AACAACTGGA TCGGCCATCT TAACGGTTTG CCTGAACCGA ACACGGGGTA TGGTATCCTC
GTCGATGGTG GCGCAAGCAA TAACCGGATC GGAACCGGTG ATCCGTCTGA TCGTAATGTC
ATAAGTGGTA ATGTCATTGC CGATATTTCG ATTAGTAATA CCACGAGTAC TGCCTTCGTT
AGTGGTAACC AAATCCTTGG CAACTACATC GGCACGACCG TTGCCGGCGA TGCCGCCCTA
CCGGTCACAC CGTTAACGGC AAACCTCGGT GGTATCACTA TCGAGCAATA TGCCCGCGAC
ACCGTTATCA GTGGTAATGT GATCGGTGGT TATCTCGGTT CCAACGCTGC CGGTATTGTG
CTCTTTAGCA ACTCGACGAG CGCCGGCTCA TCCAGCATTC CCCGTACTAC CCGGATTACC
GGGAACTGGA TCGGCGTGAC ACCTACCGGT ACGGTCATTG CCAACCGGAT CGGTATTTTA
GTTTCGGGCG GCGCTGTGTA CGGTGCCATC GACACGGTGA TCGGTGATCC GATTGACCCG
GTGGGTGGGC GAAATTATAT CAGCGGCAAC ACCAACGGCG GGATTGTGAT CAGCGATACC
CAATTCGCGA CCGGCCCTAC GACTATTGTC GGGAACTGGA TCGGTATTGC CCTCAATTCA
AGTGGTAATC CCTTCCCGGT CGGCAACGGC ACGATCAGTC AAGTGAGCGG TGGTGAAGGT
GTTTTCGTTG GACGCAACAG TGTAAACACC GTTATCGGGC CGGGCAACGT GATTGCCGGT
GCGCGCACGA ATGCGATCCG TATCCGATCC GGTAACACGA TTGTGCGGGG TAACTATCTG
GGTACCGATC CAACCGGTTC CCAAACCACC ACCACCACTA TCAGCCAACC AACCGTTGGC
TACGGTACTG GCGACGCCAC GGTCTATATT GAGAACGGCA GTAATTCGCA GATCGGCGGA
CCCAACCCGT CCGACCGCAA TGTGATTGCC AACGGTAACT TTGCTTACAC CGGATCAGGG
GCGGCGGTGT TGATCGAACC ATGCGCTACC TGTACCGCCA ACAGCAACAT CGTAGAGGGG
AACTATCTCG GTGTACGGGC TGATGGTAAC AGCATCCTTG CCAGCAGCAT TCTTGCCGAG
AGTGAAGGAT TACGGCTCAG CGGCGTCAGT AATACAACAG TACGCAACAA TCTCATTGGT
GGTGTCGATC GCGGTATCAA CATTCGCAAT AGCGCGAGTA ATAATCTGAT CGTTGGTAAC
CGTATCGGTG TCCGTGCTTC GAGTAGCGAG ACACCGGGAA GTGGTACGAC CACTCGTAAA
GATGGTATCC AACTCAATAG CGGTAGCAAT AATCGGATTG AGAACAACCT GATCGCCTTC
ACCGGTCAGT CTAATTTAAG CCTCGTAGGT GCTGCTCACG GTATTACTGT CAAAAGCAGC
AATAACCAAC TGAACGGCAA CCGACTGGTG CGCAACGGTC AACTGGGTAT CGGTCATGGT
ATCTTTGTCG CCAACGGTGT TTCGGGGGTT CTGATCTCTC GTAATACGAC CCAAGACAAC
GCAGGCGATG GGATTAGCCT TGAAAGTGGT GCAAACGGCG GGTTAGCTGC GCCAACCTTC
AACCCGATTA CCGCCGGTTC CCCAATCGTC ACCGGATCGA CGGGCTGCGG AGCAAACTGC
GTAGTCGAAA TCTTTACCAC CAGCGCCAGT ATTACCGACC GCGATCGGGA GGGGCCGGTC
TTCTTAACCA GCGTTACGAC GACAACCGGC GGCAGTTTCA GCGCCAACAT TACCGGTTGT
CTTGGTTATA TCACCGCGAC CGTCCACAAT CCGACAACCG GCAATAGTTC GCCGTTCTCC
AACGCGCTCA ACGTCAGTGC CACCGATGCC TGTGCCACAC CAACTGCTAC GCTCACCGTG
ACCGGTGGCA CTAGTCGGGT TGTGAGTATC GGTAGCACAT CCACGTATAC GCTGACTCTT
AGCCATACCG CATCGGTCAC CCGGACCTAT ACCTTAAACC TTACCAGTGA TCGCGGCTGG
ACGAGCGGCC CCGCTCTGGT CGAGGTGCCA CCCCAAGGCA GCACCGAGAT ACTGATCGGT
GTTCTCGTGC CGTTTACGGC AGTAGCCGGT GATACCGACA CCACGACTGT GACCGCTCGC
AGTGATCAAA CGCTCTCGAA CAGCGTTACC CTGACAACCG TTGCCCAAGC CGCAACCATT
ACTCCTGCGC AACCGGTAGT CTCACCCGGT TACATCATCG AACGCAGCGG GAATACCATC
ACCTTTACCC ATACTGTGAC CAATACCGGC CAGTTGATCG GTAATCTCAG TGTGATCCGA
CCTGATGGCA GCAGCGGGTT ACCGGTATTC AGCGGGACGC CACCGACCGG ATGGAGTATT
GTTTCGGCTA CATTCGGCAG CACAACACTA GCTGCCGGTA CCACGACAAC ACTCACCATC
GTCGTCAATA CACCGGATAG CGGAATGCTC ATTGCCGGTG ATTACTCGTT TGCATTCCGC
GTGCGCGCGG TGAGTCAGCA GGGCACGCAG ATCTTTACGG AGCAGAGCGA TCCGCCGACG
ACCGATACCG TGCGCGTGCC GGTGGTGCGC AGCTTCGAGT TTACTGCTCT GGACCCAACC
ACCCGCCAAT TGACACCGGC CAGTTCGGTT GAGTTCAGCT ATGTCATCAC GAATACCGGT
AATTTTACCG ACACCTTTAT CATCATGCCG CCGACCGGCA CAACTCCCGC TTCCAGCCTG
ACGTTTGCCA CTGCCCCAGC TAGCAACTTC ACCTTGGCCG CCGGACAATC GCGCCCGATC
ACCCTCACCG TTACCGCTGG CGCCAGTGAG CCGGTTGGCT TCTACAACTT TACCGTCCAA
ACCGGTGTCA CCGGCGGCGC TAACCCGCCA GCCAACCGCA CGACGACTGG CACGGCGCAG
GTCATCGGCG GCGGCACGCC GATCTTTGTG GGTACGCCAA TCGTGGCCCC TAACCCGGTT
GATCCAGGCG CGACGGCAAC GATCACTATC ACGGTGCGCA ACGGCGGAAA CGCTGCGACC
CCGTTTGAGT TTACCCAAAC CTTGCCATCC GGCTGGAACC TTATCGGCAG CAGTACGACA
TGCCCTATAC CGGTACCGAC AAACGGTACA ACGTGTACGT ATACGTTGCA AGTCGGCGTT
CCGGCTGACG CCGATGGTGG GGAGACGACG GTTGAGGTGC AGGCGATTGC TCGCAATGGT
GGTCAAACCC CGCCGGCGCC GGATAGCACG GCCAATCAAC CGGTTACTGT CACCGTTGCC
GCTGTGCGTA ACCTTAGCTT TACACCGACA TCGCAGACTA CCAATGCCGA TCCGGCAACT
ACGGTAAGCT TTACCCATAC CTTGACCAAT ACCGGCAATG CACCCGACAG ATTTACCCTC
AATCTTAGCG GTCTACCCAG CGGCTGGACG GCGACAGTCG ATCCGGTGAC GACGCCTATC
CTTGCGCGTA ATGCCAGCAT CACTGTGACC GTGCAGATCA CGGTTCCAAC CGGCATCGCC
GCCGGAACCA CTGCTACCGC AACGGTACGC GCCACCTCAC AAGGCAATCC GGCGGTATAC
GCCGATGTTG CTGATAGCAT CACCGTCAAT GCGGTGAACG GATCCGTACT CTCGCCGGGC
ACGACGGTGA ACAGTACGCC GGGCGCAACC GTTGTCTTGA CCCACACCCT CCAGAACAGC
GGCTCAACCA CTACTGCCTA TGATGTGGCC GTCCAGAGTA CCGATCCCGG TTGGTCGGCA
CCGATCATCG AACCGGTCAC CACACCCGTC CTCACGCCGG GAAGTAGTAC GCTCATTACG
GTGACCGTCA CTGTCCCTAC CACCGCACCA CCCGGTACCA GCAATCTGAT CACCGTTACC
GCCCGCGCCA CCGGCGACAT AACCATACTG GCAAGCGCCG AGCACACCAT CCAAGTCGGT
GCTTTACGGA ATGTAGTAAT CGAACCAGAA CGTAACGTGA TCGCATTACC CGACATGACT
ACCGTCATTA CGCACACGGT ACGCAACATC GGTTTCAGCG CCGATTCGTA CACTATTACC
GCACTTCAAG GCGACGGATT AAGCGCCATT GCGACACCGA ACCAAATTGA TTTAGGACCG
GGTGAAAGTC GTGAAATTGC GGTCTTGTTA ACCTTACCGG CAGGTCTTGC TGCCGATACC
GTGCTGAGCA ATATTCGAGT CACTGCTATC TCTCGCAGTG ATCCGTCGGT CAAAGCAAGT
GTACTCGATC GGGTGAGGGT AGGGCTGGTA ACCGGTGTCG TACTGAGCAG TGACCGTCTA
CGTGGCATTC CGTCTGGGAT CAACCGGCTC ACTTTTAGTG GGATCGAACT CGAAAATCTG
GGTAATGCGC TCGACACCTT CGATCTGACA GTGAGCGGGC TGGATAGTAG GTTCGGGGTG
ACGGTGATCC CCAACGAAAT CACCCTCAAC GGCGGCCAAC GCGATGTTGG GATCAGCGTC
ATCGTCAACC TCCCGCCGAT CCAGCCGGCA GCACTACGCC ACGACCTGGT GCTCACTGCA
ACATCGCGGC GCGATCCGTC ACAACAGAGT AGGATTCGGC TTTCAATGAT CTATCTCTAT
CGGGCAGATA TGTTTGGCGA ACCGATCTTT ATACCATTGG TAAGTCGCTA A
 
Protein sequence
MHRHLRIAIL IITLIGLLPT TKPVYAAAFV VNSLADTDDG ACTTSANGCT LREAINAANS 
NGIPDTITFS VSGTIYVQNA GLPPLTEGNT TIDGGNGHTV VISGEQLRDA NDTPIPAHGL
GIASSNNVIR GLVIIRFSRG TSVGGAGIYL YNNAQNNLIA NNWIGHLNGL PEPNTGYGIL
VDGGASNNRI GTGDPSDRNV ISGNVIADIS ISNTTSTAFV SGNQILGNYI GTTVAGDAAL
PVTPLTANLG GITIEQYARD TVISGNVIGG YLGSNAAGIV LFSNSTSAGS SSIPRTTRIT
GNWIGVTPTG TVIANRIGIL VSGGAVYGAI DTVIGDPIDP VGGRNYISGN TNGGIVISDT
QFATGPTTIV GNWIGIALNS SGNPFPVGNG TISQVSGGEG VFVGRNSVNT VIGPGNVIAG
ARTNAIRIRS GNTIVRGNYL GTDPTGSQTT TTTISQPTVG YGTGDATVYI ENGSNSQIGG
PNPSDRNVIA NGNFAYTGSG AAVLIEPCAT CTANSNIVEG NYLGVRADGN SILASSILAE
SEGLRLSGVS NTTVRNNLIG GVDRGINIRN SASNNLIVGN RIGVRASSSE TPGSGTTTRK
DGIQLNSGSN NRIENNLIAF TGQSNLSLVG AAHGITVKSS NNQLNGNRLV RNGQLGIGHG
IFVANGVSGV LISRNTTQDN AGDGISLESG ANGGLAAPTF NPITAGSPIV TGSTGCGANC
VVEIFTTSAS ITDRDREGPV FLTSVTTTTG GSFSANITGC LGYITATVHN PTTGNSSPFS
NALNVSATDA CATPTATLTV TGGTSRVVSI GSTSTYTLTL SHTASVTRTY TLNLTSDRGW
TSGPALVEVP PQGSTEILIG VLVPFTAVAG DTDTTTVTAR SDQTLSNSVT LTTVAQAATI
TPAQPVVSPG YIIERSGNTI TFTHTVTNTG QLIGNLSVIR PDGSSGLPVF SGTPPTGWSI
VSATFGSTTL AAGTTTTLTI VVNTPDSGML IAGDYSFAFR VRAVSQQGTQ IFTEQSDPPT
TDTVRVPVVR SFEFTALDPT TRQLTPASSV EFSYVITNTG NFTDTFIIMP PTGTTPASSL
TFATAPASNF TLAAGQSRPI TLTVTAGASE PVGFYNFTVQ TGVTGGANPP ANRTTTGTAQ
VIGGGTPIFV GTPIVAPNPV DPGATATITI TVRNGGNAAT PFEFTQTLPS GWNLIGSSTT
CPIPVPTNGT TCTYTLQVGV PADADGGETT VEVQAIARNG GQTPPAPDST ANQPVTVTVA
AVRNLSFTPT SQTTNADPAT TVSFTHTLTN TGNAPDRFTL NLSGLPSGWT ATVDPVTTPI
LARNASITVT VQITVPTGIA AGTTATATVR ATSQGNPAVY ADVADSITVN AVNGSVLSPG
TTVNSTPGAT VVLTHTLQNS GSTTTAYDVA VQSTDPGWSA PIIEPVTTPV LTPGSSTLIT
VTVTVPTTAP PGTSNLITVT ARATGDITIL ASAEHTIQVG ALRNVVIEPE RNVIALPDMT
TVITHTVRNI GFSADSYTIT ALQGDGLSAI ATPNQIDLGP GESREIAVLL TLPAGLAADT
VLSNIRVTAI SRSDPSVKAS VLDRVRVGLV TGVVLSSDRL RGIPSGINRL TFSGIELENL
GNALDTFDLT VSGLDSRFGV TVIPNEITLN GGQRDVGISV IVNLPPIQPA ALRHDLVLTA
TSRRDPSQQS RIRLSMIYLY RADMFGEPIF IPLVSR
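
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any computation. The Python sketch below is an illustration only, not part of the original record. It shows one way to clean such a block, compute its GC percentage, and translate it codon by codon. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, which to my understanding shares its amino-acid assignments with translation table 11 (the tables differ in which start codons are permitted). The sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Standard genetic code in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both begin on a codon
# boundary because every full line holds 60 bases.
FIRST_LINE = "ATGCATCGTC ATCTGCGCAT TGCCATCCTG ATTATTACCT TGATCGGCTT GTTGCCTACA"
LAST_LINE = "CGGGCAGATA TGTTTGGCGA ACCGATCTTT ATACCATTGG TAAGTCGCTA A"

if __name__ == "__main__":
    print(translate(clean(FIRST_LINE)))  # MHRHLRIAILIITLIGLLPT
    print(translate(clean(LAST_LINE)))   # RADMFGEPIFIPLVSR
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` gives a figure to compare with the GC content field.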