Gene Cagg_0600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0600 
Symbol 
ID7266072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp740849 
End bp744187 
Gene Length3339 bp 
Protein Length1112 aa 
Translation table11 
GC content57% 
IMG OID643565463 
ProductTPR repeat-containing protein 
Protein accessionYP_002461975 
Protein GI219847542 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.272318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.386774 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATGGC CGTGGCGACG AACACCGACT GAAGGCGAGA GCGAGCAACA TAGCGAGCGA 
ACAACTGCGC AACGGCATCT TCGGCGCTGG CGTCGACGAT TACTCATTGT CATCATCATC
GGCATTGTCA TCAGTCTCTT CGTACTATTT CGTCCGGTTG ATCCGCGATT TGTCATTGTG
GTCGCACCCT TTAGCGACCA AGACGGCCGT ACCGGGCAAC AGATTGCCGG CGCACTGGCG
CGCCAATTGC GGATCAACGG TAACAACCTC GTACAGGTGA TCGAAAGCCG TACCCGCCCG
ACAAATAGTG CTGAGGCACT GACGTTAGCG CAACAGACGA ATGCTGACAT CCTGGTGTGG
GGATCGGTGG AAGCGGGTGG TATTCTTGAC AGTCCTAGTC TGCGACCCGA ACTTATCTAT
CTCCCCCACA ATATTGACGT GAGCCAAAGC TGGTACACCT TTGCCATTCG TTTTGCAATA
CCGTACCATT TTGTGCTGAG TACCGCGCCA ATGAACGGCC AAACGACGCT CGTCCCATAC
CTCCTCGCGC TTGCCGCCTA TCACCACGGC GAAGCTGATT TGGCCGTCGA ACGGTTGCAA
CAACTGCTCG AAATCAATCC ACAGGTATCG CCACTGCTCC CCCACATCCT GCGTGGCAAT
CTCTTGTGGG CACGAGGCTG GTATTCCGCC GCTACAACCG AATACCGGGC AACTCTTTCC
GTTGCAAATG GTGATCAGGC GTTATTAACC AACAACCTCG GTGCGATCTT GTTTGATGCC
CAAGACCCTG AGGCACTCCG TCTGTTTGCC GAAGCCATTG ATCTCCTCGA TGGTCGCGAT
CTCGGTCAAC TACGGGTAAA CCTCGCTTTG TGGGCCTTAC GTGAACAGCG CGCCGGTGAT
GCCGTATCCG ATCTGGAGCA GGCCCGCAAC CTGTTGCCAC CTCACCCGGA GCTGGAACTG
CTCCTCGCTA GCGCGTATCG CGAGAACGGT CAGCTCGATA AGGCAGCAGA TAGCCTCGCT
CGGGTTGAAA CGGCGAAAAC CGCGATCCTG GCGCGCACGC CGTTAGCAGT ACGGACGGCC
GTTGAGTTGC GCCTGAACAG CCTTCTGACC GAAGAGCGTG CTCTGATCGG GATCGAGCGG
AAACTGCCAC TGGCCGGCCC ACTCTCCTGG GCACTCGAGG CTGCCGACCC CTTGCCACCC
GCGGCCGATC TCCGACAGAC ACGCGATCAA CTGCGCACGG CCACCGAATA CTGTACCCGT
TCGATCAGCT TGTGGCGCCA GCGCGCAGCG AGCGAAGCAG CGATCTTCCC CGGTACCGGC
CTTATGGCAA CCGGACAAGC CGAACGCAGT GAAGAATTGG CTTATCGTCA ATCACGCACG
CTCGCACTGA TCGAAACCGC ACTTCAAGCA GTTGATGGGC AAACGAATAC CAGACCAAAC
CAGTGGTTCA TCGCCCTATT CGGTACTACG CGCGTTGACA ATCCGACCAT TGAGAAATTA
CGGCAATTGC GTGACAAACA ACCGGACGAT GTGTTGACAT TACTCGCCCT CGGCTTTGCC
TTGCGGATTG ATGGGCAACT CGATGCGGCA AACGAAAGCT ATCAGCAGGT CATCGAACAG
GCCCCACAAT TACCCGATGG CTATGTAGGC CTCGGTAAGG TGGCACTCGC GCGTAACGAT
CGGGAACGTG CGGTAACCGC GTTTCGCTCC GCCCTCGAAC GGAATAACCG TTTCTTTCCG
GCGCATGTGG CTCTTGCGCG CCTTGCCGAA GCTGACGGTG ATTGGCCGGC AGCTATTGAA
CACTGGCGCG CGCTCGTGGC ATGGCAAGAG TCGCCGTACA CGGCTGTGAA TCTCGCCCGT
GCTCTCCGGC GTAGTGGGGT CAGTGGCTTT GCTGAAGCAG AACGGATACT CGTACCGTTG
GCAACAACCT CTGCCGAAGC CGCGATTGAG TTGGCCCGCT TGTACAACGA TGCCGGCTAC
CCCAAAGAGG CGGCCGCCGT GTACCTTGAT GCCCTCACCC TTGATCCGCG TTCAACGGTT
GCGGCGTTTG AACTTGGGGA GACCTACATC CGGCTCGGTG AACTGACCGA AGCTGAGCGC
AGATTACGCG ATGCAATTAC CTTTGACGAA CGAAACCTCG ATGCCCGCCT CCGGCTGGCC
GAACTGTATG AAGGTCCGCT TAATCAGCCC AATCGGGCGA TTGACCAGTA CCGTATCGCG
TTGGGACAGG GTGTTAACGA TCTCAATCAA CTGATTGCCA TCGGTCAGGC GGCATTGAAC
GTCAACACCG CGACGGTCGC GATACAGGCA CTTGAGCGTG CCCTGCAACT GAACCCCGAC
TCGGTGACGA CCTATCAATT GTTGGCCCGC GCGTATCTGA TCGGCAATCG TCTCGAAGCA
GCAGCGCAAG AAGCCCTACA AACACTGCAA CGGACGGCAA ATCGGAATGA TCCGGAAGCA
GTCAATGCAC GAGTCAATGC GTTACTCACC CTCGCCGACA TTGCGCGCCG CCGTAACGAT
AGCGCTGCCG CCGAGGATTA CTATCGGCAA GTGCTGGCGG TTGACCCGCA ATCTATCGCT
GCCCACATCG GTTTGGGTGA ATTGGCCGGT GAGCAGGGAC AATGGGGAAT TGCGCTCGGC
TATTTCGAGA CCGCGGTAGC CTTACCGGGA GGCGACACGA ATGCTACTGC CCAGTTCTGG
CTTGGTGAGT CTCTCTTGCG GAGTGGTCAA CTCGCCCGTG CCCGTACTGC TTACCAACGT
GCCTTGGAAC TACAACCGAT CTACCCTGAA GCGTTGCTTG GACTAGCCCA AACGCAACAT
GCCCTTGGTC AAGCTGATGC AGCATTACAA ACGGTTGAGC AAGCCCTGCA CCAACGGAGC
AATTACGCTG AAGCGCATCT CTTTCGCGGC AAGTTATTGC AAGAAGCCGG TCGTTTTGCC
GAAGCACGTG CCGCTTACGA TGCAGCCATC GGCACCAACG ACCGCATTGC CGAGAGTTTC
TACCGCCGCG CACTGTTGGC AATTCGTGAC AACGATTACG ATCAGGCGAT CCGAGATCTC
AACCGCACCG TGACGCTCCA GCCCAACTTT CCCGAAGCTT ACTACTGGCT TGGCCGCGCT
TATTACACAC AAGGCCGCAT TGAGAACGCT CAGCAGGCGA TAGAACGGGC AATCACGCTC
AACCCTGATT ACAGTGAAGC TATCTTCTAC AGTGGCTTGA TCGCCGAAGA TCGGGCCAAC
GTCGCTGCTG CCCGCGATGC GTATCAAACG CTCATTAGTC GTGAGCCAAC TAGCGAATGG
GGACAACGTG CGCGCGCCCA ACTCGAACGT TTGCCATGA
 
Protein sequence
MRWPWRRTPT EGESEQHSER TTAQRHLRRW RRRLLIVIII GIVISLFVLF RPVDPRFVIV 
VAPFSDQDGR TGQQIAGALA RQLRINGNNL VQVIESRTRP TNSAEALTLA QQTNADILVW
GSVEAGGILD SPSLRPELIY LPHNIDVSQS WYTFAIRFAI PYHFVLSTAP MNGQTTLVPY
LLALAAYHHG EADLAVERLQ QLLEINPQVS PLLPHILRGN LLWARGWYSA ATTEYRATLS
VANGDQALLT NNLGAILFDA QDPEALRLFA EAIDLLDGRD LGQLRVNLAL WALREQRAGD
AVSDLEQARN LLPPHPELEL LLASAYRENG QLDKAADSLA RVETAKTAIL ARTPLAVRTA
VELRLNSLLT EERALIGIER KLPLAGPLSW ALEAADPLPP AADLRQTRDQ LRTATEYCTR
SISLWRQRAA SEAAIFPGTG LMATGQAERS EELAYRQSRT LALIETALQA VDGQTNTRPN
QWFIALFGTT RVDNPTIEKL RQLRDKQPDD VLTLLALGFA LRIDGQLDAA NESYQQVIEQ
APQLPDGYVG LGKVALARND RERAVTAFRS ALERNNRFFP AHVALARLAE ADGDWPAAIE
HWRALVAWQE SPYTAVNLAR ALRRSGVSGF AEAERILVPL ATTSAEAAIE LARLYNDAGY
PKEAAAVYLD ALTLDPRSTV AAFELGETYI RLGELTEAER RLRDAITFDE RNLDARLRLA
ELYEGPLNQP NRAIDQYRIA LGQGVNDLNQ LIAIGQAALN VNTATVAIQA LERALQLNPD
SVTTYQLLAR AYLIGNRLEA AAQEALQTLQ RTANRNDPEA VNARVNALLT LADIARRRND
SAAAEDYYRQ VLAVDPQSIA AHIGLGELAG EQGQWGIALG YFETAVALPG GDTNATAQFW
LGESLLRSGQ LARARTAYQR ALELQPIYPE ALLGLAQTQH ALGQADAALQ TVEQALHQRS
NYAEAHLFRG KLLQEAGRFA EARAAYDAAI GTNDRIAESF YRRALLAIRD NDYDQAIRDL
NRTVTLQPNF PEAYYWLGRA YYTQGRIENA QQAIERAITL NPDYSEAIFY SGLIAEDRAN
VAAARDAYQT LISREPTSEW GQRARAQLER LP