Gene Cthe_2702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2702 
Symbol 
ID4810696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3187059 
End bp3189296 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content40% 
IMG OID640108121 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_001039094 
Protein GI125975184 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG0438] Glycosyltransferase
[COG2327] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03609] polysaccharide pyruvyl transferase CsaB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000746297 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTC TTCATTTAAT AGGCGGAGGA GACGTTGGCG GAGCAAAAAG CCATGTTCTC 
TCTCTGGTCA GGGAACTTGG CAAACATATA AATGTAAAAC TTATAAGTTT CAGGACAGGT
GCTTTTGCCG ATGACGCCCG TGCAATGGGT ATAAACGTGG AAGTTGTAAA TACAGGCACC
ATTTTTTCTG ATGTGCGTAA AGTTTTAAGA ATTGTAAGGG AGGAAGGGTA CGAGCTTATT
CATTCCCATG GTGCAAAAGC CAACATGATT GCCGTTTTGG TAAAAAGGCT TACCGGGCTT
CCCGTGGTTA CTACAGTTCA CAGCGACTAC AGGCTTGATT ATTTGCAAAA TATCTTTAAA
ATGTTCTCCT TCGGCCTTAT AAACATGGTT TCATTGAGAT TCATTGACTT TCATATAGCC
GTTTCCAAAA ACTTCAAGAC AATGCTTATT GAAAGAAAGT TCAGTCCTCA AAACATATTT
ACCGTTTACA ATGGCATTAA TTTCAACCAG GAAATTAATC CTTTGCCAAA AGAGGAGTTT
TTAAAAAAAT ACAACCTGAA ATTTGGGGAA AACGATGTCA TAATCGGAAT ACTGGCCCGT
CTTGACCCTG TAAAAGGCCT GGATACATTC CTTAAGGCTG CAAATGCCGT CATTAAAACC
AATCCGACGG CAAGGTTTTT GATAGCAGGA GACGGACCCG AGCGAAAATC CCTCGAAAAA
AAGGCCGCAT CCTACGGTCT TCAGAACAAC GTTTTCTTTT TAGGTTTCGT AAACAAACCT
TACGATTTTT TAAACTCTAT AGACATAAAC ACTCTTACTT CGCTGAGCGA AAGTTTTCCC
TATGCAATCC TTGAAGGCTC TCTGCTGAAA AAAGCCACTA TAAGCAGCAA TGTGGGAGGC
ATTTCAGACC TTATTGAAAG CGGAATAAAT GGTTTTCTCT TTGAGCCGGG AGATTACGAA
ACTTTGGCCG GCCACATATT GACCCTCATA AATGACCCCG CACTGAGAAA AAAAATGGGG
GAAAAAATTC ATGAAAAGGC CAGTTCGCAC TTTTCTTTGG ACAATATGTG CAAAACACAG
CTTGACATAT ATGAAACAAT TCTTCTTCGG AGTTTTAGAA ACAGCAGGTC CAAATATCGC
TACGATGCAA TTATCTCAGG TTATTACGGC TTTAAAAACA TCGGAGATGA CGCCATGCTT
ATGGCTATAA TTGACAATCT TCGCATGTAT CGAAGAGATT TAAGAATTTT GGTCCTGTCC
AGAAATCCCT TGGAAACAGG ACTTGTATAT AATGTTGATT CAATAAACAG GTTCAACCTC
CTTAAAATCC TTCTCATCAT GAGGAATTCA AAACTTTTTA TAAACGGAGG AGGAAGCCTG
ATTCAGGACA ACACCAGTAC CCGTTCCCTT ATATATTATC TCGGAATGAT CTGGCTTGCA
AAAAAAATGG GTATGAAGGT GATGATTTAC GCCAACGGCA TAGGGCCTTT GAACAAGGAA
AAAAACCGGA AGCTTACAAA GAAAATTGTA AACCGGGTGG ATGTCATCAC TTTGAGAGAA
AAGTTGTCCT ATGAAGAATT AAACAATCTT AAAATTCAAA GTCCCAGGAT TAAGGTAACT
GCCGACCCGG CTTTTACCAT AATACCCGAA AAAATCGAGC GTGTAAATCA GCTTCTCATA
GATGAGGGAA TTGACCCGAA CGAACAACTT GTCGGCATAT CCGTGAGAAA ATGGGGCGAA
CATGAAAAAT ATGAGACCAC AATTGCGGAA CTTGCGGATT ATATAGTCGA AAAATACGGT
ATGAAACCCC TTTTTATAGC AATGCACTAT CCGGAAGACC TTGCAATAAT TCAAAATATA
ACTTCGAAAA TGAAAAACAA AAGCTTTGTA ATAACCAATA AACCTACTGT TTCAGAAATG
CTTGGAATAA TCGGCAAAAC TCAAATGCTT ATCGGAATGC GCCTTCATGC TCTTATTTTT
GCCGCAAGCC TCGGAATACC CGTGGTGGGA ATGGTGTACG AACCCAAGGT TGAAGGTTTT
ATGCAGTATA TCAACCAGGC ATCGGCGGGA CATGTAAACT CCCTGGAACT GGAACATATG
AAAAAAATAG TGGATGAAAC CTGGGAAAAC AGGGAAGCCA TAAAAAAAGA GCTGGAAAAA
AACACTGCGG TTCTTAAAGA CAAGGCTCTT GAGAACGCTA AAATTGCAAT TGAAATGATT
GATGAAAAAC CTCTTTAA
 
Protein sequence
MKVLHLIGGG DVGGAKSHVL SLVRELGKHI NVKLISFRTG AFADDARAMG INVEVVNTGT 
IFSDVRKVLR IVREEGYELI HSHGAKANMI AVLVKRLTGL PVVTTVHSDY RLDYLQNIFK
MFSFGLINMV SLRFIDFHIA VSKNFKTMLI ERKFSPQNIF TVYNGINFNQ EINPLPKEEF
LKKYNLKFGE NDVIIGILAR LDPVKGLDTF LKAANAVIKT NPTARFLIAG DGPERKSLEK
KAASYGLQNN VFFLGFVNKP YDFLNSIDIN TLTSLSESFP YAILEGSLLK KATISSNVGG
ISDLIESGIN GFLFEPGDYE TLAGHILTLI NDPALRKKMG EKIHEKASSH FSLDNMCKTQ
LDIYETILLR SFRNSRSKYR YDAIISGYYG FKNIGDDAML MAIIDNLRMY RRDLRILVLS
RNPLETGLVY NVDSINRFNL LKILLIMRNS KLFINGGGSL IQDNTSTRSL IYYLGMIWLA
KKMGMKVMIY ANGIGPLNKE KNRKLTKKIV NRVDVITLRE KLSYEELNNL KIQSPRIKVT
ADPAFTIIPE KIERVNQLLI DEGIDPNEQL VGISVRKWGE HEKYETTIAE LADYIVEKYG
MKPLFIAMHY PEDLAIIQNI TSKMKNKSFV ITNKPTVSEM LGIIGKTQML IGMRLHALIF
AASLGIPVVG MVYEPKVEGF MQYINQASAG HVNSLELEHM KKIVDETWEN REAIKKELEK
NTAVLKDKAL ENAKIAIEMI DEKPL