Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2702 |
Symbol | |
ID | 4810696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3187059 |
End bp | 3189296 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108121 |
Product | polysaccharide pyruvyl transferase |
Protein accession | YP_001039094 |
Protein GI | 125975184 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG0438] Glycosyltransferase [COG2327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03609] polysaccharide pyruvyl transferase CsaB |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000746297 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTC TTCATTTAAT AGGCGGAGGA GACGTTGGCG GAGCAAAAAG CCATGTTCTC TCTCTGGTCA GGGAACTTGG CAAACATATA AATGTAAAAC TTATAAGTTT CAGGACAGGT GCTTTTGCCG ATGACGCCCG TGCAATGGGT ATAAACGTGG AAGTTGTAAA TACAGGCACC ATTTTTTCTG ATGTGCGTAA AGTTTTAAGA ATTGTAAGGG AGGAAGGGTA CGAGCTTATT CATTCCCATG GTGCAAAAGC CAACATGATT GCCGTTTTGG TAAAAAGGCT TACCGGGCTT CCCGTGGTTA CTACAGTTCA CAGCGACTAC AGGCTTGATT ATTTGCAAAA TATCTTTAAA ATGTTCTCCT TCGGCCTTAT AAACATGGTT TCATTGAGAT TCATTGACTT TCATATAGCC GTTTCCAAAA ACTTCAAGAC AATGCTTATT GAAAGAAAGT TCAGTCCTCA AAACATATTT ACCGTTTACA ATGGCATTAA TTTCAACCAG GAAATTAATC CTTTGCCAAA AGAGGAGTTT TTAAAAAAAT ACAACCTGAA ATTTGGGGAA AACGATGTCA TAATCGGAAT ACTGGCCCGT CTTGACCCTG TAAAAGGCCT GGATACATTC CTTAAGGCTG CAAATGCCGT CATTAAAACC AATCCGACGG CAAGGTTTTT GATAGCAGGA GACGGACCCG AGCGAAAATC CCTCGAAAAA AAGGCCGCAT CCTACGGTCT TCAGAACAAC GTTTTCTTTT TAGGTTTCGT AAACAAACCT TACGATTTTT TAAACTCTAT AGACATAAAC ACTCTTACTT CGCTGAGCGA AAGTTTTCCC TATGCAATCC TTGAAGGCTC TCTGCTGAAA AAAGCCACTA TAAGCAGCAA TGTGGGAGGC ATTTCAGACC TTATTGAAAG CGGAATAAAT GGTTTTCTCT TTGAGCCGGG AGATTACGAA ACTTTGGCCG GCCACATATT GACCCTCATA AATGACCCCG CACTGAGAAA AAAAATGGGG GAAAAAATTC ATGAAAAGGC CAGTTCGCAC TTTTCTTTGG ACAATATGTG CAAAACACAG CTTGACATAT ATGAAACAAT TCTTCTTCGG AGTTTTAGAA ACAGCAGGTC CAAATATCGC TACGATGCAA TTATCTCAGG TTATTACGGC TTTAAAAACA TCGGAGATGA CGCCATGCTT ATGGCTATAA TTGACAATCT TCGCATGTAT CGAAGAGATT TAAGAATTTT GGTCCTGTCC AGAAATCCCT TGGAAACAGG ACTTGTATAT AATGTTGATT CAATAAACAG GTTCAACCTC CTTAAAATCC TTCTCATCAT GAGGAATTCA AAACTTTTTA TAAACGGAGG AGGAAGCCTG ATTCAGGACA ACACCAGTAC CCGTTCCCTT ATATATTATC TCGGAATGAT CTGGCTTGCA AAAAAAATGG GTATGAAGGT GATGATTTAC GCCAACGGCA TAGGGCCTTT GAACAAGGAA AAAAACCGGA AGCTTACAAA GAAAATTGTA AACCGGGTGG ATGTCATCAC TTTGAGAGAA AAGTTGTCCT ATGAAGAATT AAACAATCTT AAAATTCAAA GTCCCAGGAT TAAGGTAACT GCCGACCCGG CTTTTACCAT AATACCCGAA AAAATCGAGC GTGTAAATCA GCTTCTCATA GATGAGGGAA TTGACCCGAA CGAACAACTT GTCGGCATAT CCGTGAGAAA ATGGGGCGAA CATGAAAAAT ATGAGACCAC AATTGCGGAA CTTGCGGATT ATATAGTCGA AAAATACGGT ATGAAACCCC TTTTTATAGC AATGCACTAT CCGGAAGACC TTGCAATAAT TCAAAATATA ACTTCGAAAA TGAAAAACAA AAGCTTTGTA ATAACCAATA AACCTACTGT TTCAGAAATG CTTGGAATAA TCGGCAAAAC TCAAATGCTT ATCGGAATGC GCCTTCATGC TCTTATTTTT GCCGCAAGCC TCGGAATACC CGTGGTGGGA ATGGTGTACG AACCCAAGGT TGAAGGTTTT ATGCAGTATA TCAACCAGGC ATCGGCGGGA CATGTAAACT CCCTGGAACT GGAACATATG AAAAAAATAG TGGATGAAAC CTGGGAAAAC AGGGAAGCCA TAAAAAAAGA GCTGGAAAAA AACACTGCGG TTCTTAAAGA CAAGGCTCTT GAGAACGCTA AAATTGCAAT TGAAATGATT GATGAAAAAC CTCTTTAA
|
Protein sequence | MKVLHLIGGG DVGGAKSHVL SLVRELGKHI NVKLISFRTG AFADDARAMG INVEVVNTGT IFSDVRKVLR IVREEGYELI HSHGAKANMI AVLVKRLTGL PVVTTVHSDY RLDYLQNIFK MFSFGLINMV SLRFIDFHIA VSKNFKTMLI ERKFSPQNIF TVYNGINFNQ EINPLPKEEF LKKYNLKFGE NDVIIGILAR LDPVKGLDTF LKAANAVIKT NPTARFLIAG DGPERKSLEK KAASYGLQNN VFFLGFVNKP YDFLNSIDIN TLTSLSESFP YAILEGSLLK KATISSNVGG ISDLIESGIN GFLFEPGDYE TLAGHILTLI NDPALRKKMG EKIHEKASSH FSLDNMCKTQ LDIYETILLR SFRNSRSKYR YDAIISGYYG FKNIGDDAML MAIIDNLRMY RRDLRILVLS RNPLETGLVY NVDSINRFNL LKILLIMRNS KLFINGGGSL IQDNTSTRSL IYYLGMIWLA KKMGMKVMIY ANGIGPLNKE KNRKLTKKIV NRVDVITLRE KLSYEELNNL KIQSPRIKVT ADPAFTIIPE KIERVNQLLI DEGIDPNEQL VGISVRKWGE HEKYETTIAE LADYIVEKYG MKPLFIAMHY PEDLAIIQNI TSKMKNKSFV ITNKPTVSEM LGIIGKTQML IGMRLHALIF AASLGIPVVG MVYEPKVEGF MQYINQASAG HVNSLELEHM KKIVDETWEN REAIKKELEK NTAVLKDKAL ENAKIAIEMI DEKPL
|
| |