Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2989 |
Symbol | |
ID | 4811137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3506208 |
End bp | 3509162 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108410 |
Product | glycosyltransferase 36 |
Protein accession | YP_001039378 |
Protein GI | 125975468 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.684465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACTA AAGTAACAGC GAGAAATAAT AAGATAACAC CTGTTGAGTT GTTGAATCAA AAGTTTGGAA ACAAGATTAA TCTGGGCAAT TTTGCGGATG CTGTTTTTAC TGACGCGGCG TTCAAAAATG TGGCAGGCAT TGCAAATTTG CCTATGAAAG CGCCGGTAAT GCAGGTTCTT ATGGAAAACT GCATTGTTTC AAAATATCTG AAACAGTTTG TACCTGACCG GTCTGTTTGT TTTGTTGAAG AAGGACAGAA ATTTTACATA GTACTTGAAG ACGGTCAAAA AATTGAAGTG CCTGAGGATG TAAACAAGGC TCTCAGGGCT ACGGTAAGTG ATGTAAAGCA TTGGGCAGGT TATTTGACGG AAGACGGGGA GCATGTAATC GACCTTTTAA AACCGGCTCC GGGTCCGCAT TTTTATGTGA ATTTGCTTAT AGGAAACAGG CTTGGTTTTA AAAGGACATT GCAGACAACT CCGAAAAGTG TGGTTGACAG GTTCGGAAGA GGTTCGTTCC GTTCCCATGC TGCAACCCAG GTGCTGGCAA CGAGATTTGA CATGCGCCAG GAGGAAAACG GTTTTCCTGC GAACAGACAG TTCTATTTGT ATGAAGACGG CAAACAGATT TTTTATTCCG CATTAATTGA TGACAACATT GTTGAGGCTA CCTGCAAACA TTCATGCAAT CGTACGGTAA TAAAATATAA GACGGCATGT AATCTGGAAA TTACAAGAAC CATCTTCCTG GTGCCTCACA AGAAGGGATT CCCTCTTGCA ACTGAATTAC AGAGAATTGA AATAAAGAAT GCGTCGGACA AGGCAAGGAA TTTGTCCATT ACATATACGG GAATGTTTGG AACGGGTGCC GTTCATGCGA TATTTGAGGA CGTAACATAC ACAAATGTTA TCATGCAAAG TGCCGCCCTT TACAATGACA AGGGTGAGTT TATCGGAATA ACTCCTGATT ATTATCCTGA AGAATTTAAA CAGGATACAA GATTTGTCAC GATGATTGTC CGCAACGGGG ACGAGAAATC ATTCCCGCAG AGTTTCTGCA CGGACTACAA CGACTTTGTA GGCACAGGAA CATTGGAGCA TCCGGCAGGC GGATGTAATT TGAACAACAA GCTGAACCGC AAAGGTCCGG GATTCTTTGC CCTGGGTGCG CCGTTTACGG TTGAACCGGG CAAGACAGTC ATAATAGACA CTTTCACCGG TTTGTCTTCG AGCAAGGATA ATGAAAATTA CAGCGATGCA GTAATGCTCA GGGAACTGGA CAATTTGCTG CGCTATTTTG AAAAAAGCGA ATCTGTGGAA GAAACATTGA ATGAAATTAT CAACTTCCAT GAAAATTATG GCAAATACTT CCAGTTCAAT ACCGGAAACA AGCTGTTTGA TTCCGGATTT AACAGGAATT TGGCGTTCCA GGTATTGTAT CAGACATTTA TGTCCCGTTC TTTCGGACAA ACACAGAAAG GATATCGTGA AATCGGATTC AGGGAAATTC AGGACCTGTT TGCATCCATG TACTATTTTA TAAACATAGG ATATCAGGAT TTTGTAAAGG AATTGTTGTT TGAGTGGACG GCAAACGTAT ATAAAATGGG TTATGCAAAC CACAACTTCT ATTGGGTGGG CAAACAGCCG GGACTGTATT CCGATGACAG CCTGTGGCTC TTGCAGGCAT ATTACAGATA TATTATTTAT ACAAAAGATA CTTCGGTATT AAATGAGGAA GTACCGGTTG CCGACGGAAA CAATGAAAAG AGGGCTGTAA GAGAAACGCT GAAGGCTATC ATCCAGTATT CCGCTTGTAT TTCTGTCGGT GATCATGGCC TTCCGCTGCT GGATCTTGCA GACTGGAATG ACTGCCTGAA GATTGACAGC AACAGTATAG ACGGTGCAAC CAAAGAAAAG TTGTACTACG AACAGTTGAA GAAGACAAAC GGCAAATATG GAGATCGCTT TATGAGCGAT TATTCGGAAA GCGTGATGAA TGCTTTCCTC TTGAAGTTGG CAATTGACCA TTTGGCTGAA ATTGCAACTT TGGATAATGA CACTCAACTG GCCCAACAAA TGAGTGAATT GTCAAAAGAG GTTACAGACC GCATTCAGAA ACATGCCTGG AAAGAAAACT TCTTTGCCCG TGTTCTTATA AACCGTTACA AAGACGGTTC CTATACTTAT TTGGGAGCAA AGGGCGACAA GCTTTCCGCT GATCCGAACA TTGACGGCGT GTACTTCTTA AACAGTTTTG CATGGTCGGT GCTGTCCGAT GTTGCAACCG ATGAGCAAAT AGCAATAATG GTGGATGTCA TCAAAAAATA TTTGTTAACT CCGTACGGCT TGCGTTTGGT AACACCTGCC GATTTGAACA AAATTGCAAA TGATACTGCA ACAGGGCATT ACTTCTTTGG TGACAGGGAA AACGGTGCTG TCTTCAAACA TGCTTCAATG ATGGCAGTTG TTGCGCTTAT CAAGGCTGCA AAGAAAGTAA AAGACAATGA GCTTGCCAAA GAAATGGCAA GAATAGCGTA CTTTATGATA GACTTGGTAC TGCCATACAA GAACCTTGAA AATCCGTTCC AGGTTGCAGG AAATCCAAGG ATATGCACTC AATATATCAA TACTGACACA GGAGAAAATA TTGGACCTTT GTTGAGCGGG ACGGCAACCT GGCTTAACTT GAATCTTATT TCCCTGGCAG GAATAGAGTA CACCAGGGAT GGAATTTCCT TCAATCCGAT ACTTCGGGAA GAGGAAACTC AGTTGAATTT CACTTTGAAA GCGCCGAAAT GCTCATATAA GTTTAGTATT ACAAAACCGG TTGGTTTTGC TAGAATGGAA AGTTCGGAAT ATGAACTTTT TGTTGATGGA CAAAAGATTG ACAACACTGT CATTCCAATG TATACGGATG AAAAAGAACA TATAGTGACT CTTAAGTTTA AATAA
|
Protein sequence | MITKVTARNN KITPVELLNQ KFGNKINLGN FADAVFTDAA FKNVAGIANL PMKAPVMQVL MENCIVSKYL KQFVPDRSVC FVEEGQKFYI VLEDGQKIEV PEDVNKALRA TVSDVKHWAG YLTEDGEHVI DLLKPAPGPH FYVNLLIGNR LGFKRTLQTT PKSVVDRFGR GSFRSHAATQ VLATRFDMRQ EENGFPANRQ FYLYEDGKQI FYSALIDDNI VEATCKHSCN RTVIKYKTAC NLEITRTIFL VPHKKGFPLA TELQRIEIKN ASDKARNLSI TYTGMFGTGA VHAIFEDVTY TNVIMQSAAL YNDKGEFIGI TPDYYPEEFK QDTRFVTMIV RNGDEKSFPQ SFCTDYNDFV GTGTLEHPAG GCNLNNKLNR KGPGFFALGA PFTVEPGKTV IIDTFTGLSS SKDNENYSDA VMLRELDNLL RYFEKSESVE ETLNEIINFH ENYGKYFQFN TGNKLFDSGF NRNLAFQVLY QTFMSRSFGQ TQKGYREIGF REIQDLFASM YYFINIGYQD FVKELLFEWT ANVYKMGYAN HNFYWVGKQP GLYSDDSLWL LQAYYRYIIY TKDTSVLNEE VPVADGNNEK RAVRETLKAI IQYSACISVG DHGLPLLDLA DWNDCLKIDS NSIDGATKEK LYYEQLKKTN GKYGDRFMSD YSESVMNAFL LKLAIDHLAE IATLDNDTQL AQQMSELSKE VTDRIQKHAW KENFFARVLI NRYKDGSYTY LGAKGDKLSA DPNIDGVYFL NSFAWSVLSD VATDEQIAIM VDVIKKYLLT PYGLRLVTPA DLNKIANDTA TGHYFFGDRE NGAVFKHASM MAVVALIKAA KKVKDNELAK EMARIAYFMI DLVLPYKNLE NPFQVAGNPR ICTQYINTDT GENIGPLLSG TATWLNLNLI SLAGIEYTRD GISFNPILRE EETQLNFTLK APKCSYKFSI TKPVGFARME SSEYELFVDG QKIDNTVIPM YTDEKEHIVT LKFK
|
| |