Gene Cthe_2989 details


Gene Information

Locus tag: Cthe_2989
Symbol:
ID: 4811137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3506208
End bp: 3509162
Gene Length: 2955 bp
Protein Length: 984 aa
Translation table: 11
GC content: 42%
IMG OID: 640108410
Product: glycosyltransferase 36
Protein accession: YP_001039378
Protein GI: 125975468
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3459] Cellobiose phosphorylase
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
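
The start, end, and length fields in this record can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates (a common convention that this page does not state) and that the gene length includes the stop codon. The function names are illustrative only and do not belong to any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3506208, 3509162)
print(length)                  # 2955
print(protein_length(length))  # 984
```

Under these assumptions both results match the Gene Length and Protein Length fields of the record.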


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.684465
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTACTA AAGTAACAGC GAGAAATAAT AAGATAACAC CTGTTGAGTT GTTGAATCAA 
AAGTTTGGAA ACAAGATTAA TCTGGGCAAT TTTGCGGATG CTGTTTTTAC TGACGCGGCG
TTCAAAAATG TGGCAGGCAT TGCAAATTTG CCTATGAAAG CGCCGGTAAT GCAGGTTCTT
ATGGAAAACT GCATTGTTTC AAAATATCTG AAACAGTTTG TACCTGACCG GTCTGTTTGT
TTTGTTGAAG AAGGACAGAA ATTTTACATA GTACTTGAAG ACGGTCAAAA AATTGAAGTG
CCTGAGGATG TAAACAAGGC TCTCAGGGCT ACGGTAAGTG ATGTAAAGCA TTGGGCAGGT
TATTTGACGG AAGACGGGGA GCATGTAATC GACCTTTTAA AACCGGCTCC GGGTCCGCAT
TTTTATGTGA ATTTGCTTAT AGGAAACAGG CTTGGTTTTA AAAGGACATT GCAGACAACT
CCGAAAAGTG TGGTTGACAG GTTCGGAAGA GGTTCGTTCC GTTCCCATGC TGCAACCCAG
GTGCTGGCAA CGAGATTTGA CATGCGCCAG GAGGAAAACG GTTTTCCTGC GAACAGACAG
TTCTATTTGT ATGAAGACGG CAAACAGATT TTTTATTCCG CATTAATTGA TGACAACATT
GTTGAGGCTA CCTGCAAACA TTCATGCAAT CGTACGGTAA TAAAATATAA GACGGCATGT
AATCTGGAAA TTACAAGAAC CATCTTCCTG GTGCCTCACA AGAAGGGATT CCCTCTTGCA
ACTGAATTAC AGAGAATTGA AATAAAGAAT GCGTCGGACA AGGCAAGGAA TTTGTCCATT
ACATATACGG GAATGTTTGG AACGGGTGCC GTTCATGCGA TATTTGAGGA CGTAACATAC
ACAAATGTTA TCATGCAAAG TGCCGCCCTT TACAATGACA AGGGTGAGTT TATCGGAATA
ACTCCTGATT ATTATCCTGA AGAATTTAAA CAGGATACAA GATTTGTCAC GATGATTGTC
CGCAACGGGG ACGAGAAATC ATTCCCGCAG AGTTTCTGCA CGGACTACAA CGACTTTGTA
GGCACAGGAA CATTGGAGCA TCCGGCAGGC GGATGTAATT TGAACAACAA GCTGAACCGC
AAAGGTCCGG GATTCTTTGC CCTGGGTGCG CCGTTTACGG TTGAACCGGG CAAGACAGTC
ATAATAGACA CTTTCACCGG TTTGTCTTCG AGCAAGGATA ATGAAAATTA CAGCGATGCA
GTAATGCTCA GGGAACTGGA CAATTTGCTG CGCTATTTTG AAAAAAGCGA ATCTGTGGAA
GAAACATTGA ATGAAATTAT CAACTTCCAT GAAAATTATG GCAAATACTT CCAGTTCAAT
ACCGGAAACA AGCTGTTTGA TTCCGGATTT AACAGGAATT TGGCGTTCCA GGTATTGTAT
CAGACATTTA TGTCCCGTTC TTTCGGACAA ACACAGAAAG GATATCGTGA AATCGGATTC
AGGGAAATTC AGGACCTGTT TGCATCCATG TACTATTTTA TAAACATAGG ATATCAGGAT
TTTGTAAAGG AATTGTTGTT TGAGTGGACG GCAAACGTAT ATAAAATGGG TTATGCAAAC
CACAACTTCT ATTGGGTGGG CAAACAGCCG GGACTGTATT CCGATGACAG CCTGTGGCTC
TTGCAGGCAT ATTACAGATA TATTATTTAT ACAAAAGATA CTTCGGTATT AAATGAGGAA
GTACCGGTTG CCGACGGAAA CAATGAAAAG AGGGCTGTAA GAGAAACGCT GAAGGCTATC
ATCCAGTATT CCGCTTGTAT TTCTGTCGGT GATCATGGCC TTCCGCTGCT GGATCTTGCA
GACTGGAATG ACTGCCTGAA GATTGACAGC AACAGTATAG ACGGTGCAAC CAAAGAAAAG
TTGTACTACG AACAGTTGAA GAAGACAAAC GGCAAATATG GAGATCGCTT TATGAGCGAT
TATTCGGAAA GCGTGATGAA TGCTTTCCTC TTGAAGTTGG CAATTGACCA TTTGGCTGAA
ATTGCAACTT TGGATAATGA CACTCAACTG GCCCAACAAA TGAGTGAATT GTCAAAAGAG
GTTACAGACC GCATTCAGAA ACATGCCTGG AAAGAAAACT TCTTTGCCCG TGTTCTTATA
AACCGTTACA AAGACGGTTC CTATACTTAT TTGGGAGCAA AGGGCGACAA GCTTTCCGCT
GATCCGAACA TTGACGGCGT GTACTTCTTA AACAGTTTTG CATGGTCGGT GCTGTCCGAT
GTTGCAACCG ATGAGCAAAT AGCAATAATG GTGGATGTCA TCAAAAAATA TTTGTTAACT
CCGTACGGCT TGCGTTTGGT AACACCTGCC GATTTGAACA AAATTGCAAA TGATACTGCA
ACAGGGCATT ACTTCTTTGG TGACAGGGAA AACGGTGCTG TCTTCAAACA TGCTTCAATG
ATGGCAGTTG TTGCGCTTAT CAAGGCTGCA AAGAAAGTAA AAGACAATGA GCTTGCCAAA
GAAATGGCAA GAATAGCGTA CTTTATGATA GACTTGGTAC TGCCATACAA GAACCTTGAA
AATCCGTTCC AGGTTGCAGG AAATCCAAGG ATATGCACTC AATATATCAA TACTGACACA
GGAGAAAATA TTGGACCTTT GTTGAGCGGG ACGGCAACCT GGCTTAACTT GAATCTTATT
TCCCTGGCAG GAATAGAGTA CACCAGGGAT GGAATTTCCT TCAATCCGAT ACTTCGGGAA
GAGGAAACTC AGTTGAATTT CACTTTGAAA GCGCCGAAAT GCTCATATAA GTTTAGTATT
ACAAAACCGG TTGGTTTTGC TAGAATGGAA AGTTCGGAAT ATGAACTTTT TGTTGATGGA
CAAAAGATTG ACAACACTGT CATTCCAATG TATACGGATG AAAAAGAACA TATAGTGACT
CTTAAGTTTA AATAA
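
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or base composition can be computed. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and the example string contains only the first two and last two blocks of the listing, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Illustrative fragment only: the first two and last two blocks of the listing.
example = "ATGATTACTA AAGTAACAGC\nCTTAAGTTTA AATAA"
seq = clean_sequence(example)
print(len(seq))         # 35
print(gc_percent(seq))  # GC percentage of the fragment, not of the gene
```

Applied to the complete listing, `len(clean_sequence(...))` would be expected to match the Gene Length field, and `gc_percent` to land near the reported GC content, assuming the record's percentage is rounded.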
 
Protein sequence
MITKVTARNN KITPVELLNQ KFGNKINLGN FADAVFTDAA FKNVAGIANL PMKAPVMQVL 
MENCIVSKYL KQFVPDRSVC FVEEGQKFYI VLEDGQKIEV PEDVNKALRA TVSDVKHWAG
YLTEDGEHVI DLLKPAPGPH FYVNLLIGNR LGFKRTLQTT PKSVVDRFGR GSFRSHAATQ
VLATRFDMRQ EENGFPANRQ FYLYEDGKQI FYSALIDDNI VEATCKHSCN RTVIKYKTAC
NLEITRTIFL VPHKKGFPLA TELQRIEIKN ASDKARNLSI TYTGMFGTGA VHAIFEDVTY
TNVIMQSAAL YNDKGEFIGI TPDYYPEEFK QDTRFVTMIV RNGDEKSFPQ SFCTDYNDFV
GTGTLEHPAG GCNLNNKLNR KGPGFFALGA PFTVEPGKTV IIDTFTGLSS SKDNENYSDA
VMLRELDNLL RYFEKSESVE ETLNEIINFH ENYGKYFQFN TGNKLFDSGF NRNLAFQVLY
QTFMSRSFGQ TQKGYREIGF REIQDLFASM YYFINIGYQD FVKELLFEWT ANVYKMGYAN
HNFYWVGKQP GLYSDDSLWL LQAYYRYIIY TKDTSVLNEE VPVADGNNEK RAVRETLKAI
IQYSACISVG DHGLPLLDLA DWNDCLKIDS NSIDGATKEK LYYEQLKKTN GKYGDRFMSD
YSESVMNAFL LKLAIDHLAE IATLDNDTQL AQQMSELSKE VTDRIQKHAW KENFFARVLI
NRYKDGSYTY LGAKGDKLSA DPNIDGVYFL NSFAWSVLSD VATDEQIAIM VDVIKKYLLT
PYGLRLVTPA DLNKIANDTA TGHYFFGDRE NGAVFKHASM MAVVALIKAA KKVKDNELAK
EMARIAYFMI DLVLPYKNLE NPFQVAGNPR ICTQYINTDT GENIGPLLSG TATWLNLNLI
SLAGIEYTRD GISFNPILRE EETQLNFTLK APKCSYKFSI TKPVGFARME SSEYELFVDG
QKIDNTVIPM YTDEKEHIVT LKFK