Gene Cthe_2961 details


Gene Information

Locus tag: Cthe_2961
Symbol:
ID: 4810849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3478233
End bp: 3479858
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 43%
IMG OID: 640108383
Product: extracellular solute-binding protein
Protein accession: YP_001039351
Protein GI: 125975441
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
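
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3478233
END_BP = 3479858
GENE_LENGTH_BP = 1626
PROTEIN_LENGTH_AA = 541


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the gene minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```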


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAGAA TCTTGGCACT TACAATTACA CTTGTAATCG TTGCAACAGT ATTAACAGGC 
TGCGGAAAAA CACAGAACAA AACATCAACA TCAAACCAAG TCATTCGTGC GACTATTGCA
TCCGAGCCAA AAACCCTGGA TCCGTCCCGC AACAATGCAG TTGATGGCGG CAACTATATT
TTATGTGCAT TTGAAGGACT GACCACGCTA GGCAAAGACG GCACCATTGT TGCAGGAACG
GCTGAAAGAT GGGAAACAAG TGATGACGGG CTTGTTTGGA CATTCTACAT CCGCAAGGAT
GCAAAGTGGT CTGACGGCAA GGATGTAACT GCAAACGATT TTGTTTATTC CTGGAGAAGG
CTCGTTGATC CGGCAACAGC CGCCGATTAT GCGTATTACG CTTATTTCAT AAAAAACGGT
GAAAAAATCA ACGCCGGTGA AGCCGACGTC AGTACGTTGG GTGTTCGGGC TGTCAACGAT
AAAACCCTTG AGGTAACCCT TGAAAGCCCA TGTCCGTTCT TTACCGAAAT AGTTGCCTTC
CCTGCTCTCG TTCCTCTCAG GGAAGATATT ATATCCGCCA ATAAAGACAA GTGGGCGCTT
GAACCGTCTA CTTATATAGG CAACGGACCT TACAAACTGA CCAGCTGGGA CCATGATTCA
AAAATTGTAT TTGAAAAGAA TGAAAACTAC TGGGACAAAA ATAATGTAAT TGCACCTAAA
ATCGAATGGT ATCTGATGAA TGACCAGAAT GCCATCTTAA GCGCATTTAA AAACGGACAA
GTTGCCTATG CAAAGAATAT TCCTTCGGAT GAGCTGGCTG CTGAAAAAGC AGCAGGTAAT
CTGAAAATTT TTCCGTTAAT TGGAACATAT TATATAGACT TTGTAAACAA TAAACCTCCT
TTTAACGATG TAAGGGTAAG AAAAGCGTTT TCTCTGGCAA TTGACCGCAA CTACCTGGTA
GAAAACGTCA AAAAAGGTGG TGAAACTCCG GCAACAGCTT TTGTACCATA CGGTATAGCT
GACGTTAATC CGGAACCCGA CTTCCGCACA GTAGGCGGCG ACTATATTTC AGTAAAACCT
GAAGATTACG AAAAGAATGT GGCTGAAGCC AAAAGGCTTC TTGCTGAGGC AGGTTACCCG
GACGGAAAAG GATTCCCGAA AATTACTTTT GGTCTGAACT CAGGTGCGGG TCATGAGCCT
ATAGCTGAAG CTTTGCAGCA GATGTGGAAG GAAAATCTGG GTGTTGAAGT TGAAATCCTG
GCTCAGGAAT GGAATGTATT CCAGCAGTCA CGAAAAGACG GCGTTTACAA TATAAACAGA
AACGGTTGGA TCGGAGACTA TATGGATCCG TCAACTTTCA TGGACATATT TACAACCGGA
AACGGTCAGA ATAATGCCAT GTACAGCAAT CCAAAGTATG ATGAACTTAT TTCCGCAGCA
AGAAGAGAAA CTGACCCGGC TAAACGCATT CAAATGTACC ATGATGCGGA AAAGATACTT
ATGGATGACG CCGCAATAGC TCCTTTGTAC TTCTATACTG ATCCTATAGT CATATCACCG
AATCTTAAGG GAGTGTTACA TTCACAACTT GGTTTCGTAA TCTTCAAATG GGCATATTTT
GAATAG
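
The gene sequence above is printed in space-separated blocks of ten bases. As a rough sketch of how the length and GC content fields could be recomputed from it, the helpers below strip the whitespace and count G and C bases. The function names are hypothetical, and only the first line of the sequence is used here as sample input; pasting the full sequence into `clean_sequence` would be needed to compare against the reported gene length and GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAGAGAA TCTTGGCACT TACAATTACA CTTGTAATCG TTGCAACAGT ATTAACAGGC"
sample = clean_sequence(first_line)
print(len(sample), f"{100 * gc_fraction(sample):.0f}%")
```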
 
Protein sequence
MKRILALTIT LVIVATVLTG CGKTQNKTST SNQVIRATIA SEPKTLDPSR NNAVDGGNYI 
LCAFEGLTTL GKDGTIVAGT AERWETSDDG LVWTFYIRKD AKWSDGKDVT ANDFVYSWRR
LVDPATAADY AYYAYFIKNG EKINAGEADV STLGVRAVND KTLEVTLESP CPFFTEIVAF
PALVPLREDI ISANKDKWAL EPSTYIGNGP YKLTSWDHDS KIVFEKNENY WDKNNVIAPK
IEWYLMNDQN AILSAFKNGQ VAYAKNIPSD ELAAEKAAGN LKIFPLIGTY YIDFVNNKPP
FNDVRVRKAF SLAIDRNYLV ENVKKGGETP ATAFVPYGIA DVNPEPDFRT VGGDYISVKP
EDYEKNVAEA KRLLAEAGYP DGKGFPKITF GLNSGAGHEP IAEALQQMWK ENLGVEVEIL
AQEWNVFQQS RKDGVYNINR NGWIGDYMDP STFMDIFTTG NGQNNAMYSN PKYDELISAA
RRETDPAKRI QMYHDAEKIL MDDAAIAPLY FYTDPIVISP NLKGVLHSQL GFVIFKWAYF
E
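
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
# Assumption: table 11 assigns amino acids exactly as the standard code does.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGAAGAGAA TCTTGGCACT TACAATTACA CTTGTAATCG TTGCAACAGT ATTAACAGGC"
print(translate(first_line))
```

Translating the first line of the gene sequence gives the first twenty residues of the listed protein, and the final block `GAATAG` gives the closing `E` followed by the stop codon.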