Gene Cthe_0910 details


Gene Information

Locus tag: Cthe_0910
Symbol:
ID: 4810531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1086340
End bp: 1088007
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 40%
IMG OID: 640106329
Product: extracellular solute-binding protein
Protein accession: YP_001037337
Protein GI: 125973427
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
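
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; both assumptions are consistent with the reported values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1086340
end_bp = 1088007
reported_gene_length = 1668      # bp
reported_protein_length = 555    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues in a complete CDS: one per codon, minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1668 555
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.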


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.463169
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGAAAAA GAGCCTTGTT GGTTGTTGTG CTGGTGATTG CAAGTATTGT TTTTTCTGCT 
TGCAGCATTG ATTTAAGCAG ATTGTTGGAA GAAGGAAACA ATGACTATGA TTTTTTAAAT
AATGGTGTGG ATGATGTACT GGATACAGGG CCTGTAAAAA ACGGTGTTTT AAATTTGTTT
TCCACCGAGC CCGACACTCT CAATCCCATA TTGACTTCCA ATGCCTATGT AAAGGAGTAT
TCCTGCTTTG TGTTTGAAAG CCTTGTGAGG TTGGACGGAA ATCAGAAGGC TGTGCCGCTT
CTGGCCGAAA GCTGGGAAGG ATCCGATGAT GGATTGGTAT GGACATTTTA TTTAAAAGAA
AATATTTATT GGCACGATGG TATACCTTTT TCAGCGGAAG ACGTGGAATT TACGGCAAGC
GTTATAATGA ATGCGGGGGT AAACAGTCCG TACAAGACAT GTTTTGAAAA TGTGGAAAGC
TTTCTTGCTC AAGACAGCAG GACATTTAAG GTGCTTTTAA AAAGCCCCAA TTCTTTCACT
CCCGAACTTA TGACTTTTCC CGTAATTCCG AAGCATTATT TTCTTGGGGA AGATATTCTG
ACCACTCCCA AAAACAACAG TCCGATAGGT ACCGGACCGT ATAAGTTCGC GGAATACAGG
CAAGGTGAGT ATATCAGGCT TACATGTAAT GAAAATTGGT GGAACAAAGA TGATGGTATT
GAAAACGGAA TTGACCTTCC CTATATCCAG GAGGTCAATA TAAAAATATA TGGCAAAAAT
CAGTCTGTAA TGAATGCTTT TCAGTCACAG CAAGTGGATG TCATTACTTT AGACAGGACC
AATTGGACGG GCTACAGTGG AAGGTCCGAT ATAATTTTAA AAAAATATGT AAGTAATGAA
TTTGAGTTTG TTGCATTTAA TCTCTCAAAC AAAATACTGA AGGAAAGGGA AGTAAGGACG
GCTATAGCAT ATGCTGTTGA CAGGGAGCAG ATTATAAGCA GTATTTTGCC GGGGGAGGCT
GTGGCGTCGG ATTTGCCCGT TATTCCCGAT ACATGGCTCA ATGATACCAA TGTTGTATCT
TATGAAAGAG ATGTGGAAAA GGCAAAGCAG ATACTTTCAG ATGCCGGATG GAAAGAAAGC
AACGGTATAT TTTACAAAAG AATCAACGGT GTTAACACTC CGCTTTCACT GGAGCTTATG
GTAAATGATG ACAACGAAGT TCGGCTGTCC GTGGCGGAAA TGATAAAAGA ACAGCTGAAA
GAGGCGGGAA TAGAAATAGA AATTAAAAAG GTCAAATGGG AGGACGAACT TAACGGAGTA
CAGAGCGGTA AATTTGACAT GGCGCTTATC GGATGCACTG TGGCATCCAT TCCGGATATA
TCTTTTCTGT ATTCATCAGC ACAAATCGGG ACAGGGCTTA ATATTTCCGG TTACAGCAAT
GAAGAGGTTG ACCGGTATCT TACCTTGATT TTGAAGGAAA AGGATCCGTC AATGAAGAAA
GCTTATTTTA TTAACATGAA AGAAATAATA AATCGGGATG TGCCTTGTTT GGGATTGTAT
TTTTATAATA ATATGGTTAT ATACAACAAA AGGTTAAGAG GAGAGTTCAA TCCCAGCATA
TGGGGCAAAT ATTACGATTT TACCCGGTGG TATATACCTG TTGAGTAA
 
Protein sequence
MRKRALLVVV LVIASIVFSA CSIDLSRLLE EGNNDYDFLN NGVDDVLDTG PVKNGVLNLF 
STEPDTLNPI LTSNAYVKEY SCFVFESLVR LDGNQKAVPL LAESWEGSDD GLVWTFYLKE
NIYWHDGIPF SAEDVEFTAS VIMNAGVNSP YKTCFENVES FLAQDSRTFK VLLKSPNSFT
PELMTFPVIP KHYFLGEDIL TTPKNNSPIG TGPYKFAEYR QGEYIRLTCN ENWWNKDDGI
ENGIDLPYIQ EVNIKIYGKN QSVMNAFQSQ QVDVITLDRT NWTGYSGRSD IILKKYVSNE
FEFVAFNLSN KILKEREVRT AIAYAVDREQ IISSILPGEA VASDLPVIPD TWLNDTNVVS
YERDVEKAKQ ILSDAGWKES NGIFYKRING VNTPLSLELM VNDDNEVRLS VAEMIKEQLK
EAGIEIEIKK VKWEDELNGV QSGKFDMALI GCTVASIPDI SFLYSSAQIG TGLNISGYSN
EEVDRYLTLI LKEKDPSMKK AYFINMKEII NRDVPCLGLY FYNNMVIYNK RLRGEFNPSI
WGKYYDFTRW YIPVE
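
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the alternative start codons and an initiating codon is read as methionine. The sketch below illustrates that relationship on the first line of the gene sequence only. The codon table is built from the NCBI compact amino-acid string for table 11, and `translate_cds` is a hypothetical helper written for this example, not a tool used to produce the record.

```python
from itertools import product

# NCBI genetic code table 11 (bacterial, archaeal and plant plastid),
# codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; each is read as M at the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "TTGAGAAAAA GAGCCTTGTT GGTTGTTGTG CTGGTGATTG CAAGTATTGT TTTTTCTGCT"
print(translate_cds(first_line))  # MRKRALLVVVLVIASIVFSA
```

The result matches the first twenty residues of the protein sequence (MRKRALLVVV LVIASIVFSA). Passing the full gene sequence to the same function should, under the same assumptions, reproduce the whole protein up to the terminal TAA stop codon.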