Gene Cthe_2194 details

Gene Information

Locus tag: Cthe_2194
Symbol:
ID: 4811059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2616652
End bp: 2618157
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 44%
IMG OID: 640107600
Product: carbohydrate-binding family 6 protein
Protein accession: YP_001038589
Protein GI: 125974679
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2382] Enterochelin esterase and related enzymes
TIGRFAM ID:
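
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 2616652
end_bp = 2618157
gene_length_bp = 1506
protein_length_aa = 501


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```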


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.29628
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAGAA AGGTTCTTAG TGTATTATTA ATTTGCCTGG TGCTTATAGC ATGTTTGGGC 
ACCGCAGTAA ACATTTCATC GGCAGCATCG CTGCCAACTA TGCCGCCGTC GGGGTATGAC
CAGGTAAGGG GTGGCATCCA GAGAGGGCAG GTTGTTAATA TTTCTTATTA TTCCACAGCA
ACAAACGGTA CACGGCCCGC AAAAGTTTAT TTGCCACCGG GATACTCGAC CAGTAAAAGG
TATAGCGTTT TGTACCTATT GCATGGAATA GGGGGAAGCG AAGGCGATTG GTTTGCCGAT
TGGGGAGGCA GAGCCAGCAT AATTGCCGAT AATCTGATTG CAGAGGGAAA AATCAAGCCT
TTGATAATAG TTACACCCAA TACTAACGCA GCAGGACCTG GGATAGGTGA TGGTTACGAA
AACTTTACAA AGGATTTAAT TAATTGCCTT ATTCCCTATA TAGAATCACG CTATTCCGTT
TATACTGACC GTGAACATCG GGCAATTGCC GGTCTTTCAA TGGGAGGAGG TCAATCCTTT
AATATTGGTT TGACCAACCT GGATAAATTT GCCTATATTG GTCCTATTTC TTCAGCTCCG
AACACCTATC CCAATAACAG GCTGTTCCCC GATGGAGGAG CTGCTGCAAG GCAGAAGCTG
AAATTGCTCT TCATTGCATG CGGAACCAAT GATTCTCTGA TAGGATTCGG ACAAAGGGTA
CACGAATTTT GCGTTGCCAA TAATATTAAC CATATCTATT GGCTTATCCA GGGAGGAGGA
CACGATTATA ATGTTTGGAA AGCGGGTTTG TGGAACTTCC TCCAATTAGC GGAACAGGCA
GGATTAACAG ATTATAATGC GCCAACACCA CCGCCACCGG CTCCAAGGTC AGCTTTTACA
CGTATCGAAG CGGAAGACTT CGATAACATG TCGGGAATAG AAAATGAAAG TTGTAGTGAA
GGCGGACTGA ATATAGGTTA TATAGAGAAT GGGGATTATG TTGCTTACAG TAATATAGAT
TTTGGTAACG GAGCAAAGGA ATTTCAGGCC AGGGTGGCAA GTGCTACCAG TGGAGGAAAA
ATCGAGATAA GGCTTGACAG TATTACAGGT CCATTAATAG GAACGTGCTC GGTTTCAGGT
ACCGGCGGTT GGCAGCAATG GGTTGATGTG AAATGCGAGG TCAGCGGCGT AAGCGGAACT
CATGATCTCT ATTTGAAATT TACGGGTGGC AGCGGTTATC TGTTCAATAT AAACTGGTGG
AAGTTCACTC AGGCCGATTC AAACCCAACG CCAACACCAC CGCCCAATGA GAATTTGGGC
GATTTGAACG GAGACGGAAA TATAAACTCG ACAGACCTTC AGATTTTAAA GAAGCATTTA
CTCCGTATAA CTTTGCTTAC GGGAAAAGAA CTTTCCAATG CGGATGTAAC CAAAGACGGC
AAAGTAGATT CAACCGATTT AACTTTATTG AAAAGATATA TACTTCGGTT TGTAACGAAT
TTTTAG
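
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The sketch below is a minimal illustration, assuming the sequence is pasted as printed here (blocks of 10 bases separated by spaces and line breaks). The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above; paste the full sequence to check the whole gene.
first_row = "ATGTTAAGAA AGGTTCTTAG TGTATTATTA ATTTGCCTGG TGCTTATAGC ATGTTTGGGC"
print(f"{gc_percent(first_row):.1f}% GC over {len(clean_sequence(first_row))} bases")
```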
 
Protein sequence
MLRKVLSVLL ICLVLIACLG TAVNISSAAS LPTMPPSGYD QVRGGIQRGQ VVNISYYSTA 
TNGTRPAKVY LPPGYSTSKR YSVLYLLHGI GGSEGDWFAD WGGRASIIAD NLIAEGKIKP
LIIVTPNTNA AGPGIGDGYE NFTKDLINCL IPYIESRYSV YTDREHRAIA GLSMGGGQSF
NIGLTNLDKF AYIGPISSAP NTYPNNRLFP DGGAAARQKL KLLFIACGTN DSLIGFGQRV
HEFCVANNIN HIYWLIQGGG HDYNVWKAGL WNFLQLAEQA GLTDYNAPTP PPPAPRSAFT
RIEAEDFDNM SGIENESCSE GGLNIGYIEN GDYVAYSNID FGNGAKEFQA RVASATSGGK
IEIRLDSITG PLIGTCSVSG TGGWQQWVDV KCEVSGVSGT HDLYLKFTGG SGYLFNINWW
KFTQADSNPT PTPPPNENLG DLNGDGNINS TDLQILKKHL LRITLLTGKE LSNADVTKDG
KVDSTDLTLL KRYILRFVTN F
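
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first row of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons are permitted, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGTTAAGAA AGGTTCTTAG TGTATTATTA ATTTGCCTGG TGCTTATAGC ATGTTTGGGC"
print(translate(first_row))  # MLRKVLSVLLICLVLIACLG
```

The output matches the first 20 residues of the protein sequence listed above.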