Gene Cthe_0612 details


Gene Information

Locus tag: Cthe_0612
Symbol:
ID: 4808214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 750623
End bp: 752134
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 42%
IMG OID: 640106026
Product: SSS family solute/sodium (Na+) symporter
Protein accession: YP_001037040
Protein GI: 125973130
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
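
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 750623
END_BP = 752134
GENE_LENGTH_BP = 1512
PROTEIN_LENGTH_AA = 503


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1512
print(expected_protein_length(GENE_LENGTH_BP))  # 503
```

Both results agree with the Gene Length and Protein Length fields reported for this gene.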


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000214976
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTATGA AATTCTTGTT TCTTGCTCTC TTCGTTGCCG TTATGATCGG CGTCGGAATT 
TACAGCCGGA AAAAAATAAA TAACGAACAG GACTTTTTAT TGGGTGGAAG AAAAATGGGA
CCGTGGATTT CGGCTTTTGC CTATGGTACA ACATATTTTT CAGCGGTTAT TTTCATAGGC
TATGCCGGAA AAAACGGATG GAATTTTGGA ATTTCTGCCG TTTGGATTGG AATCGCAAAT
GCAGTTTTAG GTTGTCTTCT CTCCTGGCTT GTCCTTGCAA AGAGAACCCG TAAGATGACG
CATGATTTAA ACGTCTCAAC AATGCCCGAT TTCTTTGAAA AAAGATATGA CAGCAAGGGT
TTGAAAATAG TTTCGTCAAT CATTATCTTC GTATTCCTTG TTCCTTATTC GGCATCGGTT
TACCAGGGAT TGAGCTATTT AATTGAAAAA ACAATTGCAG AGCCTTTGGG TGCAACTTTT
GGAATTACGA TTTCTTTTGA GGCAATTATG CTGATAATGG CAGCCTTGAC GGGAATATAT
CTTTTGCTTG GAGGATATGT TGCCACTGCA ATTAATGATT TAATACAGGG CATTATAATG
ATACTCGGAA TTATTTTAAT GGTAGCCTTT GTTGTAAATC ACGAAAATGT AGGAGGAATT
GCCGAAGGAC TGTCGAGGCT TTCACAAATT CCGGACGTGG GAGAAGGGCT GGTCAAGCCT
TTCGGACCGC AGCCTTTAAG CCTTCTTTCT TTGGTTATAC TTACGAGCCT TGGCACTTGG
GGATTACCTC AGATGATTCA TAAATTTTAT GCGGTAAAAG ACGATGATTC CATAAAGAAG
GCTACTATTG TTTCGACGAT TTTCGCAACA TTGATATCCG GCGGAGCTTA TTTCATAGGT
GTCTTTGGGA GGCTCTTTGT TACACTGGAT GAAATAGGCG GAGATCTTGA CAGGATTGTA
CCTACCATGC TTGAAAAGGC ATTGCCGGAT TATTTGATGG GTATTGTAAT TATTCTGGTG
CTTTCGGCTT CGATGTCAAC TCTTTCTTCA CTGGTGCTTG TATCAAGTTC GGCAATTTCC
ATGGACCTTG TAAAGGGAGT ATTTTTCCCG AAGATGAAGA GTGAAAAGGT CATGGTTCTT
ATGAGAATAC TTTGTGCTTT GTTTGTGGCA TTTTCCTTCG TCGTCGGTGT AAGACCCAGC
TCCATTTTGA CACTTATGTC ATTGTCCTGG GGAACTGTTG CAGGTTCATT CCTTGCTCCT
TTCTTGTACG GCCTGTACTG GAAAGGTACT ACAAAGGCGG GAGCGTGGAC GGGTATGATA
GTTGGGTTCT TATGCTCCGT GGGAGGTTCA ATGATTTTTG GTTTAAAGAA TGCACCGAAT
GTGGGAGCGG CAGCCATGCT CATATCTCTT ATAGTGGTAC CTGTGGTAAG CCTCTTTACG
GCAAAGCTTC CAAATGCGCA TATTAACATG ATATACGGTG AGGAAGACAG TAAGCAGAAA
TTGGCCTCAT AA
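
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before the sequence can be measured. The following is a minimal sketch of how a GC fraction could be computed from such a block. The helper names are illustrative, and the calculation used to produce the GC content field of this record is not documented here, so this is only one plausible definition (G plus C over total bases).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGTTTATGA AATTCTTGTT TCTTGCTCTC TTCGTTGCCG TTATGATCGG CGTCGGAATT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full 1512 bp sequence to the same functions gives a value that can be compared with the GC content field.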
 
Protein sequence
MFMKFLFLAL FVAVMIGVGI YSRKKINNEQ DFLLGGRKMG PWISAFAYGT TYFSAVIFIG 
YAGKNGWNFG ISAVWIGIAN AVLGCLLSWL VLAKRTRKMT HDLNVSTMPD FFEKRYDSKG
LKIVSSIIIF VFLVPYSASV YQGLSYLIEK TIAEPLGATF GITISFEAIM LIMAALTGIY
LLLGGYVATA INDLIQGIIM ILGIILMVAF VVNHENVGGI AEGLSRLSQI PDVGEGLVKP
FGPQPLSLLS LVILTSLGTW GLPQMIHKFY AVKDDDSIKK ATIVSTIFAT LISGGAYFIG
VFGRLFVTLD EIGGDLDRIV PTMLEKALPD YLMGIVIILV LSASMSTLSS LVLVSSSAIS
MDLVKGVFFP KMKSEKVMVL MRILCALFVA FSFVVGVRPS SILTLMSLSW GTVAGSFLAP
FLYGLYWKGT TKAGAWTGMI VGFLCSVGGS MIFGLKNAPN VGAAAMLISL IVVPVVSLFT
AKLPNAHINM IYGEEDSKQK LAS
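
The protein sequence is the translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the sequence is read in frame from its first base and translation stops at the first stop codon. The amino acid assignments of table 11 are the same as those of the standard code (the tables differ in which codons may act as start codons), and the function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence.
print(translate("ATGTTTATGA AATTCTTGTT TCTTGCTCTC"))  # MFMKFLFLAL
print(translate("TTGGCCTCAT AA"))                     # LAS
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.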