Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0612 |
Symbol | |
ID | 4808214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 750623 |
End bp | 752134 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106026 |
Product | SSS family solute/sodium (Na+) symporter |
Protein accession | YP_001037040 |
Protein GI | 125973130 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000214976 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTATGA AATTCTTGTT TCTTGCTCTC TTCGTTGCCG TTATGATCGG CGTCGGAATT TACAGCCGGA AAAAAATAAA TAACGAACAG GACTTTTTAT TGGGTGGAAG AAAAATGGGA CCGTGGATTT CGGCTTTTGC CTATGGTACA ACATATTTTT CAGCGGTTAT TTTCATAGGC TATGCCGGAA AAAACGGATG GAATTTTGGA ATTTCTGCCG TTTGGATTGG AATCGCAAAT GCAGTTTTAG GTTGTCTTCT CTCCTGGCTT GTCCTTGCAA AGAGAACCCG TAAGATGACG CATGATTTAA ACGTCTCAAC AATGCCCGAT TTCTTTGAAA AAAGATATGA CAGCAAGGGT TTGAAAATAG TTTCGTCAAT CATTATCTTC GTATTCCTTG TTCCTTATTC GGCATCGGTT TACCAGGGAT TGAGCTATTT AATTGAAAAA ACAATTGCAG AGCCTTTGGG TGCAACTTTT GGAATTACGA TTTCTTTTGA GGCAATTATG CTGATAATGG CAGCCTTGAC GGGAATATAT CTTTTGCTTG GAGGATATGT TGCCACTGCA ATTAATGATT TAATACAGGG CATTATAATG ATACTCGGAA TTATTTTAAT GGTAGCCTTT GTTGTAAATC ACGAAAATGT AGGAGGAATT GCCGAAGGAC TGTCGAGGCT TTCACAAATT CCGGACGTGG GAGAAGGGCT GGTCAAGCCT TTCGGACCGC AGCCTTTAAG CCTTCTTTCT TTGGTTATAC TTACGAGCCT TGGCACTTGG GGATTACCTC AGATGATTCA TAAATTTTAT GCGGTAAAAG ACGATGATTC CATAAAGAAG GCTACTATTG TTTCGACGAT TTTCGCAACA TTGATATCCG GCGGAGCTTA TTTCATAGGT GTCTTTGGGA GGCTCTTTGT TACACTGGAT GAAATAGGCG GAGATCTTGA CAGGATTGTA CCTACCATGC TTGAAAAGGC ATTGCCGGAT TATTTGATGG GTATTGTAAT TATTCTGGTG CTTTCGGCTT CGATGTCAAC TCTTTCTTCA CTGGTGCTTG TATCAAGTTC GGCAATTTCC ATGGACCTTG TAAAGGGAGT ATTTTTCCCG AAGATGAAGA GTGAAAAGGT CATGGTTCTT ATGAGAATAC TTTGTGCTTT GTTTGTGGCA TTTTCCTTCG TCGTCGGTGT AAGACCCAGC TCCATTTTGA CACTTATGTC ATTGTCCTGG GGAACTGTTG CAGGTTCATT CCTTGCTCCT TTCTTGTACG GCCTGTACTG GAAAGGTACT ACAAAGGCGG GAGCGTGGAC GGGTATGATA GTTGGGTTCT TATGCTCCGT GGGAGGTTCA ATGATTTTTG GTTTAAAGAA TGCACCGAAT GTGGGAGCGG CAGCCATGCT CATATCTCTT ATAGTGGTAC CTGTGGTAAG CCTCTTTACG GCAAAGCTTC CAAATGCGCA TATTAACATG ATATACGGTG AGGAAGACAG TAAGCAGAAA TTGGCCTCAT AA
|
Protein sequence | MFMKFLFLAL FVAVMIGVGI YSRKKINNEQ DFLLGGRKMG PWISAFAYGT TYFSAVIFIG YAGKNGWNFG ISAVWIGIAN AVLGCLLSWL VLAKRTRKMT HDLNVSTMPD FFEKRYDSKG LKIVSSIIIF VFLVPYSASV YQGLSYLIEK TIAEPLGATF GITISFEAIM LIMAALTGIY LLLGGYVATA INDLIQGIIM ILGIILMVAF VVNHENVGGI AEGLSRLSQI PDVGEGLVKP FGPQPLSLLS LVILTSLGTW GLPQMIHKFY AVKDDDSIKK ATIVSTIFAT LISGGAYFIG VFGRLFVTLD EIGGDLDRIV PTMLEKALPD YLMGIVIILV LSASMSTLSS LVLVSSSAIS MDLVKGVFFP KMKSEKVMVL MRILCALFVA FSFVVGVRPS SILTLMSLSW GTVAGSFLAP FLYGLYWKGT TKAGAWTGMI VGFLCSVGGS MIFGLKNAPN VGAAAMLISL IVVPVVSLFT AKLPNAHINM IYGEEDSKQK LAS
|
| |