Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4898 |
Symbol | |
ID | 6143804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 5017457 |
End bp | 5018926 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619701 |
Product | solute/sodium symporter (SSS) family protein |
Protein accession | YP_001746808 |
Protein GI | 170682270 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGTC ATATCTTTTT AGTCGGCTTT ATTATTTATG CCATCGCCAT GATTTGGCTT GGTTGGTTTG TTTCTCGTAA TCAGAAAAGC GGAGAAGATT TTCTGCTGGG CGGGCGTTCG CTGCCGCTGT TTCTGACGCT GGGGTCAACC GTTGCCACGA TGGTTGGCAC GGGTTCCAGT ATGGGCGCAG TGGGCTTTGG CTACAGCAAT GGCTGGGCCG GAATGCTTTA TGGCGTCGGT GGGGCCGTAG GTATATTACT GGTGGCGTGG CTGTTTGCAC CGGTACGTAA ATTACGATTT ATGACCATGA GTGAAGAAAT ATCTTATTAT ACTGGCGGCA GCCATTTAAT TAAAAATATT GTCGGCCTGA TGATATTTAT TGCCTCTATT GGCTGGCTGG GGGCGCATAT ATTAGGTGGA AGTATGTATC TTGCCTGGGC AACAGGTATT GATCTTACGG TGGCAAAATT AATTATTGCC TTAGCTTTTG CGATTTACGT TATTATCGGC GGTTATTCGG CGGTAGTGTG GACAGATACT ATTCAGGCCC TAATTCTGTT CTTTGGCTTT ATATTAATGG CTATTCTTGC CGTTGTTCAC GTTGGCGGCT GGAGTGCGAT TGAGCAGGCG ATGGACCCAA AAGCGATGAG CCTGTTTGCG ATTGATAAAA TGGGCGTGCT GCCTGCGCTT TCGCTGGCAC TGGTTATTGG CGTAGGTGTG CTGGCAACAC CGTCTTATCG CCAGCGTATT TACTCGGGTA AAGATGTTCC CTCTGTACGC CGTTCGTTTG TTATTACTGG GGTGTTATAT CTGTTCTTCT CGATTTTGCC TGCCATTATC GGCATGGCGG CGTTCACCAT GAACCCAAAT CTGGAAAACA GCAACTACGC CTTCTTGTTC GCGACCAGTT TCTTACCTGC CATTCTTGGG CTGGTGGTGC TGATTGCCGG GCTTTCAGCC ACCATGTCTT CTGCCAGCTC CGATGCCATC GCGGCGGTGG CGATTATGAT GCGCGACGTC TACACCATGA TTACCGGCAG AATGCCGCCG GAGGATAAGG CAATCACTTA CTCACGCTGG ATGTTGACCT TTGTTATTGG CCTGGCGCTG GTCTTTGCTC TGACCTCTAA CGACATCATC AGCTACATTA CCAAAATGAT TTCGATGCTG ATGTCCGGGT TGTTTATCTG CTCGATTCTC GGTCGCTTCT GGCTACGCTT TAACTGGCAG GGCGCGCTGG CGGCGTTAGT GAGTGGTATG GTGATGTCAA TTGTGGTGCT GATGAATGCC GACTGGCTGG CTTACTGGGG TAACCCGTGT ATTCCGTCAG TGCTGGGCAG CCTGGTCGCT GGCGTACTGG TGACGTTGGT AACGCCAGCA AGTCAGGTCA GCCGGGAAGA AGCTTTGGCT ATTATCACCA ATGAGCGTGA AAACCAGAGT GTCGTGATCA CCAAACCTGA AGAAGCATAA
|
Protein sequence | MNSHIFLVGF IIYAIAMIWL GWFVSRNQKS GEDFLLGGRS LPLFLTLGST VATMVGTGSS MGAVGFGYSN GWAGMLYGVG GAVGILLVAW LFAPVRKLRF MTMSEEISYY TGGSHLIKNI VGLMIFIASI GWLGAHILGG SMYLAWATGI DLTVAKLIIA LAFAIYVIIG GYSAVVWTDT IQALILFFGF ILMAILAVVH VGGWSAIEQA MDPKAMSLFA IDKMGVLPAL SLALVIGVGV LATPSYRQRI YSGKDVPSVR RSFVITGVLY LFFSILPAII GMAAFTMNPN LENSNYAFLF ATSFLPAILG LVVLIAGLSA TMSSASSDAI AAVAIMMRDV YTMITGRMPP EDKAITYSRW MLTFVIGLAL VFALTSNDII SYITKMISML MSGLFICSIL GRFWLRFNWQ GALAALVSGM VMSIVVLMNA DWLAYWGNPC IPSVLGSLVA GVLVTLVTPA SQVSREEALA IITNERENQS VVITKPEEA
|
| |