Gene EcSMS35_4898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4898 
Symbol 
ID6143804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5017457 
End bp5018926 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content50% 
IMG OID641619701 
Productsolute/sodium symporter (SSS) family protein 
Protein accessionYP_001746808 
Protein GI170682270 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGTC ATATCTTTTT AGTCGGCTTT ATTATTTATG CCATCGCCAT GATTTGGCTT 
GGTTGGTTTG TTTCTCGTAA TCAGAAAAGC GGAGAAGATT TTCTGCTGGG CGGGCGTTCG
CTGCCGCTGT TTCTGACGCT GGGGTCAACC GTTGCCACGA TGGTTGGCAC GGGTTCCAGT
ATGGGCGCAG TGGGCTTTGG CTACAGCAAT GGCTGGGCCG GAATGCTTTA TGGCGTCGGT
GGGGCCGTAG GTATATTACT GGTGGCGTGG CTGTTTGCAC CGGTACGTAA ATTACGATTT
ATGACCATGA GTGAAGAAAT ATCTTATTAT ACTGGCGGCA GCCATTTAAT TAAAAATATT
GTCGGCCTGA TGATATTTAT TGCCTCTATT GGCTGGCTGG GGGCGCATAT ATTAGGTGGA
AGTATGTATC TTGCCTGGGC AACAGGTATT GATCTTACGG TGGCAAAATT AATTATTGCC
TTAGCTTTTG CGATTTACGT TATTATCGGC GGTTATTCGG CGGTAGTGTG GACAGATACT
ATTCAGGCCC TAATTCTGTT CTTTGGCTTT ATATTAATGG CTATTCTTGC CGTTGTTCAC
GTTGGCGGCT GGAGTGCGAT TGAGCAGGCG ATGGACCCAA AAGCGATGAG CCTGTTTGCG
ATTGATAAAA TGGGCGTGCT GCCTGCGCTT TCGCTGGCAC TGGTTATTGG CGTAGGTGTG
CTGGCAACAC CGTCTTATCG CCAGCGTATT TACTCGGGTA AAGATGTTCC CTCTGTACGC
CGTTCGTTTG TTATTACTGG GGTGTTATAT CTGTTCTTCT CGATTTTGCC TGCCATTATC
GGCATGGCGG CGTTCACCAT GAACCCAAAT CTGGAAAACA GCAACTACGC CTTCTTGTTC
GCGACCAGTT TCTTACCTGC CATTCTTGGG CTGGTGGTGC TGATTGCCGG GCTTTCAGCC
ACCATGTCTT CTGCCAGCTC CGATGCCATC GCGGCGGTGG CGATTATGAT GCGCGACGTC
TACACCATGA TTACCGGCAG AATGCCGCCG GAGGATAAGG CAATCACTTA CTCACGCTGG
ATGTTGACCT TTGTTATTGG CCTGGCGCTG GTCTTTGCTC TGACCTCTAA CGACATCATC
AGCTACATTA CCAAAATGAT TTCGATGCTG ATGTCCGGGT TGTTTATCTG CTCGATTCTC
GGTCGCTTCT GGCTACGCTT TAACTGGCAG GGCGCGCTGG CGGCGTTAGT GAGTGGTATG
GTGATGTCAA TTGTGGTGCT GATGAATGCC GACTGGCTGG CTTACTGGGG TAACCCGTGT
ATTCCGTCAG TGCTGGGCAG CCTGGTCGCT GGCGTACTGG TGACGTTGGT AACGCCAGCA
AGTCAGGTCA GCCGGGAAGA AGCTTTGGCT ATTATCACCA ATGAGCGTGA AAACCAGAGT
GTCGTGATCA CCAAACCTGA AGAAGCATAA
 
Protein sequence
MNSHIFLVGF IIYAIAMIWL GWFVSRNQKS GEDFLLGGRS LPLFLTLGST VATMVGTGSS 
MGAVGFGYSN GWAGMLYGVG GAVGILLVAW LFAPVRKLRF MTMSEEISYY TGGSHLIKNI
VGLMIFIASI GWLGAHILGG SMYLAWATGI DLTVAKLIIA LAFAIYVIIG GYSAVVWTDT
IQALILFFGF ILMAILAVVH VGGWSAIEQA MDPKAMSLFA IDKMGVLPAL SLALVIGVGV
LATPSYRQRI YSGKDVPSVR RSFVITGVLY LFFSILPAII GMAAFTMNPN LENSNYAFLF
ATSFLPAILG LVVLIAGLSA TMSSASSDAI AAVAIMMRDV YTMITGRMPP EDKAITYSRW
MLTFVIGLAL VFALTSNDII SYITKMISML MSGLFICSIL GRFWLRFNWQ GALAALVSGM
VMSIVVLMNA DWLAYWGNPC IPSVLGSLVA GVLVTLVTPA SQVSREEALA IITNERENQS
VVITKPEEA