Gene EcSMS35_3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3858 
Symbol 
ID6147299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3927035 
End bp3928306 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content46% 
IMG OID641618684 
Productserine transporter family protein 
Protein accessionYP_001745824 
Protein GI170682329 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00814] serine transporter 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCACA ACACACTACC GAAACACGAT CAGAAATTGC CGTTTACACG CTACGACTTC 
GGCTGGGTTT TATTATGCAT AGGCATGGCG ATTGGTGCCG GAACCGTGCT GATGCCAGTA
CAAATTGGCT TGAAGGGAAT TTGGGTATTT ATTACCGCAG CGATCATTGC TTATCCTGCC
ACCTGGGTAG TGCAGGACAT TTATTTAAAA ACCCTTTCTG AAAGCGATTC CTGTAATGAC
TACACCGATA TTATCAGTCA TTACCTGGGG AAGAACTGGG GAATTTTCCT CGGGGTTATC
TACTTTTTGA TGATTATCCA CGGGATTTTT ATCTACTCTC TCTCCGTGGT TTTCGACAGC
GCCTCGTACC TGAAAACCTT CGGTTTAACC GATGCCGATC TTTCACAATC TCTACTTTAT
AAAGTCGCCA TTTTCGCCGT ACTGGTGGCA ATTGCGTCTG GTGGTGAACG ATTACTGTTT
AAGATTTCCG GGCCAATGGT GGTGGTCAAA GTAGGGATTA TCGTCGTGTT CGGTTTTGCG
ATGATCCCGC ACTGGAATTT CGCCAATATA ACCGCCTTCC CGCAAGCCTC CGTCTTTTTC
CGCGATGTCC TGCTTACCAT TCCCTTTTGC TTCTTTTCTG CGGTATTTAT TCAGGTACTT
AACCCAATGA ATATTGCCTA TCGTAAACGG GAAGCGGATA AAGTACTGGC AACCCGGCTC
GCGCTGCGTA CCCACCGAAT TAGTTATGTC ACGCTCATCG CGGTGATCCT GTTTTTTGCC
TTTTCGTTTA CCTTCTCAAT TAGCCACGAA GAAGCCGTTT CTGCCTTTGA ACAAAATATC
TCGGCACTGG CACTGGCCGC GCAGGTGATC CCTGGGCATA TCATTCATAT CACCTCTACG
GTGCTGAATA TCTTTGCCGT ACTGACCGCA TTCTTTGGCA TTTATCTCGG TTTCCACGAG
GCCATTAAAG GCATTATTCT CAATCTGTTA AGCCGAATTA TTGATACCAA GAAAATTAAT
TCACGCGTGC TGACTCTGGC GATCTGCGCT TTTATCGTCA TTACGTTGAC GATTTGGGTT
TCGTTTCGTG TATCGGTGCT GGTGTTCTTT CAGTTGGGAA GCCCGTTATA TGGCATTGTG
TCGTGCCTCA TTCCGTTTTT CCTGATCTAT AAAGTCGCAC AACTGGAAAA ACTTCGCGGA
TTTAAAGCCT GGCTGATTCT GCTTTACGGC ATTTTGCTAT GCTTGTCGCC ACTGTTGAAG
CTGATTGAGT AA
 
Protein sequence
MQHNTLPKHD QKLPFTRYDF GWVLLCIGMA IGAGTVLMPV QIGLKGIWVF ITAAIIAYPA 
TWVVQDIYLK TLSESDSCND YTDIISHYLG KNWGIFLGVI YFLMIIHGIF IYSLSVVFDS
ASYLKTFGLT DADLSQSLLY KVAIFAVLVA IASGGERLLF KISGPMVVVK VGIIVVFGFA
MIPHWNFANI TAFPQASVFF RDVLLTIPFC FFSAVFIQVL NPMNIAYRKR EADKVLATRL
ALRTHRISYV TLIAVILFFA FSFTFSISHE EAVSAFEQNI SALALAAQVI PGHIIHITST
VLNIFAVLTA FFGIYLGFHE AIKGIILNLL SRIIDTKKIN SRVLTLAICA FIVITLTIWV
SFRVSVLVFF QLGSPLYGIV SCLIPFFLIY KVAQLEKLRG FKAWLILLYG ILLCLSPLLK
LIE