Gene EcSMS35_3407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3407 
Symbol 
ID6146010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3486907 
End bp3488238 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content51% 
IMG OID641618236 
Productserine transporter family protein 
Protein accessionYP_001745385 
Protein GI170679999 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00814] serine transporter 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATTG CATCGAATAA AGGCGTCATT GCAGACGCTT CGACCCCGGC GGGTCGTGCT 
GGAATGAGTG AGAGCGAGTG GCGAGAAGCG ATCAAATTCG ACAGTACTGA CACCGGCTGG
GTGATTATGA GTATCGGGAT GGCGATTGGC GCGGGGATTG TTTTTCTCCC GGTGCAGGTC
GGTTTGATGG GGTTGTGGGT ATTTTTGCTC TCATCGGTGA TTGGTTACCC GGCAATGTAT
CTGTTTCAGC GGTTGTTTAT TAATACGTTG GCAGAATCAC CAGAATGTAA AGATTACCCG
AGCGTCATTA GCGGTTATTT AGGTAAAAAC TGGGGCATCC TGTTAGGTGC ACTCTATTTC
GTAATGCTGG TGATTTGGAT GTTCGTCTAT TCCACCGCCA TCACCAACGA TAGTGCTTCC
TACCTGCATA CCTTCGGCGT GACGGAAGGG TTGCTGTCAG ACAGTCCCTT TTATGGTCTG
GTACTGATTT GCATTCTGGT GGCGATCTCT TCACGCGGCG AGAAATTGTT ATTCAAAATT
TCGACCGGCA TGGTGCTGAC CAAGCTGCTG GTGGTCGCGG CGCTGGGCGT ATCGATGGTG
GGAATGTGGC ATCTGTATAA CGTCGGTTCG CTACCGCCGC TGGGGCTGCT GGTGAAAAAC
GCCATTATTA CGCTGCCGTT TACCCTGACG TCGATTCTGT TTATCCAGAC GTTAAGCCCG
ATGGTGATCT CTTATCGCTC GCGGGAAAAA TCGATTGAAG TGGCGCGGCA TAAAGCATTG
CGGGCAATGA ATATCGCGTT TGGCATTTTG TTTGTCACCG TCTTTTTCTA CGCAGTGTCG
TTCACGCTGG CGATGGGACA TGACGAAGCG GTAAAAGCCT ACGAGCAGAA TATTTCCGCG
CTGGCGATTG CGGCGCAGTT TATTAGCGGT GACGGCGCAG CGTGGGTGAA AGTGGTCAGC
GTCATTCTCA ATATCTTTGC AGTAATGACC GCGTTTTTTG GCGTCTATTT AGGCTTTCGC
GAAGCAACGC AAGGGATCGT AATGAACATC CTGCGTCGCA AGATGCCTGC CGAGAAGATT
AACGAAAATC TCGTTCAGCG CGGCATCATG ATTTTCGCCA TTTTGCTGGC CTGGAGCGCC
ATCGTACTGA ACGCACCGGT GTTGAGCTTC ACCTCTATCT GTAGCCCGAT TTTCGGCATG
GTAGGGTGCC TGATCCCGGC GTGGCTGGTT TACAAAGTAC CGGCATTGCA CAAATACAAA
GGGATGTCTC TGTACCTGAT TATCGTCACT GGTTTGTTGC TTTGTGTTTC TCCGTTCCTG
GCATTTTCTT GA
 
Protein sequence
MEIASNKGVI ADASTPAGRA GMSESEWREA IKFDSTDTGW VIMSIGMAIG AGIVFLPVQV 
GLMGLWVFLL SSVIGYPAMY LFQRLFINTL AESPECKDYP SVISGYLGKN WGILLGALYF
VMLVIWMFVY STAITNDSAS YLHTFGVTEG LLSDSPFYGL VLICILVAIS SRGEKLLFKI
STGMVLTKLL VVAALGVSMV GMWHLYNVGS LPPLGLLVKN AIITLPFTLT SILFIQTLSP
MVISYRSREK SIEVARHKAL RAMNIAFGIL FVTVFFYAVS FTLAMGHDEA VKAYEQNISA
LAIAAQFISG DGAAWVKVVS VILNIFAVMT AFFGVYLGFR EATQGIVMNI LRRKMPAEKI
NENLVQRGIM IFAILLAWSA IVLNAPVLSF TSICSPIFGM VGCLIPAWLV YKVPALHKYK
GMSLYLIIVT GLLLCVSPFL AFS