Gene EcSMS35_2993 details

Gene Information

Locus tag: EcSMS35_2993
Symbol:
ID: 6144579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3074481
End bp: 3075710
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 44%
IMG OID: 641617862
Product: serine transporter family protein
Protein accession: YP_001745014
Protein GI: 170684281
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID: [TIGR00814] serine transporter
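
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, although the listed values are consistent with both. The function name `cds_lengths` is hypothetical and chosen for illustration only.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(3074481, 3075710))  # (1230, 409)
```

The result agrees with the Gene Length (1230 bp) and Protein Length (409 aa) fields.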


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.07384
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAATA TTTGGTCAAA AGAAGAAACT CTGTGGAGTT TCGCGCTCTA CGGCACAGCC 
GTTGGTGCAG GCACGCTCTT CCTTCCTATT CAGTTAGGTT CGGCAGGGGC TGTGGTCCTG
TTTATTACTG CTCTGGTCGC CTGGCCTTTA ACATATTGGC CACATAAAGC CTTATGCCAG
TTCATCCTCT CATCAAAAAC ATCAGCAGGT GAAGGGATAA CGGGCGCGGT AACACACTAC
TATGGCAAGA AGATTGGTAA TCTGATTACC ACGCTGTACT TCATCGCCTT TTTTGTCGTC
GTGTTGATAT ATGCCGTGGC AATTACCAAC TCACTTACGG AACAGCTGGC AAAGCATATG
GTTATTGATC TTCGCATCCG TATGTTGGTG AGTCTGGGTG TTGTATTAAT CCTGAATCTC
ATTTTTCTGA TGGGACGCCA TGCCACTATT CGGGTAATGG GATTTTTGGT ATTCCCATTG
ATTGCCTATT TCTTATTTCT TTCTATTTAC CTGGTCGGTA GTTGGCAACC TGATCTATTA
ACTACCCAGG TAGAGTTCAA TCAGAATACC CTTCACCAGA TATGGATATC GATTCCCGTG
ATGGTTTTCG CCTTTAGCCA TACGCCCATT ATTTCTACGT TTGCCATAGA CAGACGTGAA
AAATATGGCG AACATGCTAT GGATAAATGC AAAAAAATTA TGAAAGTCGC TTATCTCATC
ATCTGCATAA GTGTACTGTT CTTTGTCTTT AGCTGCCTGC TTTCTATTCC ACCTTCGTAT
ATTGAAGCGG CTAAAGAAGA AGGGGTTACC ATTTTATCGG CGCTTTCTAT GCTGCCGAAC
GCCCCAGCAT GGTTGTCAAT TTCCGGGATT ATTGTCGCAG TAGTTGCGAT GTCGAAATCA
TTCCTGGGTA CGTACTTTGG CGTTATTGAA GGTGCCACAG AGGTCGTCAA AACAACACTA
CAGCAGGTTG GTGTAAAGAA AAGTCGTGCA TTTAACCGCG CACTATCAAT TATGTTGGTA
TCGCTGATTA CCTTCATTGT TTGTTGCATT AACCCGAACG CGATTTCGAT GATTTACGCG
ATCAGCGGCC CGCTCATTGC CATGATACTT TTCATCATGC CTACGCTGTC AACGTATCTC
ATCCCGGCGC TTAAACCCTG GCGTTCCATC GGAAATCTGA TTACGCTGAT CGTGGGTATC
TTGTGCGTAT CGGTAATGTT CTTTAGCTAA
 
Protein sequence
MSNIWSKEET LWSFALYGTA VGAGTLFLPI QLGSAGAVVL FITALVAWPL TYWPHKALCQ 
FILSSKTSAG EGITGAVTHY YGKKIGNLIT TLYFIAFFVV VLIYAVAITN SLTEQLAKHM
VIDLRIRMLV SLGVVLILNL IFLMGRHATI RVMGFLVFPL IAYFLFLSIY LVGSWQPDLL
TTQVEFNQNT LHQIWISIPV MVFAFSHTPI ISTFAIDRRE KYGEHAMDKC KKIMKVAYLI
ICISVLFFVF SCLLSIPPSY IEAAKEEGVT ILSALSMLPN APAWLSISGI IVAVVAMSKS
FLGTYFGVIE GATEVVKTTL QQVGVKKSRA FNRALSIMLV SLITFIVCCI NPNAISMIYA
ISGPLIAMIL FIMPTLSTYL IPALKPWRSI GNLITLIVGI LCVSVMFFS
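
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the tool used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons), so a standard codon table suffices for a gene that starts with ATG, as this one does. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used on this page) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCTAATA TTTGGTCAAA AGAAGAAACT"))  # MSNIWSKEET
```

The output matches the first ten residues of the protein sequence. Passing the final line of the gene sequence (which begins in frame, at base 1201) yields LCVSVMFFS, the last nine residues, with translation ending at the TAA stop codon.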