Gene EcSMS35_3565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3565 
Symbol 
ID6146434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3648719 
End bp3649822 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content52% 
IMG OID641618393 
ProductHis/Glu/Gln/Arg/opine ABC transporter permease 
Protein accessionYP_001745540 
Protein GI170684312 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0765] ABC-type amino acid transport system, permease component 
TIGRFAM ID[TIGR01726] amine acid ABC transporter, permease protein, 3-TM region, His/Glu/Gln/Arg/opine family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.931761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAG TATTGCTGTC TCATCCCCCG CGCCCGGCGA GCCATAACTC AAGCCGCGCG 
ATGGTGTGGG TGCGAAAAAA TCTGTTCTCC AGCTTGAGCA ATAGCCTGCT GACTATTGGC
TGCATATGGT TGATGTGGGA ACTGATCCCA CCGTTGCTGA ACTGGGCATT TTTGCAAGCT
AACTGGGTTG GCTCAACGCG TGCCGACTGC ACAAAAGCCG GTGCCTGTTG GGTCTTCATC
CACGAACGAT TTGGTCAGTT TATGTATGGG CTTTACCCAC ACGACCAACG CTGGCGAATT
AACCTGGCAT TACTGATTGG GCTTGTATCG ATCGCACCAA TGTTCTGGAA AATACTCCCG
TATCGCGGTC GCTATATTGC GGTATGGGCG GTGATTTACC CACTGATTGT CTGGTGGCTG
ATGTATGGCG GGTTTCTTGG TCTTGAGCGG GTTGAAACCC GGCAATGGGG CGGGCTGACG
CTAACTTTAA TTATTGCATC AGTTGGGATT GCGGGGGCGC TGCCGTGGGG GATCTTACTG
GCGTTAGGTC GTCGCTCCCA TATGCCGATT GTGCGTATCT TATCGGTCAT TTTTATCGAG
TTCTGGCGCG GTGTACCGCT GATTACCGTT CTGTTTATGT CTTCGGTCAT GCTGCCGTTG
TTTATGGCAG AAGGCACCAG TATCGACAAA TTGATCCGCG CGCTGGTTGG CGTGATCCTG
TTTCAGTCAG CATATGTTGC GGAAGTCGTG CGAGGCGGAT TACAGGCGCT GCCTAAAGGG
CAATATGAAG CGGCAGAGTC GCTGGCGTTG GGTTACTGGA AAACCCAGGG GCTGGTTATT
CTGCCACAGG CGTTGAAGCT GGTGATTCCT GGGCTGGTAA ATACCATCAT CGCACTCTTC
AAAGATACCA GCCTGGTGAT CATCATCGGG TTGTTCGATC TTTTCAGTAG CGTTCAGCAG
GCAACCGTTG ATCCCGCCTG GTTGGGTATG TCGACGGAAG GGTATGTTTT CGCCGCACTG
ATCTACTGGA TCTTCTGTTT CAGCATGTCG CGCTATAGCC AGCATCTGGA AAAACGTTTT
AACACCGGGC GTACACCGCA TTGA
 
Protein sequence
MTKVLLSHPP RPASHNSSRA MVWVRKNLFS SLSNSLLTIG CIWLMWELIP PLLNWAFLQA 
NWVGSTRADC TKAGACWVFI HERFGQFMYG LYPHDQRWRI NLALLIGLVS IAPMFWKILP
YRGRYIAVWA VIYPLIVWWL MYGGFLGLER VETRQWGGLT LTLIIASVGI AGALPWGILL
ALGRRSHMPI VRILSVIFIE FWRGVPLITV LFMSSVMLPL FMAEGTSIDK LIRALVGVIL
FQSAYVAEVV RGGLQALPKG QYEAAESLAL GYWKTQGLVI LPQALKLVIP GLVNTIIALF
KDTSLVIIIG LFDLFSSVQQ ATVDPAWLGM STEGYVFAAL IYWIFCFSMS RYSQHLEKRF
NTGRTPH