Gene EcSMS35_3564 details

Gene Information

Locus tag: EcSMS35_3564
Symbol:
ID: 6147128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3647528
End bp: 3648709
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 50%
IMG OID: 641618392
Product: His/Glu/Gln/Arg/opine ABC transporter permease
Protein accession: YP_001745539
Protein GI: 170682235
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4597] ABC-type amino acid transport system, permease component
TIGRFAM ID: [TIGR01726] amine acid ABC transporter, permease protein, 3-TM region, His/Glu/Gln/Arg/opine family
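
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names gene_length and protein_length are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(3647528, 3648709)
    print(length, "bp")                  # matches the Gene Length field
    print(protein_length(length), "aa")  # matches the Protein Length field
```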


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.982473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCATC GCCGCTCAAC CGTTAAAGGC TCACTCTCCT TTGCCAACCC TACGGTTCGC 
GCCTGGTTAT TCCAGATCCT TGCCGTTGTT GCTGTTGTTG GCATTGTTGG TTGGCTATTT
CACAACACTG TGACGAATCT CAGTAATCGT GGCATTACTT CAGGTTTTGC CTTTCTGGAT
CGCGGCGCTG GCTTCGGTAT TGTCCAGCAT TTGATCGATT ACCAGCAGGG TGATACCTAC
GGACGCGTTT TCATTGTTGG CTTACTCAAT ACGCTACTGG TTTCTGCATT GTGTATTGTA
TTCGCTTCTG TTCTGGGCTT CTTTATCGGT CTGGCGCGAC TTTCGGATAA CTGGCTGCTA
CGAAAGCTTT CCACAATTTA TATTGAGATC TTCCGTAATA TTCCGCCGCT GCTGCAAATC
TTCTTCTGGT ACTTTGCCGT GCTGCGCAAT TTGCCCGGAC CACGCCAGGC AGTGAGTGCG
TTTGATCTGG CCTTTTTGAG CAATCGTGGG CTTTATATTC CGTCGCCGCA GCTGGGAGAC
GGATTTATTG CGTTTATCCT GGCTGTTGTT ATGGCTATAG TCCTTTCTGT TGGGCTATTC
CGCTTTAATA AAACATACCA GATAAAAACC GGACAACTGC GCCGCACCTG GCCGATCGCC
GCAGTGTTGA TCATTGGTTT GCCTTTACTG GCGCAATGGC TTTTTGGCGC AGCACTACAC
TGGGATGTCC CAGCCCTTCG AGGCTTTAAT TTCCGCGGCG GGATGGTTTT AATTCCTGAA
CTGGCAGCCT TAACGCTGGC GCTTTCGGTT TATACTTCTG CATTTATCGC CGAAATTATC
CGCGCAGGGA TCCAGGCAGT GCCTTATGGT CAACATGAAG CAGCTCGGTC ACTGGGATTA
CCCAATCCGG TCACGCTACG CCAGGTCATT ATTCCCCAGG CATTGCGGGT GATTATTCCA
CCGTTAACCA GCCAGTATCT CAACATCGTC AAGAACTCCT CTCTTGCCGC CGCTATTGGC
TATCCCGATA TGGTTTCGCT GTTTGCTGGC ACCGTGCTGA ACCAAACGGG GCAAGCCATC
GAGACGATAG CCATGACCAT GTCGGTCTAT CTGATTATCA GCCTGACTAT CTCGCTGCTG
ATGAATATCT ATAACCGCCG CATCGCGATC GTTGAACGCT AA
 
Protein sequence
MSHRRSTVKG SLSFANPTVR AWLFQILAVV AVVGIVGWLF HNTVTNLSNR GITSGFAFLD 
RGAGFGIVQH LIDYQQGDTY GRVFIVGLLN TLLVSALCIV FASVLGFFIG LARLSDNWLL
RKLSTIYIEI FRNIPPLLQI FFWYFAVLRN LPGPRQAVSA FDLAFLSNRG LYIPSPQLGD
GFIAFILAVV MAIVLSVGLF RFNKTYQIKT GQLRRTWPIA AVLIIGLPLL AQWLFGAALH
WDVPALRGFN FRGGMVLIPE LAALTLALSV YTSAFIAEII RAGIQAVPYG QHEAARSLGL
PNPVTLRQVI IPQALRVIIP PLTSQYLNIV KNSSLAAAIG YPDMVSLFAG TVLNQTGQAI
ETIAMTMSVY LIISLTISLL MNIYNRRIAI VER
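
The relationship between the gene sequence and the protein sequence can be checked with a short script. The sketch below is an illustration only, not the method used to produce this record: it assumes the sequence is pasted as plain text in blocks of ten letters, and the helper names clean, translate, and gc_fraction are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so a standard codon table is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = "ATGTCTCATC GCCGCTCAAC CGTTAAAGGC TCACTCTCCT TTGCCAACCC TACGGTTCGC"
    print(translate(first_line))
    print(round(gc_fraction(first_line), 2))
```

Applied to the full gene sequence, translate should reproduce the protein sequence listed above, and gc_fraction can be compared against the GC content field.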