Gene EcSMS35_1896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1896 
SymboloppB 
ID6142758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1916660 
End bp1917580 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content46% 
IMG OID641616772 
Productoligopeptide transporter permease 
Protein accessionYP_001743950 
Protein GI170681715 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.22801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.152903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAAT TTATTCTACG TCGCTGTCTG GAAGCGATTC CGACGCTATT TATTCTTATT 
ACTATTTCGT TCTTTATGAT GCGCCTCGCA CCGGGCAGTC CTTTTACCGG CGAACGTACT
TTACCACCGG AAGTGATGGC CAATATCGAA GCGAAATATC ATCTTAATGA TCCGATCATG
ACACAGTATT TCAGCTACCT GAAACAACTG GCGCATGGCG ATTTCGGTCC ATCGTTTAAA
TATAAAGATT ATTCGGTCAA CGACCTGGTT GCATCCAGTT TTCCCGTTTC TGCCAAACTG
GGAGCCGCAG CATTTTTCCT TGCGGTAATA CTGGGTGTTA GTGCTGGCGT TATTGCCGCA
TTAAAACAAA ACACCAAATG GGACTATACC GTGATGGGGC TGGCAATGAC CGGGGTTGTT
ATCCCCAGTT TTGTGGTTGC GCCATTATTA GTCATGATAT TTGCGATCAT TTTGCATTGG
CTGCCGGGCG GTGGCTGGAA TGGTGGTGCG CTTAAATTCA TGATACTGCC AATGGTGGCG
TTGTCACTCG CCTATATCGC CAGTATTGCG CGTATTACCC GTGGCTCTAT GATTGAAGTA
TTACACTCCA ACTTTATTCG TACTGCCCGG GCGAAAGGGT TACCTATGCG GCGGATCATT
TTACGCCACG CATTAAAACC TGCTCTGTTA CCCGTGCTCT CCTATATGGG CCCTGCATTT
GTCGGCATTA TTACCGGTTC TATGGTCATC GAAACCATTT ATGGTTTGCC GGGGATTGGG
CAATTGTTCG TTAATGGTGC ATTGAACCGT GACTATTCCT TAGTGTTAAG CCTGACCATC
CTGGTTGGTG CTTTAACCAT TTTGTTTAAT GCCATTGTCG ATGTGCTATA TGCGGTTATC
GACCCGAAAA TCCGTTACTG A
 
Protein sequence
MLKFILRRCL EAIPTLFILI TISFFMMRLA PGSPFTGERT LPPEVMANIE AKYHLNDPIM 
TQYFSYLKQL AHGDFGPSFK YKDYSVNDLV ASSFPVSAKL GAAAFFLAVI LGVSAGVIAA
LKQNTKWDYT VMGLAMTGVV IPSFVVAPLL VMIFAIILHW LPGGGWNGGA LKFMILPMVA
LSLAYIASIA RITRGSMIEV LHSNFIRTAR AKGLPMRRII LRHALKPALL PVLSYMGPAF
VGIITGSMVI ETIYGLPGIG QLFVNGALNR DYSLVLSLTI LVGALTILFN AIVDVLYAVI
DPKIRY