Gene EcSMS35_1893 details

Gene Information

Locus tag: EcSMS35_1893
Symbol: oppF
ID: 6143224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1913711
End bp: 1914715
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 52%
IMG OID: 641616769
Product: oligopeptide ABC transporter, ATP-binding protein OppF
Protein accession: YP_001743947
Protein GI: 170680026
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4608] ABC-type oligopeptide transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
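
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1913711, 1914715)
print(length_bp, protein_length(length_bp))  # 1005 334
```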


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.854707
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.049175
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATGCTG TAACCGAAGG AAGAAAAGTC CTCCTCGAAA TCGCCGATCT GAAAGTGCAC 
TTTGAAATCA AAGATGGCAA ACAGTGGTTC TGGCAACCGC CGAAAACGCT CAAAGCCGTC
GATGGTGTAA CTCTTCGCCT GTATGAAGGG GAAACATTAG GTGTGGTAGG GGAATCGGGA
TGCGGTAAGT CCACCTTTGC TCGCGCCATC ATCGGTTTGG TCAAGGCGAC CGACGGTCAT
GTTGCCTGGT TAGGTAAAGA GTTGCTGGGC ATGAAGCCCG ATGAATGGCG TGCCGTTCGC
AGTGATATTC AGATGATTTT CCAGGATCCG TTGGCATCGC TGAACCCGCG TATGACCATC
GGCGAGATCA TCGCTGAACC ACTGCGTACT TATCATCCGA AAATGTCACG CCAGGAAGTT
CGCGAGCGCG TGAAGGCGAT GATGCTGAAA GTCGGGTTAT TGCCTAACCT GATTAACCGC
TATCCGCATG AGTTCTCTGG TGGGCAGTGC CAGCGTATCG GGATTGCACG TGCACTTATT
CTTGAACCGA AGCTGATTAT CTGTGATGAG CCGGTGTCGG CGCTGGATGT GTCAATTCAG
GCGCAGGTGG TCAACCTGCT CCAGCAGTTA CAACGTGAGA TGGGATTGTC ATTAATTTTT
ATCGCTCACG ACCTGGCCGT GGTAAAACAC ATTTCCGATC GTGTGTTGGT GATGTATCTC
GGCCATGCGG TAGAACTGGG GACCTATGAT GAGGTCTACC ACAATCCACT ACATCCTTAC
ACCAGGGCAT TGATGTCGGC AGTCCCCATA CCTGATCCGG ATCTGGAGAA GAACAAAACC
ATCCAGTTAC TGGAAGGGGA ATTACCGTCG CCGATCAACC CGCCTTCCGG TTGTGTTTTC
CGTACCCGTT GCCCGATTGC CGGTCCGGAG TGCGCCAAAA CACGTCCTGT GCTGGAGGGC
AGTTTCAGAC ACGCCGTTTC TTGCCTGAAA GTCGATCCAC TTTAA
 
Protein sequence
MNAVTEGRKV LLEIADLKVH FEIKDGKQWF WQPPKTLKAV DGVTLRLYEG ETLGVVGESG 
CGKSTFARAI IGLVKATDGH VAWLGKELLG MKPDEWRAVR SDIQMIFQDP LASLNPRMTI
GEIIAEPLRT YHPKMSRQEV RERVKAMMLK VGLLPNLINR YPHEFSGGQC QRIGIARALI
LEPKLIICDE PVSALDVSIQ AQVVNLLQQL QREMGLSLIF IAHDLAVVKH ISDRVLVMYL
GHAVELGTYD EVYHNPLHPY TRALMSAVPI PDPDLEKNKT IQLLEGELPS PINPPSGCVF
RTRCPIAGPE CAKTRPVLEG SFRHAVSCLK VDPL
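
The protein sequence follows from the gene sequence by codon-wise translation. As a minimal sketch, the snippet below translates the first 60 bases of the gene sequence with a hand-built codon table. It assumes that the listed gene sequence is the coding strand read 5' to 3', and it uses the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard table in its permitted start codons). The names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as listed in the record.
first_line = "ATGAATGCTG TAACCGAAGG AAGAAAAGTC CTCCTCGAAA TCGCCGATCT GAAAGTGCAC"
print(translate(first_line))  # MNAVTEGRKVLLEIADLKVH
```

Under these assumptions, applying `translate` to the full 1005 bp sequence should reproduce the 334-residue protein sequence, and `gc_fraction` gives a value to compare with the listed GC content.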