Gene EcHS_A1356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1356 
SymboloppF 
ID5593332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1352821 
End bp1353825 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content52% 
IMG OID640920511 
Productoligopeptide ABC transporter, ATP-binding protein OppF 
Protein accessionYP_001458070 
Protein GI157160752 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0000101924 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCTG TAACTGAAGG AAGAAAAGTC CTCCTTGAAA TCGCCGATCT TAAAGTGCAC 
TTTGAAATCA AAGATGGCAA ACAGTGGTTC TGGCAACCGC CGAAAACGCT CAAAGCCGTC
GATGGTGTCA CTCTTCGCCT GTATGAAGGG GAAACATTAG GTGTGGTAGG GGAATCGGGA
TGCGGTAAGT CCACCTTTGC TCGCGCCATC ATCGGTTTGG TCAAGGCGAC CGACGGTCAT
GTTGCCTGGT TAGGTAAAGA GTTGCTGGGC ATGAAGCCCG ATGAATGGCG TGCCGTTCGC
AGTGATATTC AGATGATTTT CCAGGATCCG TTGGCATCGC TAAACCCGCG TATGACCATC
GGCGAGATCA TCGCTGAACC ACTGCGTACT TATCATCCGA AAATGTCACG CCAGGAAGTT
CGCGAGCGCG TGAAGGCGAT GATGCTGAAA GTCGGGTTAT TGCCTAACCT GATTAACCGC
TATCCGCATG AGTTCTCCGG TGGGCAGTGC CAGCGTATCG GGATTGCTCG TGCTCTTATT
CTTGAACCGA AGCTGATTAT CTGCGATGAG CCGGTGTCGG CGCTGGACGT GTCAATTCAG
GCGCAGGTGG TCAACCTGCT CCAGCAGCTG CAACGTGAGA TGGGATTGTC ATTAATTTTT
ATCGCTCATG ACCTGGCCGT GGTAAAACAC ATTTCCGATC GTGTGTTGGT GATGTATCTC
GGCCATGCGG TAGAACTGGG GACCTATGAT GAGGTCTACC ACAATCCACT ACATCCTTAC
ACCAGGGCAT TGATGTCGGC AGTCCCCATA CCTGATCCGG ATCTGGAGAA GAACAAAACC
ATCCAGTTAC TGGAAGGGGA ATTACCGTCG CCGATCAACC CGCCTTCCGG TTGTGTTTTC
CGTACCCGTT GCCCGATTGC CGGTCCGGAG TGCGCCAAAA CACGTCCTGT TCTGGAGGGG
AGTTTCAGAC ACGCCGTTTC TTGCCTGAAA GTCGATCCGC TTTAA
 
Protein sequence
MNAVTEGRKV LLEIADLKVH FEIKDGKQWF WQPPKTLKAV DGVTLRLYEG ETLGVVGESG 
CGKSTFARAI IGLVKATDGH VAWLGKELLG MKPDEWRAVR SDIQMIFQDP LASLNPRMTI
GEIIAEPLRT YHPKMSRQEV RERVKAMMLK VGLLPNLINR YPHEFSGGQC QRIGIARALI
LEPKLIICDE PVSALDVSIQ AQVVNLLQQL QREMGLSLIF IAHDLAVVKH ISDRVLVMYL
GHAVELGTYD EVYHNPLHPY TRALMSAVPI PDPDLEKNKT IQLLEGELPS PINPPSGCVF
RTRCPIAGPE CAKTRPVLEG SFRHAVSCLK VDPL