Gene ECH74115_1115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1115 
SymbolpqiB 
ID6966951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1146086 
End bp1147726 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content50% 
IMG OID643385121 
Productparaquat-inducible protein B 
Protein accessionYP_002269620 
Protein GI209399494 
COG category[R] General function prediction only 
COG ID[COG3008] Paraquat-inducible protein B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000291875 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCTA ATAATGGGGA AGCCAAAATC CAGAAAGTGA AGAACTGGTC TCCCGTGTGG 
ATATTTCCTA TCGTCACGGC GCTCATTGGG GCCTGGGTTC TTTTTTATCA TTACAGCCAT
CAGGGACCGG AAGTGACCCT GATCACCGCG AATGCGGAAG GAATTGAAGG TGGCAAAACC
ACCATTAAAA GCCGTAGCGT TGACGTCGGC GTGGTTGAAA GCGCCACACT GGCTGATGAT
TTGACACACG TTGAAATCAA AGCGCGGCTG AATTCCGGTA TGGAAAAATT GCTGCATAAA
GATACCGTCT TTTGGGTGGT GAAACCGCAG ATTGGTCGAG AAGGGATTAG CGGTCTGGGA
ACCCTGCTGT CTGGGGTTTA TATCGAACTG CAGCCAGGCG CGAAAGGCAG CAAAATGGAT
AAATACGATT TGCTGGACTC GCCACCGTTG GCCCCGCCTG ATGCGAAAGG TATCCGTGTG
GTTCTCGATA GCAAAAAAGC CGGGCAGCTC TCGCCAGGAG ATCCGGTGCT GTTCCGTGGC
TATCGGGTAG GTTCGGTTGA AACCAGCACC TTCGATACGC AAAAACGCAA TATCAGTTAT
CAACTGTTCA TCAATGCACC TTATGACCGA CTGGTGACCA GCAATGTTCG CTTCTGGAAA
GATAGTGGCA TTGCGGTTGA TCTGACGTCA GCGGGAATGC GTGTGGAGAT GGGCTCATTG
ACAACGCTGC TGAGTGGCGG TGTCAGCTTT GATGTGCCGG AAGGTCTGGA TTTAGGGCAG
CCAGTGGCAC CGAAAACAGC TTTCGTTTTG TATGATGATC AGAAGAGCAT TCAGGATTCG
TTGTACACCG ATCACATTGA TTATCTGATG TTCTTTAAAG ATTCGGTACG CGGTCTGCAA
CCGGGAGCTC CGGTAGAATT CCGGGGTATT CGCCTCGGTA CCGTAAGCAA AGTGCCATTC
TTTGCGCCGA ATATGCGTCA GACATTTAAC GATGATTACC GTATTCCGGT GCTGATTCGT
ATCGAGCCAG AGCGGCTGAA AATGCAGCTT GGCGAAAATG CGGATGTTGT TGAGCACCTT
GGCGAATTGT TGAAACGTGG TTTACGCGGA TCGCTGAAAA CCGGAAACCT GGTCACTGGC
GCACTGTATG TTGATCTCGA TTTCTATCCA AATACGCCTG CAATAACCGG TATTCGTGAA
TTTAATGGTT ATCAGATTAT CCCGACCGTT AGCGGCGGCC TGGCGCAAAT CCAGCAACGA
CTGATGGAAG CGTTGGATAA GATCAACAAA CTGCCATTGA ATCCGATGAT TGAACAGGCA
ACCAGTACGC TTTCTGAAAG TCAGCGCACA ATGAAAAACC TGCAAACGAC GCTGGATAGC
ATGAACAAGA TCCTCGCCAG CCAGTCGATG CAGCAGTTGC CGACGGATAT GCAGTCAACG
TTGCGTGAAT TGAATCGCAG CATGCAGGGC TTCCAGCCCG GCTCCGCAGC CTACAACAAG
ATGGTGGCGG ATATGCAGCG CCTTGATCAG GTGTTGCGAG AACTGCAACC GGTGCTGAAA
ACGCTCAATG AGAAGAGTAA CGCGCTGGTA TTTGAAGCGA AGGACAAAAA AGATCCAGAG
CCGAAGAGGG CGAAACAATG A
 
Protein sequence
MESNNGEAKI QKVKNWSPVW IFPIVTALIG AWVLFYHYSH QGPEVTLITA NAEGIEGGKT 
TIKSRSVDVG VVESATLADD LTHVEIKARL NSGMEKLLHK DTVFWVVKPQ IGREGISGLG
TLLSGVYIEL QPGAKGSKMD KYDLLDSPPL APPDAKGIRV VLDSKKAGQL SPGDPVLFRG
YRVGSVETST FDTQKRNISY QLFINAPYDR LVTSNVRFWK DSGIAVDLTS AGMRVEMGSL
TTLLSGGVSF DVPEGLDLGQ PVAPKTAFVL YDDQKSIQDS LYTDHIDYLM FFKDSVRGLQ
PGAPVEFRGI RLGTVSKVPF FAPNMRQTFN DDYRIPVLIR IEPERLKMQL GENADVVEHL
GELLKRGLRG SLKTGNLVTG ALYVDLDFYP NTPAITGIRE FNGYQIIPTV SGGLAQIQQR
LMEALDKINK LPLNPMIEQA TSTLSESQRT MKNLQTTLDS MNKILASQSM QQLPTDMQST
LRELNRSMQG FQPGSAAYNK MVADMQRLDQ VLRELQPVLK TLNEKSNALV FEAKDKKDPE
PKRAKQ