Gene EcSMS35_2168 details

Gene Information

Locus tag: EcSMS35_2168
Symbol: pqiB
ID: 6146633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2172695
End bp: 2174335
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 50%
IMG OID: 641617044
Product: paraquat-inducible protein B
Protein accession: YP_001744218
Protein GI: 170683101
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
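
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the terminal stop codon is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2172695
end_bp = 2174335

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1641 bp) and Protein Length (546 aa) fields.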


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000764872
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.254429
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATCTA ATAATGGGGA AGCCAAAATC CAGAAAGTGA AGAACTGGTC TCCCGTGTGG 
ATATTTCCTA TCGTCACGGC GCTCATTGGG GCCTGGGTTC TTTTTTATCA TTACAGCCAT
CAGGGGCCGG AAGTGACCCT GATCACCGCG AATGCGGAAG GAATAGAAGG TGGTAAAACC
ACCATTAAAA GCCGTAGCGT TGACGTCGGC GTGGTTGAAA GCGCCACACT GGCTGATGAT
TTGACGCACG TTGAAATCAA AGCGCGGCTG AATTCCGGTA TGGAAAAATT GCTGCATAAA
GATACCGTCT TTTGGGTGGT GAAACCGCAG ATTGGTCGAG AAGGGATTAG CGGCCTGGGA
ACGCTGCTGT CTGGGGTTTA TATCGAACTG CAGCCAGGCG CAAAAGGCAG CAAAATGGAT
AAATACGATT TGCTGGACTC GCCACCGTTG GCCCCGCCTG ATGCGAAAGG TATCCGTGTG
ATTCTCGATA GCAAAAAAGC CGGGCAGCTC TCGCCAGGAG ATCCCGTGCT GTTCCGTGGC
TATCGGGTAG GTTCGGTTGA AACCAGCACC TTCGATACGC AAAAACGCAA TATCAGCTAT
CAACTGTTCA TCAATGCACC TTATGACCGA CTGGTGACCA GCAATGTTCG CTTCTGGAAA
GATAGTGGCA TTGCGGTTGA TTTGACGTCA GCGGGAATGC GTGTGGAGAT GGGCTCATTG
ACAACGCTGC TTAGTGGCGG TGTCAGCTTT GATGTGCCGG AAGGTCTGGA TTTAGGGCAG
CCAGTGGCAC CGAAAACAGC TTTCGTTTTG TATGATGATC AGAAGAGCAT TCAGGATTCG
TTGTACACCG ATCACATTGA TTATCTGATG TTCTTTAAAG ATTCGGTACG CGGTCTGCAA
CCGGGAGCTC CGGTAGAATT CCGGGGTATT CGCCTGGGTA CCGTAAGCAA AGTGCCATTC
TTTGCGCCGA ATATGCGTCA GACATTTAAC GATGATTACC GTATTCCGGT ACTGATTCGT
ATCGAGCCAG AGCGGCTGAA AATGCAGCTT GGAGAAAATG CGGATGTTGT TGAGCACCTT
GGCGAATTGT TGAAACGTGG TTTACGCGGA TCGCTGAAAA CCGGAAACCT GGTCACTGGC
GCACTGTATG TTGATCTCGA TTTCTATCCA AATACGCCTG CAATAACCGG TATTCGTGAA
TTTAATGGTT ATCAGATTAT CCCGACCGTT AGCGGCGGTC TGGCGCAAAT CCAGCAACGA
CTGATGGAAG CGTTGGATAA GATCAACAAA CTGCCATTGA ATCCGATGAT TGAACAGGCA
ACCAGTACGC TTTCTGAAAG TCAGCGCACA ATGAAAAACC TGCAAACGAC GCTGGATAGC
ATGAACAAGA TCCTCGCCAG CCAGTCGATG CAGCAGTTAC CGACGGATAT GCAGTCAACG
TTGCGTGAAT TGAATCGCAG TATGCAGGGC TTCCAGCCTG GCTCCGCAGC CTACAACAAG
ATGGTGGCGG ATATGCAGCG ACTTGATCAG GTGTTGCGAG AACTGCAACC GGTGCTGAAA
ACGCTCAACG AGAAGAGTAA CGCGCTGGTA TTTGAAGCGA AGGACAAAAA AGATCCAGAG
CCGAAGAGGG CGAAACAATG A
 
Protein sequence
MESNNGEAKI QKVKNWSPVW IFPIVTALIG AWVLFYHYSH QGPEVTLITA NAEGIEGGKT 
TIKSRSVDVG VVESATLADD LTHVEIKARL NSGMEKLLHK DTVFWVVKPQ IGREGISGLG
TLLSGVYIEL QPGAKGSKMD KYDLLDSPPL APPDAKGIRV ILDSKKAGQL SPGDPVLFRG
YRVGSVETST FDTQKRNISY QLFINAPYDR LVTSNVRFWK DSGIAVDLTS AGMRVEMGSL
TTLLSGGVSF DVPEGLDLGQ PVAPKTAFVL YDDQKSIQDS LYTDHIDYLM FFKDSVRGLQ
PGAPVEFRGI RLGTVSKVPF FAPNMRQTFN DDYRIPVLIR IEPERLKMQL GENADVVEHL
GELLKRGLRG SLKTGNLVTG ALYVDLDFYP NTPAITGIRE FNGYQIIPTV SGGLAQIQQR
LMEALDKINK LPLNPMIEQA TSTLSESQRT MKNLQTTLDS MNKILASQSM QQLPTDMQST
LRELNRSMQG FQPGSAAYNK MVADMQRLDQ VLRELQPVLK TLNEKSNALV FEAKDKKDPE
PKRAKQ
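
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the method used by the source database: the helper names (clean, gc_fraction, translate) are hypothetical. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene begins with ATG), and that the 10-base blocks are separated only by whitespace.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGAATCTA ATAATGGGGA AGCCAAAATC"))  # MESNNGEAKI
```

Passing the full gene sequence to translate and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check for the record; gc_fraction can be compared with the GC content field in the same way.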