Gene B21_00962 details


Gene Information

Locus tag: B21_00962
Symbol: pqiB
ID: 8113441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1017758
End bp: 1019398
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 50%
IMG OID: 644847224
Product: hypothetical protein
Protein accession: YP_002998797
Protein GI: 251784493
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
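
The gene length and protein length listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic; it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon, neither of which the record states explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1017758
END_BP = 1019398


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1641 546
```

Under those assumptions the computed values match the 1641 bp and 546 aa reported in the record.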


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000379419
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATCTA ATAATGGGGA AGCCAAAATC CAGAAAGTGA AGAACTGGTC TCCCGTGTGG 
ATATTTCCTA TCGTCACGGC GCTCATTGGG GCCTGGGTTC TTTTTTATCA TTACAGCCAT
CAGGGACCGG AAGTGACCCT GATCACCGCG AATGCGGAAG GAATTGAAGG TGGCAAAACC
ACCATTAAAA GCCGTAGCGT TGACGTCGGC GTGGTTGAAA GCGCCACACT GGCTGATGAT
TTGACGCACG TTGAAATCAA AGCGCGGCTG AATTCCGGTA TGGAAAAATT GCTGCATAAA
GACACCGTCT TTTGGGTGGT GAAACCGCAG ATTGGTCGCG AAGGGATTAG CGGCCTGGGA
ACGCTGCTGT CTGGAGTTTA TATCGAACTG CAGCCAGGCG CGAAAGGCAG CAAAATGGAT
AAATACGATT TGCTGGACTC GCCACCGTTG GCCCCGCCTG ATGCGAAAGG TATCCGTGTG
ATTCTCGATA GCAAAAAAGC CGGGCAGCTC TCGCCAGGAG ATCCGGTGCT GTTCCGTGGC
TATCGGGTAG GTTCGGTTGA AACCAGCACC TTCGATACAC AAAAACGCAA TATCAGCTAT
CAACTGTTCA TCAATGCACC TTATGACCGA CTGGTGACCA ACAATGTTCG CTTCTGGAAA
GATAGTGGCA TTGCGGTTGA TCTGACGTCA GCAGGGATGC GTGTGGAGAT GGGCTCATTG
ACAACGCTGC TGAGTGGCGG TGTCAGCTTT GATGTGCCGG AAGGTCTGGA TTTAGGGCAG
CCAGTGGCAC CGAAAACAGC TTTCGTTTTG TATGATGATC AGAAGAGCAT TCAGGATTCG
TTGTACACCG ATCACATTGA TTATCTGATG TTCTTTAAAG ATTCGGTACG CGGTCTGCAA
CCGGGAGCTC CGGTAGAATT CCGGGGTATT CGCCTGGGTA CCGTAAGCAA AGTGCCATTC
TTTGCGCCGA ATATGCGTCA GACATTTAAC GATGATTACC GTATTCCGGT ACTGATTCGT
ATCGAGCCAG AGCGGCTGAA AATGCAGCTT GGCGAAAATG CGGATGTTGT TGAGCACCTT
GGCGAATTGT TGAAACGTGG TTTACGCGGA TCGCTGAAAA CCGGAAACCT GGTCACTGGT
GCACTGTATG TTGATCTCGA TTTCTATCCA AATACGCCTG CAATAACCGG TATTCGTGAA
TTTAATGGTT ATCAGATTAT CCCGACCGTT AGCGGCGGCC TGGCGCAAAT CCAGCAACGA
CTGATGGAAG CGTTGGATAA GATCAACAAA CTGCCATTGA ATCCGATGAT TGAACAGGCA
ACCAGTACGC TTTCTGAAAG TCAGCGCACA ATGAAAAACC TGCAAACGAC GCTGGATAGC
ATGAACAAGA TCCTCGCTAG CCAGTCGATG CAGCAGTTGC CGACGGATAT GCAGTCAACG
TTGCGTGAAT TGAATCGCAG CATGCAGGGC TTCCAGCCTG GCTCCGCAGC CTACAACAAG
ATGGTGGCGG ATATGCAGCG CCTTGATCAG GTGTTGCGAG AACTGCAACC GGTGCTGAAA
ACGCTCAATG AGAAGAGTAA CGCGCTGGTA TTTGAAGCGA AGGACAAAAA AGATCCAGAG
CCGAAGAGGG CGAAACAATG A
 
Protein sequence
MESNNGEAKI QKVKNWSPVW IFPIVTALIG AWVLFYHYSH QGPEVTLITA NAEGIEGGKT 
TIKSRSVDVG VVESATLADD LTHVEIKARL NSGMEKLLHK DTVFWVVKPQ IGREGISGLG
TLLSGVYIEL QPGAKGSKMD KYDLLDSPPL APPDAKGIRV ILDSKKAGQL SPGDPVLFRG
YRVGSVETST FDTQKRNISY QLFINAPYDR LVTNNVRFWK DSGIAVDLTS AGMRVEMGSL
TTLLSGGVSF DVPEGLDLGQ PVAPKTAFVL YDDQKSIQDS LYTDHIDYLM FFKDSVRGLQ
PGAPVEFRGI RLGTVSKVPF FAPNMRQTFN DDYRIPVLIR IEPERLKMQL GENADVVEHL
GELLKRGLRG SLKTGNLVTG ALYVDLDFYP NTPAITGIRE FNGYQIIPTV SGGLAQIQQR
LMEALDKINK LPLNPMIEQA TSTLSESQRT MKNLQTTLDS MNKILASQSM QQLPTDMQST
LRELNRSMQG FQPGSAAYNK MVADMQRLDQ VLRELQPVLK TLNEKSNALV FEAKDKKDPE
PKRAKQ
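
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table by hand from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard table in the set of permitted start codons, which does not matter here because the gene begins with ATG). The `translate` function and the constant names are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Amino acid assignments are those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 21 bases of the gene sequence in this record.
    print(translate("ATGGAATCTA ATAATGGGGA AGCCAAAATC"))  # MESNNGEAKI
    print(translate("CCGAAGAGGG CGAAACAATG A"))            # PKRAKQ
```

The two fragments translate to the first ten and last six residues of the protein sequence shown in this record; passing the full gene sequence to `translate` would check the remaining residues the same way.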