Gene SbBS512_E2366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2366 
SymbolpqiB 
ID6268613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2154394 
End bp2156034 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content50% 
IMG OID641726370 
Productparaquat-inducible protein B 
Protein accessionYP_001880852 
Protein GI187731372 
COG category[R] General function prediction only 
COG ID[COG3008] Paraquat-inducible protein B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.547013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCTA ATAATGGGGA AGCCAAAATC CAGAAAGTGA AGAACTGGTC TCCCGTGTGG 
ATATTTCCTA TCGTCACGGC GCTCATTGGG GCCTGGGTTC TATTTTATCA TTACAGCCAT
CAGGGACCGG AAGTGACCCT GATCACCGCG AATGCGGAAG GAATTGAAGG TGGCAAAACC
ACCATTAAAA GCCGTAGCGT TGACGTCGGC GTGGTTGAAA GCGCCACACT GGCTGATGAT
TTGACGCACG TTGAAATCAA AGCGCGGCTG AATTCCGGTA TGGAAAAATT GCTGCATAAA
GACACCGTCT TTTGGGTGGT GAAACCGCAG ATTGGTCGCG AAGGGATTAG CGGCCTGGGA
ACGCTGCTGT CTGGAGTTTA TATCGAACTG CAGCCAGGCG CGAAAGGCAG CAAAATGGAT
AAATACGATT TGCTGGACTC GCCACCGTTG GCCCCGCCTG ATGCGAAAGG TATCCGTGTG
ATTCTCGATA GCAAAAAAGC CGGGCAGCTC TCGCCAGGAG ATCCGGTGCT GTTCCGTGGC
TATCGGGTAG GTTCGGTTGA AACCAGCACC TTCGATACGC AAAAACGCAA TATCAGTTAT
CAACTGTTCA TCAATGCACC TTATGACCGA CTGGTGACCA GCAATGTTCG CTTCTGGAAA
GATAGTGGCA TTGCGGTTGA TCTGACGTCA GCGGGGATGC GTGTGGAGAT GGGCTCATTG
ACAACGCTGC TTAGTGGCGG TGTTAGCTTT GATGTGCCGG AAGGTCTGGA TTTAGGGCAG
CCAGTGGCAC CGAAAACAGC TTTCGTTTTG TATGATGATC AGAAGAGCAT TCAGGATTCG
TTGTACACCG ATCACATTGA TTATCTGATG TTCTTTAAAG ATTCGGTACG CGGTCTGCAA
CCGGGAGCTC CGGTAGAATT CCGGGGTATT CGCCTGGGTA CCGTAAGCAA AGTGCCATTC
TTTGCGCCGA ATATGCGTCA GACATTTAAC GATGATTACC GTATTCCGGT ACTGATTCGT
ATCGAGCCAG AGCGGCTGAA AATGCAGCTT GGCGAAAATA CGGATGTTGT TGAGCACCTT
GGCGAATTGT TGAAACGTGG TTTACGCGGA TCGCTGAAAA CCGGAAACCT GGTCACTGGT
GCACTGTATG TTGATCTCGA TTTCTATCCA AATACGCCTG CAATAACCGG TATTCGTGAA
TTTAATGGTT ATCAGATTAT CCCTACTGTT AGCGGCGGCC TGGCGCAAAT CCAGCAACGA
CTGATGGAAG CGTTGGATAA GATCAACAAA CTGCCATTGA ATCCGATGAT TGAACAGGCA
ACCAGTACGT TTTCTGAAAG TCAGCGCACA ATGAAAAACC TGCAAACGAC GCTGGATAGC
ATGAACAAGA TCCTCGCTAG CCAGTCGATG CAGCAGTTGC CGACGGATAT GCAGTCAACG
TTGCGTGAAT TGAATCGCAG CATGCAGGGC TTCCAGCCTG GCTCCGCAGC CTACAACAAG
ATGGTGGCGG ATATGCAGCG CCTTGATCAG GTGTTGCGAG AACTGCAACC GGTGCTGAAA
ACGCTCAATG AGAAGAGTAA CGCGCTGGTA TTTGAAGCGA AGGACAAAAA AGATCCAGAG
CCGAAGAGGG CGAAACAATG A
 
Protein sequence
MESNNGEAKI QKVKNWSPVW IFPIVTALIG AWVLFYHYSH QGPEVTLITA NAEGIEGGKT 
TIKSRSVDVG VVESATLADD LTHVEIKARL NSGMEKLLHK DTVFWVVKPQ IGREGISGLG
TLLSGVYIEL QPGAKGSKMD KYDLLDSPPL APPDAKGIRV ILDSKKAGQL SPGDPVLFRG
YRVGSVETST FDTQKRNISY QLFINAPYDR LVTSNVRFWK DSGIAVDLTS AGMRVEMGSL
TTLLSGGVSF DVPEGLDLGQ PVAPKTAFVL YDDQKSIQDS LYTDHIDYLM FFKDSVRGLQ
PGAPVEFRGI RLGTVSKVPF FAPNMRQTFN DDYRIPVLIR IEPERLKMQL GENTDVVEHL
GELLKRGLRG SLKTGNLVTG ALYVDLDFYP NTPAITGIRE FNGYQIIPTV SGGLAQIQQR
LMEALDKINK LPLNPMIEQA TSTFSESQRT MKNLQTTLDS MNKILASQSM QQLPTDMQST
LRELNRSMQG FQPGSAAYNK MVADMQRLDQ VLRELQPVLK TLNEKSNALV FEAKDKKDPE
PKRAKQ