Gene Information |
Locus tag | EcSMS35_2168 |
Symbol | pqiB |
ID | 6146633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2172695 |
End bp | 2174335 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617044 |
Product | paraquat-inducible protein B |
Protein accession | YP_001744218 |
Protein GI | 170683101 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
|
|
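The coordinate, length, and GC fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive (consistent with the values in this record) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration only; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in cleaned)
    return 100.0 * gc_count / len(cleaned)


# Values taken from this record: Start bp 2172695, End bp 2174335.
length_bp = gene_length(2172695, 2174335)
print(length_bp, protein_length(length_bp))
```

With the coordinates from this record, the sketch reproduces the listed gene length of 1641 bp and protein length of 546 aa. Passing the gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the rounded GC content field.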
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000764872 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.254429 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCTA ATAATGGGGA AGCCAAAATC CAGAAAGTGA AGAACTGGTC TCCCGTGTGG ATATTTCCTA TCGTCACGGC GCTCATTGGG GCCTGGGTTC TTTTTTATCA TTACAGCCAT CAGGGGCCGG AAGTGACCCT GATCACCGCG AATGCGGAAG GAATAGAAGG TGGTAAAACC ACCATTAAAA GCCGTAGCGT TGACGTCGGC GTGGTTGAAA GCGCCACACT GGCTGATGAT TTGACGCACG TTGAAATCAA AGCGCGGCTG AATTCCGGTA TGGAAAAATT GCTGCATAAA GATACCGTCT TTTGGGTGGT GAAACCGCAG ATTGGTCGAG AAGGGATTAG CGGCCTGGGA ACGCTGCTGT CTGGGGTTTA TATCGAACTG CAGCCAGGCG CAAAAGGCAG CAAAATGGAT AAATACGATT TGCTGGACTC GCCACCGTTG GCCCCGCCTG ATGCGAAAGG TATCCGTGTG ATTCTCGATA GCAAAAAAGC CGGGCAGCTC TCGCCAGGAG ATCCCGTGCT GTTCCGTGGC TATCGGGTAG GTTCGGTTGA AACCAGCACC TTCGATACGC AAAAACGCAA TATCAGCTAT CAACTGTTCA TCAATGCACC TTATGACCGA CTGGTGACCA GCAATGTTCG CTTCTGGAAA GATAGTGGCA TTGCGGTTGA TTTGACGTCA GCGGGAATGC GTGTGGAGAT GGGCTCATTG ACAACGCTGC TTAGTGGCGG TGTCAGCTTT GATGTGCCGG AAGGTCTGGA TTTAGGGCAG CCAGTGGCAC CGAAAACAGC TTTCGTTTTG TATGATGATC AGAAGAGCAT TCAGGATTCG TTGTACACCG ATCACATTGA TTATCTGATG TTCTTTAAAG ATTCGGTACG CGGTCTGCAA CCGGGAGCTC CGGTAGAATT CCGGGGTATT CGCCTGGGTA CCGTAAGCAA AGTGCCATTC TTTGCGCCGA ATATGCGTCA GACATTTAAC GATGATTACC GTATTCCGGT ACTGATTCGT ATCGAGCCAG AGCGGCTGAA AATGCAGCTT GGAGAAAATG CGGATGTTGT TGAGCACCTT GGCGAATTGT TGAAACGTGG TTTACGCGGA TCGCTGAAAA CCGGAAACCT GGTCACTGGC GCACTGTATG TTGATCTCGA TTTCTATCCA AATACGCCTG CAATAACCGG TATTCGTGAA TTTAATGGTT ATCAGATTAT CCCGACCGTT AGCGGCGGTC TGGCGCAAAT CCAGCAACGA CTGATGGAAG CGTTGGATAA GATCAACAAA CTGCCATTGA ATCCGATGAT TGAACAGGCA ACCAGTACGC TTTCTGAAAG TCAGCGCACA ATGAAAAACC TGCAAACGAC GCTGGATAGC ATGAACAAGA TCCTCGCCAG CCAGTCGATG CAGCAGTTAC CGACGGATAT GCAGTCAACG TTGCGTGAAT TGAATCGCAG TATGCAGGGC TTCCAGCCTG GCTCCGCAGC CTACAACAAG ATGGTGGCGG ATATGCAGCG ACTTGATCAG GTGTTGCGAG AACTGCAACC GGTGCTGAAA ACGCTCAACG AGAAGAGTAA CGCGCTGGTA TTTGAAGCGA AGGACAAAAA AGATCCAGAG CCGAAGAGGG CGAAACAATG A
|
Protein sequence | MESNNGEAKI QKVKNWSPVW IFPIVTALIG AWVLFYHYSH QGPEVTLITA NAEGIEGGKT TIKSRSVDVG VVESATLADD LTHVEIKARL NSGMEKLLHK DTVFWVVKPQ IGREGISGLG TLLSGVYIEL QPGAKGSKMD KYDLLDSPPL APPDAKGIRV ILDSKKAGQL SPGDPVLFRG YRVGSVETST FDTQKRNISY QLFINAPYDR LVTSNVRFWK DSGIAVDLTS AGMRVEMGSL TTLLSGGVSF DVPEGLDLGQ PVAPKTAFVL YDDQKSIQDS LYTDHIDYLM FFKDSVRGLQ PGAPVEFRGI RLGTVSKVPF FAPNMRQTFN DDYRIPVLIR IEPERLKMQL GENADVVEHL GELLKRGLRG SLKTGNLVTG ALYVDLDFYP NTPAITGIRE FNGYQIIPTV SGGLAQIQQR LMEALDKINK LPLNPMIEQA TSTLSESQRT MKNLQTTLDS MNKILASQSM QQLPTDMQST LRELNRSMQG FQPGSAAYNK MVADMQRLDQ VLRELQPVLK TLNEKSNALV FEAKDKKDPE PKRAKQ
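The relationship between the gene sequence and the protein sequence listed above can be checked with a small translation routine. This is a rough sketch, not the pipeline used to produce this record. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. Alternative start codons are not handled here. The name `translate` and the constants are hypothetical and chosen for illustration.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string up to the first stop codon.

    Whitespace between 10-base blocks is ignored. Assumption: the sequence
    starts in frame and uses an ATG start, so no start-codon special case
    is applied.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGAATCTA ATAATGGGGA AGCCAAAATC"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields the first ten residues of the protein sequence; applied to the full gene sequence, the result can be compared against the full protein sequence after removing the spaces between blocks.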
|
| |