Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2169 |
Symbol | pqiA |
ID | 6147320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2174340 |
End bp | 2175593 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617045 |
Product | paraquat-inducible protein A |
Protein accession | YP_001744219 |
Protein GI | 170680788 |
COG category | [S] Function unknown |
COG ID | [COG2995] Uncharacterized paraquat-inducible protein A |
TIGRFAM ID | [TIGR00155] integral membrane protein, PqiA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000394614 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.228835 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGAAC ATCATCATGC CGCGAAGCAC ATCCTGTGCT CGCAGTGTGA CATGCTGGTG GCGTTACCGC GCCTTGAGCA TGGTCAGAAA GCGGCATGTC CCCGGTGTGG CACAACGTTA ACCGTGGCGT GGGATGCCCC TCGGCAGCGT CCCACCGCCT ATGCGTTGGC TGCACTGTTC ATGCTGTTGC TGTCCAACTT GTTTCCTTTT GTGAATATGA ACGTTGCGGG AGTCACCAGT GAAATTACAT TACTGGAAAT TCCCGGCGTG CTTTTTTCTG AGGACTACGC CAGCCTCGGC ACCTTTTTCC TGTTGTTTGT GCAACTGGTT CCCGCGTTTT GTCTGATAAC CATTCTGTTA CTGGTGAATC GCGCGGAATT ACCGGTCCGT TTAAAAGAGC AACTGGCACG GGTGCTTTTT CAACTCAAAA CCTGGGGAAT GGCGGAAATT TTCCTCGCGG GAGTGCTGGT CAGTTTCGTT AAACTGATGG CCTACGGCAG CATTGGTGTC GGCAGTAGCT TTCTCCCCTG GTGTTTATTT TGTGTCCTGC AACTGCGCGC CTTTCAGTGC GTTGATCGTC GCTGGTTATG GGACGATATC GCCCCGATGC CAGAACTGCG CCAGCCGCTA AAACCTGGCG TCACGGGGAT ACGTCAGGGG CTGCGTTCTT GCTCCTGTTG TACGGCAATC CTTCCTGCTG ATGAACCCGT ATGCCCGCGC TGTGGTACTA AAGGGTACGT TCGGCGTAGA AACAGCCTGC AGTGGACACT CGCGCTGCTT GTAACGTCCA TCATGCTGTA CCTTCCGGCT AATATTTTGC CCATCATGGT GACGGATTTA TTAGGCTCGA AGATGCCATC GACGATTCTC GCTGGGGTCA TCCTGTTATG GAGCGAAGGA TCTTATCCCG TCGCTGCGGT TATCTTTCTG GCCAGTATTA TGGTGCCAAC GTTAAAGATG ATCGCCATAG CGTGGTTGTG TTGGGATGCC AAAGGGCATG GCAAGCGCGA CAGTGAAAGA ATGCATTTGA TTTATGAAGT TGTTGAGTTT GTAGGCCGCT GGTCGATGAT TGACGTTTTC GTTATCGCGG TGCTCTCGGC GCTGGTGCGT ATGGGAGGTT TAATGAGTAT TTATCCGGCA ATGGGTGCAT TAATGTTTGC TTTAGTCGTC ATAATGACAA TGTTTTCTGC TATGACGTTT GACCCGCGTT TGTCGTGGGA TCGTCAACCT GAATCAGAGC ATGAGGAGTC CTGA
|
Protein sequence | MCEHHHAAKH ILCSQCDMLV ALPRLEHGQK AACPRCGTTL TVAWDAPRQR PTAYALAALF MLLLSNLFPF VNMNVAGVTS EITLLEIPGV LFSEDYASLG TFFLLFVQLV PAFCLITILL LVNRAELPVR LKEQLARVLF QLKTWGMAEI FLAGVLVSFV KLMAYGSIGV GSSFLPWCLF CVLQLRAFQC VDRRWLWDDI APMPELRQPL KPGVTGIRQG LRSCSCCTAI LPADEPVCPR CGTKGYVRRR NSLQWTLALL VTSIMLYLPA NILPIMVTDL LGSKMPSTIL AGVILLWSEG SYPVAAVIFL ASIMVPTLKM IAIAWLCWDA KGHGKRDSER MHLIYEVVEF VGRWSMIDVF VIAVLSALVR MGGLMSIYPA MGALMFALVV IMTMFSAMTF DPRLSWDRQP ESEHEES
|
| |