Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1114 |
Symbol | pqiA |
ID | 6970711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1144828 |
End bp | 1146081 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385120 |
Product | paraquat-inducible protein A |
Protein accession | YP_002269619 |
Protein GI | 209396072 |
COG category | [S] Function unknown |
COG ID | [COG2995] Uncharacterized paraquat-inducible protein A |
TIGRFAM ID | [TIGR00155] integral membrane protein, PqiA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0100863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.917256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGAAC ATCATCATGC CGCGAAGCAC ATCCTGTGCT CGCAGTGTGA CATGCTGGTG GCGTTACCGC GCCTTGAGCA TGGTCAGAAA GCGGCATGTC CCCGGTGTGG CACAACGTTA ACCGTGGCGT GGGATGCCCC CCGGCAGCGT CCGACCGCCT ATGCGTTGGC TGCACTGTTC ATGCTGTTGC TGTCCAACTT GTTTCCTTTT GTGAATATGA ACGTTGCAGG AGTTACCAGT GAAATTACAT TACTGGAAAT TCCCGGCGTG CTTTTTTCTG AGGACTACGC CAGCCTCGGC ACCTTTTTCC TGTTGTTTGT GCAACTGGTT CCCGCGTTTT GTCTGATAAC CATTCTGTTA CTGGTGAATC GCGCGGAATT ACCGGTCCGT TTAAAAGAGC AACTGGCACG GGTGCTTTTT CAACTCAAAA CCTGGGGAAT GGCGGAGATT TTCCTCGCGG GTGTGCTGGT CAGTTTCGTT AAACTGATGG CTTACGGCAG CATTGGCGTA GGCAGCAGCT TTCTCCCCTG GTGTTTATTT TGTGTCCTGC AACTGCGCGC TTTTCAGTGC GTTGATCGTC GCTGGTTATG GGACGATATC GCCCCGATGC CAGAACTGCG CCAGCCGCTA AAACCAGGCG TCACGGGGAT ACGTCAGGGG CTGCGTTCGT GCTCCTGTTG TACGGCAATC CTTCCTGCTG ATGAACCCGT GTGCCCGCGC TGTGGTACTA AAGGGTACGT TCGACGTAGA AACAGCCTGC AATGGACACT CGCGCTGCTT GTTACGTCCA TCATGCTGTA TCTTCCGGCT AATATTTTGC CCATCATGGT GACGGATTTA TTAGGCTCGA AGATGCCATC GACGATTCTC GCTGGGGTCA TCCTGTTATG GAGCGAAGGC TCTTATCCCG TCGCTGCGGT GATCTTTCTG GCCAGTATTA TGGTGCCAAC GTTAAAGATG ATCGCCATCG CGTGGCTGTG TTGGGATGCC AAAGGGCATG GCAAGCGCGA CAGTGAAAGA ATGCATTTGA TTTATGAAGT TGTTGAGTTT GTAGGCCGCT GGTCGATGAT TGACGTTTTC GTTATCGCGG TGCTCTCGGC GCTGGTGCGT ATGGGAGGTT TAATGAGTAT TTATCCGGCA ATGGGTGCAT TAATGTTTGC TTTAGTCGTC ATAATGACAA TGTTTTCTGC TATGACGTTT GACCCGCGTT TGTCGTGGGA TCGTCAACCT GAATCAGAGC ATGAGGAGTC CTGA
|
Protein sequence | MCEHHHAAKH ILCSQCDMLV ALPRLEHGQK AACPRCGTTL TVAWDAPRQR PTAYALAALF MLLLSNLFPF VNMNVAGVTS EITLLEIPGV LFSEDYASLG TFFLLFVQLV PAFCLITILL LVNRAELPVR LKEQLARVLF QLKTWGMAEI FLAGVLVSFV KLMAYGSIGV GSSFLPWCLF CVLQLRAFQC VDRRWLWDDI APMPELRQPL KPGVTGIRQG LRSCSCCTAI LPADEPVCPR CGTKGYVRRR NSLQWTLALL VTSIMLYLPA NILPIMVTDL LGSKMPSTIL AGVILLWSEG SYPVAAVIFL ASIMVPTLKM IAIAWLCWDA KGHGKRDSER MHLIYEVVEF VGRWSMIDVF VIAVLSALVR MGGLMSIYPA MGALMFALVV IMTMFSAMTF DPRLSWDRQP ESEHEES
|
| |