Gene EcHS_A1059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1059 
SymbolpqiA 
ID5595115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1073615 
End bp1074868 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content52% 
IMG OID640920224 
Productparaquat-inducible protein A 
Protein accessionYP_001457789 
Protein GI157160471 
COG category[S] Function unknown 
COG ID[COG2995] Uncharacterized paraquat-inducible protein A 
TIGRFAM ID[TIGR00155] integral membrane protein, PqiA family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.000000508779 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCGAAC ATCATCATGC CGCGAAGCAC ATCCTGTGCT CGCAGTGTGA CATGCTGGTG 
GCGTTACCGC GCCTTGAGCA TGGTCAGAAA GCGGCATGTC CCCGGTGTGG CACAACGTTA
ACCGTGGCGT GGGATGCCCC TCGGCAGCGT CCCACCGCCT ATGCGTTGGC TGCACTGTTC
ATGCTGTTGC TGTCCAACTT GTTTCCTTTT GTGAATATGA ACGTTGCAGG AGTTACCAGT
GAAATTACAT TACTGGAAAT TCCCGGCGTG CTTTTTTCTG AGGACTACGC CAGCCTCGGC
ACCTTTTTCC TGTTGTTTGT GCAACTGGTT CCCGCGTTTT GTCTGATAAC CATTCTGTTA
CTGGTGAATC GCGCGGAATT ACCGGTCCGT TTAAAAGAGC AACTGGCACG GGTGCTTTTT
CAACTCAAAA CCTGGGGAAT GGCGGAGATT TTCCTCGCTG GTGTGCTGGT CAGTTTCGTT
AAACTGATGG CTTACGGCAG CATTGGGGTA GGCAGCAGCT TTCTCCCCTG GTGTTTATTT
TGTGTCCTGC AACTGCGCGC TTTTCAGTGC GTTGATCGTC GCTGGTTATG GGACGACATC
GCCCCGATGC CAGAACTGCG CCAGCCGCTA AAACCAGGCG TCACGGGGAT ACGTCAGGGG
CTGCGTTCGT GCTCCTGTTG TACGGCAATC CTTCCTGCTG ATGAACCCGT GTGCCCGCGT
TGTAGTACCA AAGGGTACGT TCGGCGTAGA AACAGCCTGC AGTGGACACT CGCGCTGCTT
GTAACGTCCA TCATGCTGTA TCTTCCGGCT AATATTTTGC CCATCATGGT GACGGATTTA
TTAGGCTCGA AGATGCCGTC GACGATTCTC GCTGGGGTCA TTCTGTTATG GAGCGAAGGC
TCTTATCCCG TCGCTGCGGT GATCTTTCTG GCCAGTATTA TGGTGCCAAC GTTAAAGATG
ATCGCCATCG CGTGGCTGTG TTGGGATGCC AAAGGGCATG GCAAGCGCGA CAGTGAAAGA
ATGCATTTGA TTTATGAAGT TGTTGAGTTT GTAGGCCGCT GGTCGATGAT TGACGTTTTC
GTTATCGCGG TGCTCTCGGC GCTGGTGCGT ATGGGAGGTT TAATGAGTAT TTATCCGGCA
ATGGGTGCAT TAATGTTTGC TTTAGTCGTC ATAATGACAA TGTTTTCTGC TATGACGTTT
GACCCGCGTT TGTCGTGGGA TCGTCAACCT GAATCAGAGC ATGAGGAGTC CTGA
 
Protein sequence
MCEHHHAAKH ILCSQCDMLV ALPRLEHGQK AACPRCGTTL TVAWDAPRQR PTAYALAALF 
MLLLSNLFPF VNMNVAGVTS EITLLEIPGV LFSEDYASLG TFFLLFVQLV PAFCLITILL
LVNRAELPVR LKEQLARVLF QLKTWGMAEI FLAGVLVSFV KLMAYGSIGV GSSFLPWCLF
CVLQLRAFQC VDRRWLWDDI APMPELRQPL KPGVTGIRQG LRSCSCCTAI LPADEPVCPR
CSTKGYVRRR NSLQWTLALL VTSIMLYLPA NILPIMVTDL LGSKMPSTIL AGVILLWSEG
SYPVAAVIFL ASIMVPTLKM IAIAWLCWDA KGHGKRDSER MHLIYEVVEF VGRWSMIDVF
VIAVLSALVR MGGLMSIYPA MGALMFALVV IMTMFSAMTF DPRLSWDRQP ESEHEES