Gene EcE24377A_1065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1065 
SymbolpqiA 
ID5587548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1090651 
End bp1091904 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content52% 
IMG OID640924769 
Productparaquat-inducible protein A 
Protein accessionYP_001462183 
Protein GI157155514 
COG category[S] Function unknown 
COG ID[COG2995] Uncharacterized paraquat-inducible protein A 
TIGRFAM ID[TIGR00155] integral membrane protein, PqiA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000565994 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCGAAC ATCATCATGC CGCGAAGCAC ATCCTGTGCT CGCAGTGTGA CATGCTGGTG 
GCGTTACCGC GCCTTGAGCA TGGTCAGAAA GCGGCATGTC CCCGGTGTGG CACAACGTTA
ACCGTGGCGT GGGATGCCCC TCGGCAGCGT CCCACCGCCT ATGCGTTGGC TGCACTGTTC
ATGCTGTTGC TGTCCAACTT GTTTCCTTTT GTGAATATGA ACGTTGCAGG AGTTACCAGT
GAAATTACAT TACTGGAAAT TCCCGGCGTG CTTTTTTCTG AGGACTACGC CAGCCTCGGC
ACCTTTTTCC TGTTGTTTGT GCAACTGGTT CCCGCGTTTT GTCTGATAAC CATTCTGTTA
CTGGTGAATC GCGCGGAATT ACCGGTCCGT TTAAAAGAGC AACTGGCACG GGTGCTTTTT
CAACTCAAAA CCTGGGGAAT GGCGGAGATT TTCCTCGCTG GTGTGCTGGT CAGTTTCGTT
AAACTGATGG CTTACGGCAG CATTGGGGTA GGCAGCAGCT TTCTCCCCTG GTGTTTATTT
TGTGTCCTGC AACTGCGCGC TTTTCAGTGC GTTGATCGTC GCTGGTTATG GGACGACATC
GCCCCGATGC CAGAACTGCG CCAGCCGCTA AAACCAGGCG TCACGGGGAT ACGTCAGGGG
CTGCGTTCGT GCTCCTGTTG TACGGCAATC CTTCCTGCTG ATGAACCCGT GTGCCCGCGT
TGTAGTACCA AAGGGTACGT TCGGCGTAGA AACAGCCTGC AGTGGACACT CGCGCTGCTT
GTAACGTCCA TCATGCTGTA TCTTCCGGCT AATATTTTGC CCATCATGGT GACGGATTTA
TTAGGCTCGA AGATGCCGTC GACGATTCTC GCTGGGGTCA TTCTGTTATG GAGCGAAGGC
TCTTATCCCG TCGCTGCGGT GATCTTTCTG GCCAGTATTA TGGTGCCAAC GTTAAAGATG
ATCGCCATCG CGTGGCTGTG TTGGGATGCC AAAGGGCATG GCAAGCGCGA CAGTGAAAGA
ATGCATTTGA TTTATGAAGT TGTTGAGTTT GTAGGCCGCT GGTCGATGAT TGACGTTTTC
GTTATCGCGG TGCTCTCGGC GCTGGTGCGT ATGGGAGGTT TAATGAGTAT TTATCCGGCA
ATGGGTGCAT TAATGTTTGC TTTAGTCGTC ATAATGACAA TGTTTTCTGC TATGACGTTT
GACCCGCGTT TGTCGTGGGA TCGTCAACCT GAATCAGAGC ATGAGGAGTC CTGA
 
Protein sequence
MCEHHHAAKH ILCSQCDMLV ALPRLEHGQK AACPRCGTTL TVAWDAPRQR PTAYALAALF 
MLLLSNLFPF VNMNVAGVTS EITLLEIPGV LFSEDYASLG TFFLLFVQLV PAFCLITILL
LVNRAELPVR LKEQLARVLF QLKTWGMAEI FLAGVLVSFV KLMAYGSIGV GSSFLPWCLF
CVLQLRAFQC VDRRWLWDDI APMPELRQPL KPGVTGIRQG LRSCSCCTAI LPADEPVCPR
CSTKGYVRRR NSLQWTLALL VTSIMLYLPA NILPIMVTDL LGSKMPSTIL AGVILLWSEG
SYPVAAVIFL ASIMVPTLKM IAIAWLCWDA KGHGKRDSER MHLIYEVVEF VGRWSMIDVF
VIAVLSALVR MGGLMSIYPA MGALMFALVV IMTMFSAMTF DPRLSWDRQP ESEHEES