Gene EcSMS35_2169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2169 
SymbolpqiA 
ID6147320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2174340 
End bp2175593 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content52% 
IMG OID641617045 
Productparaquat-inducible protein A 
Protein accessionYP_001744219 
Protein GI170680788 
COG category[S] Function unknown 
COG ID[COG2995] Uncharacterized paraquat-inducible protein A 
TIGRFAM ID[TIGR00155] integral membrane protein, PqiA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000394614 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.228835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGAAC ATCATCATGC CGCGAAGCAC ATCCTGTGCT CGCAGTGTGA CATGCTGGTG 
GCGTTACCGC GCCTTGAGCA TGGTCAGAAA GCGGCATGTC CCCGGTGTGG CACAACGTTA
ACCGTGGCGT GGGATGCCCC TCGGCAGCGT CCCACCGCCT ATGCGTTGGC TGCACTGTTC
ATGCTGTTGC TGTCCAACTT GTTTCCTTTT GTGAATATGA ACGTTGCGGG AGTCACCAGT
GAAATTACAT TACTGGAAAT TCCCGGCGTG CTTTTTTCTG AGGACTACGC CAGCCTCGGC
ACCTTTTTCC TGTTGTTTGT GCAACTGGTT CCCGCGTTTT GTCTGATAAC CATTCTGTTA
CTGGTGAATC GCGCGGAATT ACCGGTCCGT TTAAAAGAGC AACTGGCACG GGTGCTTTTT
CAACTCAAAA CCTGGGGAAT GGCGGAAATT TTCCTCGCGG GAGTGCTGGT CAGTTTCGTT
AAACTGATGG CCTACGGCAG CATTGGTGTC GGCAGTAGCT TTCTCCCCTG GTGTTTATTT
TGTGTCCTGC AACTGCGCGC CTTTCAGTGC GTTGATCGTC GCTGGTTATG GGACGATATC
GCCCCGATGC CAGAACTGCG CCAGCCGCTA AAACCTGGCG TCACGGGGAT ACGTCAGGGG
CTGCGTTCTT GCTCCTGTTG TACGGCAATC CTTCCTGCTG ATGAACCCGT ATGCCCGCGC
TGTGGTACTA AAGGGTACGT TCGGCGTAGA AACAGCCTGC AGTGGACACT CGCGCTGCTT
GTAACGTCCA TCATGCTGTA CCTTCCGGCT AATATTTTGC CCATCATGGT GACGGATTTA
TTAGGCTCGA AGATGCCATC GACGATTCTC GCTGGGGTCA TCCTGTTATG GAGCGAAGGA
TCTTATCCCG TCGCTGCGGT TATCTTTCTG GCCAGTATTA TGGTGCCAAC GTTAAAGATG
ATCGCCATAG CGTGGTTGTG TTGGGATGCC AAAGGGCATG GCAAGCGCGA CAGTGAAAGA
ATGCATTTGA TTTATGAAGT TGTTGAGTTT GTAGGCCGCT GGTCGATGAT TGACGTTTTC
GTTATCGCGG TGCTCTCGGC GCTGGTGCGT ATGGGAGGTT TAATGAGTAT TTATCCGGCA
ATGGGTGCAT TAATGTTTGC TTTAGTCGTC ATAATGACAA TGTTTTCTGC TATGACGTTT
GACCCGCGTT TGTCGTGGGA TCGTCAACCT GAATCAGAGC ATGAGGAGTC CTGA
 
Protein sequence
MCEHHHAAKH ILCSQCDMLV ALPRLEHGQK AACPRCGTTL TVAWDAPRQR PTAYALAALF 
MLLLSNLFPF VNMNVAGVTS EITLLEIPGV LFSEDYASLG TFFLLFVQLV PAFCLITILL
LVNRAELPVR LKEQLARVLF QLKTWGMAEI FLAGVLVSFV KLMAYGSIGV GSSFLPWCLF
CVLQLRAFQC VDRRWLWDDI APMPELRQPL KPGVTGIRQG LRSCSCCTAI LPADEPVCPR
CGTKGYVRRR NSLQWTLALL VTSIMLYLPA NILPIMVTDL LGSKMPSTIL AGVILLWSEG
SYPVAAVIFL ASIMVPTLKM IAIAWLCWDA KGHGKRDSER MHLIYEVVEF VGRWSMIDVF
VIAVLSALVR MGGLMSIYPA MGALMFALVV IMTMFSAMTF DPRLSWDRQP ESEHEES