Gene SbBS512_E2367 details

Gene Information

Locus tag: SbBS512_E2367
Symbol: pqiA
ID: 6271377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 2156039
End bp: 2157292
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 52%
IMG OID: 641726371
Product: paraquat-inducible protein A
Protein accession: YP_001880853
Protein GI: 187731013
COG category: [S] Function unknown
COG ID: [COG2995] Uncharacterized paraquat-inducible protein A
TIGRFAM ID: [TIGR00155] integral membrane protein, PqiA family
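
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2156039, 2157292)
print(length, "bp")                    # 1254 bp
print(protein_length(length), "aa")    # 417 aa
```

Under these assumptions the start and end coordinates agree with the listed gene length of 1254 bp and protein length of 417 aa.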


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.658309
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCGAAC ATCATCATGC CGCGAAGCAC ATCCTGTGCT CGCAGTGTGA CATGCTGGTG 
GCGTTACCGC GCCTTGAGCA TGGTCAGAAA GCGGCATGTC CCCGGTGTGG CACAACGTTA
ACCGTGGCGT GGGATGCCCC TCGGCAGCGT CCGACCGCCT ATGCGTTGGC TGCACTGTTC
ATGCTGTTGC TGTCCAACTT GTTTCCTTTT GTGAATATGA ACGTTGCAGG AGTTACCAGT
GAAATTACAT TACTGGAAAT TCCCGGCGTG CTTTTTTCTG AGGACTACGC CAGCCTCGGC
ACCTTTTTCC TGTTGTTTGT GCAACTGGTT CCCTCGTTTT GTCTGATAAC CATTCTGTTA
CTGGTGAATC GCGCGGAATT ACCGGTCCGT TTAAAAGAGC AACTGGCACG GGTGCTTTTT
CAACTCAAAA CCTGGGGAAT GGCGGAGATT TTCCTCGCGG GTGTGCTGGT CAGTTTCGTT
AAACTGATGG CTTACGGCAG CATTGGGGTA GGCAGCAGCT TTCTCCCCTG GTGTTTATTT
TGTGTCCTGC AACTGCGCGC TTTTCAGTGC GTTGATCGTC GCTGGTTATG GGACGACATC
GCCCCGATGC CAGAACTGCG CCAGCCGCTA AAACCAGGCG TCACGGGGAT ACGTCAGGGG
CTGCGTTCGT GCTCCTGTTG TACGGCAATC CTTCCTGCTG ATGAACCCGT GTGCCCGCGT
TGTAGTACCA AAGGGTACGT TCGGCGTAGA AACAGCCTGC AGTGGACACT CGCGCTGCTT
GTAACGTCCA TCATGCTGTA TCTTCCGGCT AATATTTTGC CCATCATGGT GACGGATTTA
TTAGGCTCGA AGATGCCGTC GACGATTCTC GCTGGGGTCA TTCTGTTATG GAGCGAAGGC
TCTTATCCCG TCGCTGCGGT GATCTTTCTG GCCAGTATTA TGGTGCCAAC GTTAAAGATG
ATCGCCATCG CGTGGCTGTG TTGGGATGCC AAAGGGCATG GCAAGCGCGA CAGTGAAAGA
ATGCATTTGA TTTATGAAGT TGTTGAGTTT GTAGGCCGCT GGTCGATGAT TGACGTTTTC
GTTATCGCGG TGCTCTCGGC GCTGGTGCGT ATGGGAGGTT TAATGAGTAT TTATCCGGCA
ATGGGTGCAT TAATGTTTGC TTTAGTCGTC ATAATGACAA TGTTTTCTGC TATGACGTTT
GACCCGCGTT TGTCGTGGGA TCGTCAACCT GAATCAGAGC ATGAGGAGTC CTGA
 
Protein sequence
MCEHHHAAKH ILCSQCDMLV ALPRLEHGQK AACPRCGTTL TVAWDAPRQR PTAYALAALF 
MLLLSNLFPF VNMNVAGVTS EITLLEIPGV LFSEDYASLG TFFLLFVQLV PSFCLITILL
LVNRAELPVR LKEQLARVLF QLKTWGMAEI FLAGVLVSFV KLMAYGSIGV GSSFLPWCLF
CVLQLRAFQC VDRRWLWDDI APMPELRQPL KPGVTGIRQG LRSCSCCTAI LPADEPVCPR
CSTKGYVRRR NSLQWTLALL VTSIMLYLPA NILPIMVTDL LGSKMPSTIL AGVILLWSEG
SYPVAAVIFL ASIMVPTLKM IAIAWLCWDA KGHGKRDSER MHLIYEVVEF VGRWSMIDVF
VIAVLSALVR MGGLMSIYPA MGALMFALVV IMTMFSAMTF DPRLSWDRQP ESEHEES
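
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline that produced this record. It assumes the sequence is read in frame from its first base. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as starts. The first codon here is ATG, so no special start handling is needed. The names `CODON_TABLE`, `clean`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS and drop a single terminal stop codon, if present."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First seven codons and last four codons of the gene sequence above.
print(translate_cds("ATGTGCGAAC ATCATCATGC C"))  # MCEHHHA
print(translate_cds("GAGGAGTCCT GA"))            # EES
```

Passing the full gene sequence to `translate_cds` and comparing the result with `clean()` of the protein sequence is one way to check that the two blocks agree.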