Gene YPK_0739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_0739 
Symbol 
ID6089781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp797813 
End bp800713 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content55% 
IMG OID641595800 
Producthydrophobe/amphiphile efflux-1 (HAE1) family protein 
Protein accessionYP_001719493 
Protein GI170022988 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.828108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCATT TCTTTATTCG TCGCCCCAAG TTTGCCATTG TTATCGCATT GGTGATCACC 
CTGGTTGGCT GGGTTTCACT GTATGTTATC CCGGTCGAGC AATACCCGGA TATCACGCCG
CCGGTGGTCT CCGTCAGTGC TGTTTATCCC GGTGCCAGCG CCAGAGACGT GGCGCAAGCC
GTCGCGTCAC CACTGGAAGC GCAGGTCAAT GGCGTCAGCC ATATGTTGTA CATGGAATCA
ACCAGCGCCA ATAACGGCAG CTACCAATTG AGCATTACCT TTGCCAGTGG CACCGATCCC
GATATGGCGG CGGTTGAGGT ACAAAACCGC ATTTCGCAGG TGAGCGCGCA ACTGCCTGCG
GAGGTGAATG AAAATGGCAT CTCGGTACGC AAACGTGCGT CAAACCTACT ATTAGGGGTC
AGCGTTTTCT CCCCGCAACA GACCCATGAC GCACTGTTTG TCAGTAACTA CACCAGCATA
CAATTGCGTG ATGCTATTGC ACGTATCAGT GGTGTCGGTG ATGTACAGGT CTTTGGTGCG
CGCGATTACA GTATGCGTGT CTGGCTGGAT CCACAGCGCA TGGAATCACT GAATGTCAGC
GTGCAGGATA TCGTTGCCGC GTTACAACAG CAGAACGTAC AAGCGGCGGC AGGGCAGATA
GGCTCTTCAC CGTCGATGCC CAATCAACAA CAGACACTGA CCATCAGTGG GCAGGGGCGC
TTGACCGATG CACGGCAGTT TGCCGATGTG ATTATCCGCA GTAATCCACA GGGCGGTATG
ATCCGCTTGG GCGATGTGGC CCGCGTGGCC TTGGGGGCAC AGAATTATCA GGTCAGTGCG
GCACAAAACC AGACTGAATC AGCCTTCTTA GTGGTGTATC CGGTTCCCGG AGCCAATGCA
CTTAATGTGG CCAATGGCGT CCGTGATGAA ATGGCCCGGC TGTCGGCCGC CTTTCCGGCG
GATCTCACCT ATGAAATTAA TTACGACTCC ACCCTGCCGG TGACGGCGAC ACTGCATGAG
ATTGCGGTGT CACTGACCTT GACACTGATT GTGGTGCTCG CCGTGGTGTA TCTGTTTTTA
CAAAGTCTGC GCGCGACCTT TATTGTGGCA CTCACGGTTC CGGTCTCGTT GCTTGGCACC
TTTGCTGTGC TGTATGTCTT TGGTTATTCA GCCAATACCC TTAGCCTGTT TGCCATTATT
CTGGCACTGA CCATTGTGGT GGATGATGCC ATTGTGGTGG TCGAAAACGT TGAGCGGTTA
TTGTCGAACG ACCCTCATCT TTCACCGGCA GAGGCTACCC GGCAGGCGAT GAGCCAGATT
GCCGGGCCGA TCATCGCCAC CACACTGGTG TTGATGGCGG TATTTGTGCC AATCGCTATC
TTGCCGGGGA TTATTGGCGA GTTGTATCGC CAGTTTGCGG TAACGCTTTC GGCGGCGGTC
ATTCTCTCCA GTATCAATGC GTTAACGCTG AGTCCGGCGC TGTGTGCCGT CCTGCTTAAG
CGGCGCACAC TCGCAACGAC GGGCATGTTC GGTACGATTA ACAAGGGGCT TGATCGCGCC
CGTGATGGTT ATGTTGGCTT AACGGGGCGG ATCAACCGCC GTGCGGTATT TAGTATCGCC
GCGCTGCTAC TGGTGGGGTT AGCAACATGG TGGGGTTATA GCCGACTGCC AACCTCGTTC
CTACCAGAGG AAGATCAGGG CTATTTCTTT GTCAGCTTAC AATTGCCAGA TGGTGCCTCA
CTGAACCGAA CCCAAACGGT GATGGACCAG ATGTATCAAC AGGTGAGCAC GAATGAGGCC
GTTGAGGATG TAATAAAAAT TACGGGGTTC AGCCTGCTCA GTGGCAACAA TGCGCCCAAT
GCCGGTTTTG CTATTGTGAT GCTAAAACCT TGGGGCCAGC GGCCGCATAT TGATCGGGTG
CTGGCCAGTA TTCAGGCCAA TCTGGCGGCT ATCCCCTCGG CAATGATTAT GGCGGTGAAC
CCGCCAGCGA TTGCTGGTTT GGGCAGTGCT TCGGGCTTTG ATTTGCGCAT TCAGGCACTG
CTAGGCCAAT CGCCGCAAGA GCTGGCGCAG GTGAGCCAAG GGATTATTTT TGCCGCCAAT
CAGGACCCGA CATTGAGCCG GGTCTTTACC ACCTTCAGTG CTTCGGTGCC TGAAACCAAT
TTGAGTATCG ATCGTGACCG CGCGGCCTTG TTACAAGTGC CGGTCAGCCG AATCTTCCAA
ACGCTGCAAA CCTCGCTGGG GGGGATGAAT GCCGGTGATT TCACCTTGAA TAACCGTATG
TTCCGGGTGC AATTACAAAA CGATATGAAT TTCCGCCAGC GCACGGCGCA GATCAATAAC
CTGAATGTAC GCAGTGATAA CGGGGCATTA GTCAGTTTAG CGAACCTGGT CACGTTGACC
CCGTCAGTGG GCGCGCCCTT TATCAGTAAT TTCAACCAGT TCCCTTCGGT TGCGATCAGC
GGTTCAGCGG CTGATGGGGC CAGCTCAGGT CAGGCGATGG CGGCAATGGA AGCGCTATTG
GCGCAGAATT TACCGCAAGG CTACAGCTAC AGTTGGAGCG GGATGTCATG GCAAGAGCAG
CAGACCGGTG GGCAAGTGGT GTTTATTTAT CTTGCGGCGC TGGTCTTCGC TTATCTGTTT
TTAGTCGCCC AGTATGAAAG CTGGAGTATC CCGCTGGTGG TGGTTCTCTC GGTGGTGTTT
GCCGTGGGGG GAGCGGTGGC GGGGCTATCG GCGATGGGGT TTGCCAACGA TGTGTATGCG
CAAATAGGCT TAGTGCTACT GATCGGGTTG GCGGCAAAGA ATGCGATTCT GATTGTCGAA
TTCTCCAAGG CGCGGCGAGA AGAGGGGGCG AGTATGCGGA GGCTGCACAG GACGGCGCTA
AACAGCGTTT CCGCGCCGTG A
 
Protein sequence
MLHFFIRRPK FAIVIALVIT LVGWVSLYVI PVEQYPDITP PVVSVSAVYP GASARDVAQA 
VASPLEAQVN GVSHMLYMES TSANNGSYQL SITFASGTDP DMAAVEVQNR ISQVSAQLPA
EVNENGISVR KRASNLLLGV SVFSPQQTHD ALFVSNYTSI QLRDAIARIS GVGDVQVFGA
RDYSMRVWLD PQRMESLNVS VQDIVAALQQ QNVQAAAGQI GSSPSMPNQQ QTLTISGQGR
LTDARQFADV IIRSNPQGGM IRLGDVARVA LGAQNYQVSA AQNQTESAFL VVYPVPGANA
LNVANGVRDE MARLSAAFPA DLTYEINYDS TLPVTATLHE IAVSLTLTLI VVLAVVYLFL
QSLRATFIVA LTVPVSLLGT FAVLYVFGYS ANTLSLFAII LALTIVVDDA IVVVENVERL
LSNDPHLSPA EATRQAMSQI AGPIIATTLV LMAVFVPIAI LPGIIGELYR QFAVTLSAAV
ILSSINALTL SPALCAVLLK RRTLATTGMF GTINKGLDRA RDGYVGLTGR INRRAVFSIA
ALLLVGLATW WGYSRLPTSF LPEEDQGYFF VSLQLPDGAS LNRTQTVMDQ MYQQVSTNEA
VEDVIKITGF SLLSGNNAPN AGFAIVMLKP WGQRPHIDRV LASIQANLAA IPSAMIMAVN
PPAIAGLGSA SGFDLRIQAL LGQSPQELAQ VSQGIIFAAN QDPTLSRVFT TFSASVPETN
LSIDRDRAAL LQVPVSRIFQ TLQTSLGGMN AGDFTLNNRM FRVQLQNDMN FRQRTAQINN
LNVRSDNGAL VSLANLVTLT PSVGAPFISN FNQFPSVAIS GSAADGASSG QAMAAMEALL
AQNLPQGYSY SWSGMSWQEQ QTGGQVVFIY LAALVFAYLF LVAQYESWSI PLVVVLSVVF
AVGGAVAGLS AMGFANDVYA QIGLVLLIGL AAKNAILIVE FSKARREEGA SMRRLHRTAL
NSVSAP