Gene YPK_2003 details


Gene Information

Locus tag: YPK_2003
Symbol:
ID: 6088398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 2229281
End bp: 2230651
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 48%
IMG OID: 641597070
Product: HlyD family type I secretion membrane fusion protein
Protein accession: YP_001720743
Protein GI: 170024238
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID: [TIGR01843] type I secretion membrane fusion protein, HlyD family
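
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed numbers are consistent with it) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2229281
END_BP = 2230651
GENE_LENGTH_BP = 1371
PROTEIN_LENGTH_AA = 456


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))            # 1371
print(expected_protein_length(GENE_LENGTH_BP))  # 456
```

Both results agree with the Gene Length and Protein Length fields listed in the record.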


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.295152
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGCGT TTATTCAGCA CATAAAAAAC CGCTGGACGC GTTGTTTTAC CCCCGCGCGT 
ACCCGTGATG AATATGATTT TTTGCCTGCT TATTTGGAAA TTGTCGAGCG CCCGATTGCC
CCATTAGCCC GTCGTACAGC GTGGTTACTC ATACTGACAT TATTGCTGGT ATTGATATGG
GCGATTATTG GAAAACTTGA TATCCATGCC TCAGCCAGCG GCAAAGTGAT TGTTGCGGAA
CATTCCAAAA TTATCCAACC CGCAGAACCT GGAGTGGTGA CGGAAATTAA TGTCCGAGAT
GGGGATACCG TTGATGCCGG GCAGGTGCTT ATTGCATTAA ACCCTATTGG CATTGATGCC
GAAGTACGCA ATATCAATCA GCAACTACAG TATCGTCAGT TAGAAGCCGC TCGGTTAGTG
GCTCTCCTGG CTGATGAACC GCTAAAAAAT TTCAAGCCGC CTTATAACAG CCCTGAATCT
CTGGTGGTGG CATCCAGAGC ACTGTTAACA AAAGAATATG CCGAAGTGAC AGCTGAATTG
TCACGTCAGG ATAGCGAACT GGCGGTGAAT CAGGCGCATC TTCAGGCCGG AATTGAGCAC
AACTCTCATC AAAAAGCATT GCTGAAAAAT ATTCATCGGC GGCTACAAGC GTCGCGTACG
CTGGCTAAAT CTAAGTCAAT TGCCGAGGTC GAGCTTCTGG TACAGGAGCG GGAATGGTTA
ACGGCAACGG CTGAAGCCAA CCGTTTGCAG AGTGAACAAG ATATTTTACG GGCCAAAGCA
CACAGTTTGG CGCAAACCCG TATGCACTAT CTGGCGAAAA TAGACCGTGA TAATCGGGAG
CGATTGAATA AAACCCACGA AATTATTCAT CAGCTTCAGC AAGAGCAAAT CAAAATGATG
GATAAACAAC GCCAGCAAAC CCTAAGAGCT CCGGTGGCTG GCGTTATCCA GCAATTAGCG
GTTCATACCT TAGGGGGGGT GGTTACCACG GCGCAACCGT TAATGGTTCT GGTTCCGGAG
GATTATCAGT TGGAACTTGA TGTCATGATC CTCAATAAAG ATGTGGGGTT CGTCCTGCCA
GGCCAAGCGG TAGAAGTCAA AGTTGACAGT TTCCCTTATA CCCGCTTCGG TACGTTATCC
GGGGAAGTAA AACATGTTTC CCGTGATGCA ATGGAAGATC AACAACGGGG ACTGGTATTT
CCAGCGCGTA TCCGTTTACT CAGCGATACC CTGATGGTGG AGGGTAAACC GGTGCGGTTG
TCTGCGGGGA TGGCCATCAA TGCGGAAATT AAAACAGGTC GCCGTCGGGT CATTGATTAT
CTGCTGAGTC CATTGCAGCA ATATCAATCT GAAGCCATGA GGGAGCGCTA A
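
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any analysis. The sketch below is an illustration only: it joins the blocks and computes a GC fraction, using the first displayed line as sample input. The helper names are hypothetical, and how the page itself derives its GC content value is not stated.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a displayed sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGTCTGCGT TTATTCAGCA CATAAAAAAC CGCTGGACGC GTTGTTTTAC CCCCGCGCGT"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Passing the full set of sequence lines as one multi-line string works the same way, since `str.split()` discards all whitespace, including newlines.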
 
Protein sequence
MSAFIQHIKN RWTRCFTPAR TRDEYDFLPA YLEIVERPIA PLARRTAWLL ILTLLLVLIW 
AIIGKLDIHA SASGKVIVAE HSKIIQPAEP GVVTEINVRD GDTVDAGQVL IALNPIGIDA
EVRNINQQLQ YRQLEAARLV ALLADEPLKN FKPPYNSPES LVVASRALLT KEYAEVTAEL
SRQDSELAVN QAHLQAGIEH NSHQKALLKN IHRRLQASRT LAKSKSIAEV ELLVQEREWL
TATAEANRLQ SEQDILRAKA HSLAQTRMHY LAKIDRDNRE RLNKTHEIIH QLQQEQIKMM
DKQRQQTLRA PVAGVIQQLA VHTLGGVVTT AQPLMVLVPE DYQLELDVMI LNKDVGFVLP
GQAVEVKVDS FPYTRFGTLS GEVKHVSRDA MEDQQRGLVF PARIRLLSDT LMVEGKPVRL
SAGMAINAEI KTGRRRVIDY LLSPLQQYQS EAMRER
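
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows, assuming the standard codon assignments, which translation table 11 shares for internal codons. Table 11 additionally permits alternative start codons; those are not handled here, and this gene begins with ATG, so the simplification does not affect it. The `translate` helper is hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence shown above.
first_60 = "ATGTCTGCGT TTATTCAGCA CATAAAAAAC CGCTGGACGC GTTGTTTTAC CCCCGCGCGT"
print(translate(first_60))  # MSAFIQHIKNRWTRCFTPAR
```

The output matches the first 20 residues of the protein sequence listed above.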