Gene YPK_1022 details

Gene Information

Locus tag: YPK_1022
Symbol:
ID: 6088290
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 1146925
End bp: 1148085
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 54%
IMG OID: 641596085
Product: major facilitator transporter
Protein accession: YP_001719776
Protein GI: 170023271
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein
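
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, but one consistent with the listed gene length) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1146925
end_bp = 1148085

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon per residue; the final codon is the stop and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1161 386
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.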


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGTTG CACTACTGGC GTTGGCGTTG TGTGCTTTTG CTATTGGTAC TACTGAATTT 
GTCATTATGG GGTTATTGCC CCAGGTAGCG GGTGATTTGC ACATATCGAT TCCAACTGCG
GGCTGGTTGA TCAGTGGTTA TGCGCTAGGC GTGGCAATTG GCGCGCCCAT CATGGCAGTG
CTGACCGCGA AATTACCGCG CAAGAAGACA CTGTTACTGT TAATGGTGAT TTTTATCATC
GGTAACCTGA TGTGTGCCTT GGCATACAGT TATGACTTCC TGATGTTCGC GCGAGTGATC
ACCGCGTTGT GTCATGGGGC CTTTTTTGGT ATCGGCGCGG TGGTCGCCGC AAATCTGGTG
GCACCAAACC GGCGGGCCTC GGCGGTGGCA CTGATGTTTA CGGGGCTGAC GCTGGCGAAT
GTATTGGGTG TCCCACTGGG GACCGCTCTG GGTCAGGCCT TTGGCTGGCG TTCGACATTT
TGGGTGGTAT CGGTCATCGG TTTGTTCTCG TTGGCAGCCC TGTATAGCAA GTTGCCCTCC
TCCAGCGAGG AAGCACCGAC TGAGCTTCGT AAGGAGATTG CCGCTTTGCG TGGCGGTGGA
ATTTGGCTCT CCTTACTGAT GACCGTATTT TTTGCCGCAG CCATGTTTGC GCTCTTTACC
TACATTGCCC CCATTTTGAC GGAGGTCACA CAGGTTTCTG AGCATGGCGT CAGTTGGACG
TTACTGCTAA TGGGGGTTGG CTTGACGCTC GGTAATATCG TCGGGGGCAG GCTAGCTGAC
TGGCGTTTAT CGGTCAGTTT AACCATGACA TTCTTGTTGA TCGCGGTATT TTCTGCCCTG
TTTAGTTGGA CCAGTTATTC ACTGTTGGCG GCGGAAGTGA CACTGTTTTT GTGGTCAGCC
GCCGCATTTT CTGCAGTGCC TGCGTTGCAA ATTAATGTCG TCGCTTATGG CAAGAAAGCC
CCTAATCTGG TGTCAACGCT GAATATTGCG GCCTTTAATG TGGGTAACGC CTTAGGGGCG
TGGGTCGGGG GGGTTGTGAT TGCCAAAGGG CTTGGTTTGA CGGCGGTGCC GCTGGCCGCC
GCGGCACTGG CGGTCATGGG GTTGTTGCTG TGTCTGTTTA CCTTTTCCCG CGCGCGTACT
ATTGGGAATA AAATGGCTTA G
 
Protein sequence
MPVALLALAL CAFAIGTTEF VIMGLLPQVA GDLHISIPTA GWLISGYALG VAIGAPIMAV 
LTAKLPRKKT LLLLMVIFII GNLMCALAYS YDFLMFARVI TALCHGAFFG IGAVVAANLV
APNRRASAVA LMFTGLTLAN VLGVPLGTAL GQAFGWRSTF WVVSVIGLFS LAALYSKLPS
SSEEAPTELR KEIAALRGGG IWLSLLMTVF FAAAMFALFT YIAPILTEVT QVSEHGVSWT
LLLMGVGLTL GNIVGGRLAD WRLSVSLTMT FLLIAVFSAL FSWTSYSLLA AEVTLFLWSA
AAFSAVPALQ INVVAYGKKA PNLVSTLNIA AFNVGNALGA WVGGVVIAKG LGLTAVPLAA
AALAVMGLLL CLFTFSRART IGNKMA
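
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own pipeline: `translate` and the table-building code are hypothetical helpers. It uses the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 differs mainly in its set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
# Standard NCBI codon layout: bases ordered T, C, A, G, one letter per codon.
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the coding sequence
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCCTGTTG CACTACTGGC GTTGGCGTTG TGTGCTTTTG CTATTGGTAC TACTGAATTT"
print(translate(first_line))  # MPVALLALALCAFAIGTTEF
```

The output matches the first 20 residues of the protein sequence. The gene's final codon, TAG, is a stop codon, which is why the 1161 bp gene yields 386 residues rather than 387.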