Gene YPK_0968 details


Gene Information

Locus tag: YPK_0968
Symbol:
ID: 6090888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 1085839
End bp: 1087074
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 48%
IMG OID: 641596031
Product: extracellular solute-binding protein
Protein accession: YP_001719722
Protein GI: 170023217
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
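
The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 1085839
END_BP = 1087074
GENE_LENGTH_BP = 1236
PROTEIN_LENGTH_AA = 411


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: 1087074 - 1085839 + 1 = 1236 bp, and 1236 / 3 - 1 = 411 aa.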


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.155665
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCA ATAAAATCAC TGCGCTTGTT TTAACCGCAC TGGCAGTCAC CCAGTTTGCT 
GGTTTCGCTG CCCACGCTGC TACACAGCAA TTAACAGTTT GGGAAGACAT CAAAAAATCT
GCCTGTATTA AAGAGGCTAT CGCTGATTTT GAAAAACAGC ATCAGGTTAA GGTCAATGTG
CTGGAAATGC CTTACGCACA ACAAATTGAA AAACTCCGCC TTGATGGCCC TGCAGGTATC
GGCCCTGATG TGTTGGTGAT TCCCAATGAT CAGTTAGGTG GTGCGGTAGT GCAAGGTTTG
CTGACACCGC TCAGCGTTGA TCCAACCATA GTCACTACTT TTACTAAACC TTCTATCGCT
GCCTTCACCA TGGATAATGC CCTCTACGGT TTACCGAAAG CCGTGGAAAC GCTGGTGATG
ATCTACAACA AAGACATGCT GCCAACGCCG TTAGCTACCT TGGATGAGTA CGCCGCATTC
TCTAAGAAAC AACGCGCAGA AAATAAATAT GGTCTGTTGG CGAAGTTCGA TCAGATCTAT
TACAGCTGGG GAGCGATTGA GCCAATGGGC GGTTACATCT TTGGTAAAGA TGCTAACGGT
AGCTTGAAGG CTAACGATAT CGGGCTAAAT ACGCCAGGGG CTGTTGAGGC CGTAACCTAT
TTGAAAACAT TCTATGCTAA CGGTCTGTTT CCAATTGGCA CCATCGGTGA TAACGGCTTG
AATGCTATTG ACTCATTATT CACTGAGAAA AAAGCGGCTG CGGTAATTAA CGGGCCATGG
GCATTCCAAC CGTATGAAGC CGCTGGTATT AACTTTGGTG TGTCACCACT GCCAGCATTA
CCGAACGGCA AAGATATGAG CTCCTTCCTC GGTGTGAAAG GGTATGTCGT TTCTACCTGG
AGCAAAGATA AGGCACTCGC CCAGCAGTTC ATCGAATTTA TTAACCAACC GCAATACGTG
AAAACCCGCT ATCAGGTCAC CAAAGAGATC CCCGCGTTGA CGGCCATGAT TGACGATCCA
TTGATTAAAA ATGATGAAAA AGCCAGTGCG GTAGCCATTC AGGCAAGCCG TGCCTCTGCG
ATGCCTGGTA TTCCAGAAAT GGGCGAAGTG TGGGGACCTG CGAACTCAGC ATTGGAGCTA
AGCGTAACGG GCAAACAGGA GCCTAAAGTC GCTCTCGATA ACGCCGTTAA GCAGATCAAT
ATGCAAATCG AGGCCATGCA GGCCAGTAAT CAGTAA
 
Protein sequence
MKINKITALV LTALAVTQFA GFAAHAATQQ LTVWEDIKKS ACIKEAIADF EKQHQVKVNV 
LEMPYAQQIE KLRLDGPAGI GPDVLVIPND QLGGAVVQGL LTPLSVDPTI VTTFTKPSIA
AFTMDNALYG LPKAVETLVM IYNKDMLPTP LATLDEYAAF SKKQRAENKY GLLAKFDQIY
YSWGAIEPMG GYIFGKDANG SLKANDIGLN TPGAVEAVTY LKTFYANGLF PIGTIGDNGL
NAIDSLFTEK KAAAVINGPW AFQPYEAAGI NFGVSPLPAL PNGKDMSSFL GVKGYVVSTW
SKDKALAQQF IEFINQPQYV KTRYQVTKEI PALTAMIDDP LIKNDEKASA VAIQASRASA
MPGIPEMGEV WGPANSALEL SVTGKQEPKV ALDNAVKQIN MQIEAMQASN Q
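
The relationship between the gene sequence and the protein sequence listed above can be illustrated with a minimal Python sketch. It is not the database's own pipeline. It assumes the space-separated 10-base blocks are purely cosmetic, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGAAAATCA ATAAAATCAC TGCGCTTGTT TTAACCGCAC TGGCAGTCAC CCAGTTTGCT"
print(translate(first_line))  # MKINKITALVLTALAVTQFA
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.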