Gene YPK_1798 details


Gene Information

Locus tag: YPK_1798
Symbol:
ID: 6088359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 1998052
End bp: 1999344
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 41%
IMG OID: 641596866
Product: extracellular solute-binding protein
Protein accession: YP_001720542
Protein GI: 170024037
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
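
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1998052
end_bp = 1999344

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1293
print(protein_length)  # 430
```

Under those assumptions the results agree with the listed Gene Length (1293 bp) and Protein Length (430 aa).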


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.763472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG CGATCCTACA CACGCTAATA GCATCCACTC TGGCTCTATT GTCACATCAA 
GCTTTTGCAG CACAGGACGA CGTCAATTTA CGGATGTCAT GGTGGGGAGG GAACGGCCGC
CACCAGGTAA CATTGAAAGC TCTGGAAGAA TTCCATAAAC AACATCCGAA TATTAATGTT
AAAGCGGAAT ATACCGGTTG GGATGGCCAT TTATCTCGTT TGACTACACA GATTGCAGGC
GGTACTGAAC CTGATGTCAT GCAGACTAAC TGGAACTGGC TGCCTATTTT TTCTAAAGAC
GGAACAGGAT TCTTCGATTT AAATAACGTA AAAGAACAAC TGGATTTGGC GCAATTCGAT
CCAATCGAAC TGCAACAGAC TACCGTTAAT GGCAAACTGA ATGGTATTCC GATCTCTGTT
ACTGCACGTA TATTCTATTT CAATGATGCG ACCTGGGCTA AAGCAGGTTT AGAATATCCA
AAGACATGGG ATGAGTTACT CAACGCAGGC CAAGTATTTA AAGAAAAATT GGGTGACCAA
TATTACCCTG TTGTATTAGA GCATCAAGAT ACTTTAGCGT TAATCCGTTC TTATATGACA
CAGAAATACA ATATTCCTAC CGTTGATGAA GCAAATAAGA AATTTGCCTA CACACCAGAG
CAGTGGGTTG AGTTCTTTAC CATGTACAAA AAGATGGTAG ATAGCCATGT TATGCCATCT
TCCAAATACT ATGCTTCATT CGGTAAGAGT AATATGTACG AAATGAAGCC GTGGATTAAT
GGTGAGTGGG CAGGAACTTA TATGTGGAAC TCAACCATTA CTAAATACTC TGATAACCTG
ACCAAACCTG CAAAGTTAGA TCTGGGCCCA TATCCAATGT TGCCTGATGC AAAAGATGCT
GGCTTATTCT TTAAACCTGC ACAAATGCTC TCAATTGGTA AATCAACTAA GCATCCTAAA
GAAAGTGCGA TGCTAATTAA CTTCCTGCTA AACAGCAAAG AAGGTGTTGA TGCATTAGGC
CTTGAACGCG GTGTACCGCT GAGCGCAGCA GCTGTAGCCC AATTGCGTGC CAATGGTGTG
ATTAAAGATG AAGATCCTTC TGTTGCCGGT TTGAATATGG CATTGGAATT ACCACATGAA
ATGAAGACCT CACCGTACTT CGATGATCCA CAAATTGTTT CACTGTTCGG TGATGCAATT
CAATATATAG ACTATGGTCA GAAAAGCGTA GAAGAAACAG CAGAATACTT TAATAAGCAA
GGTGATCGTA TTCTTAAACG TGCAATGCGT TAA
 
Protein sequence
MKKAILHTLI ASTLALLSHQ AFAAQDDVNL RMSWWGGNGR HQVTLKALEE FHKQHPNINV 
KAEYTGWDGH LSRLTTQIAG GTEPDVMQTN WNWLPIFSKD GTGFFDLNNV KEQLDLAQFD
PIELQQTTVN GKLNGIPISV TARIFYFNDA TWAKAGLEYP KTWDELLNAG QVFKEKLGDQ
YYPVVLEHQD TLALIRSYMT QKYNIPTVDE ANKKFAYTPE QWVEFFTMYK KMVDSHVMPS
SKYYASFGKS NMYEMKPWIN GEWAGTYMWN STITKYSDNL TKPAKLDLGP YPMLPDAKDA
GLFFKPAQML SIGKSTKHPK ESAMLINFLL NSKEGVDALG LERGVPLSAA AVAQLRANGV
IKDEDPSVAG LNMALELPHE MKTSPYFDDP QIVSLFGDAI QYIDYGQKSV EETAEYFNKQ
GDRILKRAMR
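
The gene and protein sequences above are related through translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code (the tables differ in which start codons are permitted). The following is a minimal sketch, not the database's own tooling, of how the listed gene sequence could be translated and its GC content computed. The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGAAAAAAG CGATCCTACA CACGCTAATA"
print(translate(first_groups))  # MKKAILHTLI
```

Pasting the full gene sequence block into `translate` should, under the stated assumption about table 11, reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field.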