Gene YPK_2001 details


Gene Information

Locus tag: YPK_2001
Symbol:
ID: 6087684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 2225377
End bp: 2226879
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 50%
IMG OID: 641597068
Product: L-arabinose isomerase
Protein accession: YP_001720741
Protein GI: 170024236
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2160] L-arabinose isomerase
TIGRFAM ID:
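
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and a little arithmetic shows how. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one stop codon. Both are assumptions inferred from the numbers in this record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(2225377, 2226879)
print(gene_length)                           # 1503
print(expected_protein_length(gene_length))  # 500
```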


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGTAT TCAAGCAATC AGAAGTGTGG TTTGTCATTG GTAGCCAGAA TCTGTATGGC 
CCTAAAACCC TGCAACAAGT TATGGATAAT GCACATCAGG TGGTCAATAG CCTGAACAGC
GAAGCGGGTT TACCCGTAAA ACTGGTATTA AAACCGTTGG TTACAACACC GGATGAAATC
ACCGCATTAT GTCGTGAAGC TAACTACGAC ACGGCCTGTA TCGGTATCAT GACCTGGCTG
CACACCTTCT CTCCGGCCAA AATGTGGATT GGCGGCCTGA GCATTCTGAA TAAACCGCTG
TTACAGTTCC ATACCCAGTT TAATGCCCAA ATCCCGTGGG AAACGATGGA TATGGACTTT
ATGAACCTAA ACCAGACCGC ACACGGTGGC CGTGAATTTG GTTTCATTGG TGCCCGCATG
CGCCAGCAAC ACAGTGTGAT AACCGGTCAC TGGCAGGATA AAGAAGCCCA CCAGCGCATT
GGTCAGTGGA TGCGCGTCGC CGCCGCAAAA CAAGAAAGTC AACAACTGAA AGTGGCGCGC
TTTGGCGATA ACATGCGTGA AGTCGCCGTA ACTGAAGGGG ATAAAGTCGC TGCCCAGATC
CAATTTGGCT ATTCCGTTAA TGCTTATGGC ATTGGGGATT TAGTCGCCGT GGTCGATGCC
GTCAGTAAAG GTGATATCGA TACGCTGGTT GAAGAATATG AGGCCACCTA TCGCTTTAGC
GATGCGGTGA AACTCAATGG TGATAAGCGC GAAAACTTAC TGGATGCAGC ACGTATTGAG
CTAGGTATGA AGCGTTTTCT GGAGCAAGGT GGTTTTAAAG CCTTCACCAC TAACTTTGAA
AATCTTTATG GTTTGAAGCA GTTACCTGGC CTGGCAGTCC AGCGACTCAT GCAACAGGGT
TACGGTTTTG GTGGCGAAGG CGACTGGAAA ACCGCCGCAT TACTGCGCAT CTTAAAAGTG
ATGGGAACCG GCCTGAAAGG CGGCACTTCC TTTATGGAGG ATTACACTTA TAACTTCCAG
CCAGGTAATG ACTTAGTTGT TGGCTCACAT ATGCTGGAAG TCTGCCCGTC GATCGCCAAA
GAAGAGAAGC CCCTGCTGGA TGTGCAACAC CTTGGCATTG GAGGGAAAGC TGACCCTGCC
CGTTTGATTT TCTCTACCCC CGCAGGCCCG GCGCTGAATG CCAGTTTGAT CGATATGGGG
AACCGTTTCC GCTTGCTGGT TAATGTGGTT GATACCGTTG AACAACCTCA TCCATTGCCA
AAATTACCGG TTGCCCGGGC TATCTGGCAA GCACAACCTT CACTGGCAAC GGCTGCTGAA
GCTTGGATCA TCGCCGGTGG CGCACACCAT ACGGTATTCT CACAAGCGGT GGGTGTCGAT
GAACTGCGTT TATATGCCGA AATGCACGGT ATTGAATTCT TGTTGATCGA CAATGACACG
ACGTTACCGG CGTTCAAAAA CGAAATCCGT TGGAACGAGG TGTACTATCA GCTCAATCGC
TAA
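
The gene sequence is printed in space-separated blocks of ten bases, so any downstream calculation first has to strip the whitespace. As a minimal sketch, the hypothetical helper `gc_percent` below joins the blocks and computes the percentage of G and C bases. It is shown on the first row of the sequence only, so its result is not expected to match the whole-gene GC content reported in the record.

```python
def gc_percent(sequence_text: str) -> float:
    """Percentage of G and C bases in a sequence printed in whitespace-separated blocks."""
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGACGTAT TCAAGCAATC AGAAGTGTGG TTTGTCATTG GTAGCCAGAA TCTGTATGGC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence (all rows, line breaks included) to the same function gives the whole-gene value.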
 
Protein sequence
MDVFKQSEVW FVIGSQNLYG PKTLQQVMDN AHQVVNSLNS EAGLPVKLVL KPLVTTPDEI 
TALCREANYD TACIGIMTWL HTFSPAKMWI GGLSILNKPL LQFHTQFNAQ IPWETMDMDF
MNLNQTAHGG REFGFIGARM RQQHSVITGH WQDKEAHQRI GQWMRVAAAK QESQQLKVAR
FGDNMREVAV TEGDKVAAQI QFGYSVNAYG IGDLVAVVDA VSKGDIDTLV EEYEATYRFS
DAVKLNGDKR ENLLDAARIE LGMKRFLEQG GFKAFTTNFE NLYGLKQLPG LAVQRLMQQG
YGFGGEGDWK TAALLRILKV MGTGLKGGTS FMEDYTYNFQ PGNDLVVGSH MLEVCPSIAK
EEKPLLDVQH LGIGGKADPA RLIFSTPAGP ALNASLIDMG NRFRLLVNVV DTVEQPHPLP
KLPVARAIWQ AQPSLATAAE AWIIAGGAHH TVFSQAVGVD ELRLYAEMHG IEFLLIDNDT
TLPAFKNEIR WNEVYYQLNR
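
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds the codon table in plain Python rather than relying on a bioinformatics library. It uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 bases of the gene are translated here.

```python
from itertools import product

# Amino acids for all 64 codons, in NCBI order with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGGACGTAT TCAAGCAATC AGAAGTGTGG"))  # MDVFKQSEVW
```

The output matches the first block of the protein sequence above.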