Gene YPK_3040 details

Gene Information

Locus tag: YPK_3040
Symbol:
ID: 6090703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 3340062
End bp: 3341168
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 53%
IMG OID: 641598120
Product: xylose isomerase domain-containing protein
Protein accession: YP_001721766
Protein GI: 170025261
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1082] Sugar phosphate isomerases/epimerases
TIGRFAM ID:
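
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3340062
END_BP = 3341168
GENE_LENGTH_BP = 1107
PROTEIN_LENGTH_AA = 368


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values listed here, 3341168 - 3340062 + 1 gives 1107 bp, and 1107 / 3 - 1 gives 368 aa, matching the Gene Length and Protein Length fields.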


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGATC AAAGAACGGC GGTTGAACCA AAAACGGCGG TTGATCAGAC AACAGAGGTT 
ATTCGAAAAG TAGAAGTTGC TCAGAAAATA GAACGTGATC GGAAAGCAGA GGTTGTTCGA
AAAGCAGAAG TTAATCAGAA AATAGACATT GTTCGAAAAA CAGAGATAGT TCGAAAAACA
GAGACAGTGC GAAAAACAGA AGCTGATCGG CATACCACCG CGGTGCAGTT AGGCATTAAT
CCGCTGACAT GGACTAACGA CGATCTGCCT TCTCTGGGGG CGGAAACATC GCTTGAAACC
TGCCTGAGCG AAGGTAAAGA GGCGGGTTTT GCGGGCTTTG AACTGGGCAA TAAATTTTCA
CGCGAGGCGG GGGTTTTAGG GCCAATTCTA GAGCATCATC AACTGAAATT GGTCTCCGGG
TGGTATTCCG GGCGGCTGTT GGAACGCTCG GTTGAAGAGG AGATTGTGGC GGTTCGGTCA
CATCTGGCAC TGCTGCGTGA ACTGGGGGCT AAGGTGATGG TATTTGCTGA AGTGTCCGGT
TGTATTCATG GTGAACAGCA AACGCCAGTG CATCTGCGCC CAAGGTTTCC GCCAGCGCGT
TGGGCGGAGT ATGGCGCGAA GCTCACCGCC TTTGCCCGTT ACACGCAGGC ACTGGGGGTG
CAGATTGCTT ACCACCACCA TATGGGCACG GTGATCGAAA GCGCGCAAGA TATCGATAAT
CTGATGATTC ATACTGGCGA AGAGGTCGGG CTGCTACTCG ACACGGGCCA CCTCACCTTT
GCGGGGGCGG ATCCGCTGGC GGTGGCACAA CGCTGGATTG CGCGGATCAA CCATGTTCAT
TGCAAAGATG TGCGCACGTC GGTGCTGGCC GATGTCAAAA ACCGTAAAAC CAGTTTTCTG
GACGCGGTGT TGAGTGGCGT ATTTACCGTT CCGGGTGACG GCGGTGTGGA CTATCCACCC
ATCATGGCGT TACTCAAACA ACACGGTTAT CAGGGCTGGC TGGTGGTCGA GGCAGAGCAA
GATCCTAATG TCGCCCACCC AATGACCTAT GCGCGCATGG GCTACCACAA TTTAAGCCGT
CTGGCACATA ACGCTGGGCT GATTTAA
 
Protein sequence
MVDQRTAVEP KTAVDQTTEV IRKVEVAQKI ERDRKAEVVR KAEVNQKIDI VRKTEIVRKT 
ETVRKTEADR HTTAVQLGIN PLTWTNDDLP SLGAETSLET CLSEGKEAGF AGFELGNKFS
REAGVLGPIL EHHQLKLVSG WYSGRLLERS VEEEIVAVRS HLALLRELGA KVMVFAEVSG
CIHGEQQTPV HLRPRFPPAR WAEYGAKLTA FARYTQALGV QIAYHHHMGT VIESAQDIDN
LMIHTGEEVG LLLDTGHLTF AGADPLAVAQ RWIARINHVH CKDVRTSVLA DVKNRKTSFL
DAVLSGVFTV PGDGGVDYPP IMALLKQHGY QGWLVVEAEQ DPNVAHPMTY ARMGYHNLSR
LAHNAGLI
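
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The sketch below is a minimal, dependency-free way to strip the spacing, compute a GC percentage, and translate the gene sequence for comparison with the listed protein. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code. It does not handle alternative start codons or ambiguous bases. All function names are illustrative.

```python
# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First and last lines of the gene sequence in this record.
    print(translate("ATGGTGGATC AAAGAACGGC GGTTGAACCA AAAACGGCGG TTGATCAGAC AACAGAGGTT"))
    print(translate("CTGGCACATA ACGCTGGGCT GATTTAA"))
```

Applied to the first line of the gene sequence, this yields MVDQRTAVEPKTAVDQTTEV, and the last line yields LAHNAGLI, which agree with the start and end of the protein sequence shown above.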