Gene YPK_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_2026 
Symbol 
ID6087466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2254591 
End bp2255760 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content49% 
IMG OID641597093 
Producttetratricopeptide repeat protein 
Protein accessionYP_001720766 
Protein GI170024261 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000487285 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGAAA TGCTGTTTCT GCTGTTGCCC GTTGCTGCCG CCTATGGTTG GTATATGGGG 
CGCAGAAGCG CTCAGCAGGA TAAGCAACAG GATGCTAACC GCTTGTCACG TGAATACGTG
GCTGGGGTTA ACTTCCTTCT CTCGAACCAG CAAGATAAAG CAGTCGATCT CTTCCTCGAA
ATGTTGAAAG AAGACAGTTC TACGGTCGAG GCTCATCTTA CTCTGGGTAA TCTGTTCCGC
TCACGCGGCG AAGTTGATCG CGCCATTCGC ATCCATCAAG CCTTGATGGA GAGTGCATCC
CTGACATTTG AACAACGGCT ATTGGCTGTT CAGCAACTCG GTCGGGATTA CATGGCCGCA
GGTTTATATG ATCGGGCAGA AGATATGTTC AACCAATTGG TTGAAGAGCA AGATTTTCGG
CTCGGCGCAT TACAACAATT ACTGATAATT CATCAGGCAA CCAGTGATTG GCATAATGCT
ATTGAAGTTG CGGAAAAATT GGTCAAGATG GGCAAAGATA ATCAGCGTCT GGAAATTGCG
CACTTCTATT GTGAACTTGC GTTGCAAGCA ATGGGCAGTG ACGATCTGGA TAAGGCCATG
GGGTTACTGA AAAAAGCGGC AACGGCGGAT AAACAATGTG CCCGAGTTTC TATTATGCGT
GGCCGTGTGC ATCTCGCTAA AGGTGAGTAT GCCAAAGGCG TTGAAGCGTT GGAGCGAGTA
CTGGAGCAAG ATAAAGAAGT CGTCAGTGAA GCATTACCGA TGCTCAGTGA ATGCTACCAA
CACCTGCAAC AGCCCCAGGC TTGGGCTAAT TTCCTTAAAC GTTGTGTGGA AGATAATACC
GGTGCGGCGG CTGAACTGAT GTTGGCGGAG GTCCTTGAAC AGCAGGAGGG CCATGATGTC
GCGCAAACGT ATATTAACCG CCAATTACAG CGCCACCCAA CGATGCGCGG TTTTTATCGT
CTAATGGATT ACCACTTAGC AGATGCAGAA GAAGGGCGTG CAAAAGAGAG TTTATTGCTA
TTACGGGACA TGGTTGGTGA GCAAATTCGG ACCAAACCAC GTTACCGTTG TCATAAGTGT
GGTTTCACCG CCCACTCACT CTATTGGCAT TGCCCGTCTT GCCGTGCCTG GGCATCAGTT
AAGCCGATCA GAGGATTAGA TGGGCAGTAG
 
Protein sequence
MLEMLFLLLP VAAAYGWYMG RRSAQQDKQQ DANRLSREYV AGVNFLLSNQ QDKAVDLFLE 
MLKEDSSTVE AHLTLGNLFR SRGEVDRAIR IHQALMESAS LTFEQRLLAV QQLGRDYMAA
GLYDRAEDMF NQLVEEQDFR LGALQQLLII HQATSDWHNA IEVAEKLVKM GKDNQRLEIA
HFYCELALQA MGSDDLDKAM GLLKKAATAD KQCARVSIMR GRVHLAKGEY AKGVEALERV
LEQDKEVVSE ALPMLSECYQ HLQQPQAWAN FLKRCVEDNT GAAAELMLAE VLEQQEGHDV
AQTYINRQLQ RHPTMRGFYR LMDYHLADAE EGRAKESLLL LRDMVGEQIR TKPRYRCHKC
GFTAHSLYWH CPSCRAWASV KPIRGLDGQ