Gene YPK_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_0473 
Symbol 
ID6089980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp518371 
End bp521229 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content49% 
IMG OID641595535 
ProductYD repeat-containing protein 
Protein accessionYP_001719229 
Protein GI170022724 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0589964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAATA CGTCACCCAC TGATCTGTGC GCCAACACCC CCACTCTCAC CATCCACGAT 
AACCGGGGGT TAGCTATTCG CACACTGGCT TATAACCGCC GCAGTCATAA TGAAACCGTT
GACGAACTGA TCAGCCGCAA CCGCTATAAC GCCTCCGGTC AGCTAATCGC CAGCCGCGAC
CCGCGCCTTG AGGTGGATAA TTTCCGCTAT CAATACAGCC TCAGCGGTGT TCCACTGCGT
ACCGACAGCG TCGATAGCGG CAGTACGCTG CAACTGGCAG ATAGTGCTGG CCGCACGGTG
CTCACGCGGG ATGCACACCA CACCCGCCGC TGGGTAGAGT ATGAGACCGG TGAGCACAGT
TTAGGCCGCC CGCTAAGTTA CCGCGAGCAA GCCAAAGGCG GCCTGAAAAC GGTTACTGAC
CGCTTTTTCT ATGGCGAAAA TAGCGAGCAG GATAAAGGCT GCAATCTGAA CGGCCAGTGT
GTCCGCCATT ACGACAGCGC CGGTTTACAG GCACTGATTA GCCAGTCAAT TATTGATGTG
CCGCTGCAAC AGCAGCGCCG TCTACTGACG GAGACTAAAG GCCCGGTTGA CTGGTTTGGC
GAGAAGGAGA ACTGGGGCGC TCGCCTGAGT GAATCGCCGT TTGTTAGCCA TTGCACCACC
AATGCTCTCG GCCAGTTAAT CACGCAAACC GATGCCAAAG GCCATATCCA GCGCATGGCC
TATAACCGTG CCGGACAACT TATCGGTTCG TGGCTAACGG TAAAAAATGG CGTGGAGCAG
CCCATTGTAT ACTCACTCAC CTATTCGGCT GCCGGGCAAA AACTGCGCGA AGAGAGCGGC
AACGGGGTCA TTACCGAATA CCGTTATGAA CCCCAGACTC AGCGCTTAAT CGGCATTAAA
ACCACCCGCC CGGCGAAGAA AGACCGCCCG ACCCGGTTAC AAGACCTGCG TTACGATTAT
GACCCGGTCG GGAATATTCT CGCCATCCAT AATGACGCCG AAGCCACCCG CTTCTACCGT
AATCAGAAAA TCGTGCCGGA AACCACTTAC CGCTACGATG CACTGTACCA GCTTATCGAG
GCCACTGGCC GTGAAGCCGA TACCAACGGC ATACAAAACA GCCAGTTGCC CACGCTGGCG
TCACTGAACG ACAGCAACCA GTTCGTCAAT TACACCCGCC ACTACCACTA TGACCGCGCC
GGTAACCTGC TAAAAATTCA GCACACAGGA GCCAGCCAAT ACAGTACCTA TATCACGGTG
TCCGATTCGT CCAATCATGG CATTCAGCAA CAAGATGGCA TCACCGCCCA TGATGTCCGC
TCCCAGTTTG ATACGGCGGG TAATCAGCGA CAACTGCAAC CCGGTCAACT CCTTCACTGG
AACAGCCGCA ATCAGTTACA GCAAGTGGAG CCCGTGCCCC GCAACGACGG CATCAGTGAC
AGCGAAAACT ATCTCTATGA TGGCAGCGGT AGCCGGGTGG TCAAAATCAG TCTCCATAAA
ACCCATAACG CCATCCAAAC CCGTTCAGTC ATTTATTTAG CGGGACTGGA ACTGCGTAGC
CAATATAATG GCAATAATCT GACAGAAGAT TTTCAGGTGA TGACCGTGGG TGCCGCGGGC
CGTGCTCAGG TACGGGTATT ACACTGGGAG CGCGGCCAAC CCGTTGATAT CGTCAATGAC
CAACTGCGTT ACAGTTTCGA TAATCACCTT GGCTCGGCGT TAATCGAATT AGACAGCGAC
GGTGATATTA TCAGCCAGGA AGAATATTAC CCATTTGGCG GCACCGCGGT GTTAGCCTCC
CGTAATACCG TGGAAGCTAA ATATAAAACC GTTCGTTACT CCGGTAAAGA GCGTGACACC
ACCGGACTGT ATTATTACGG TTACCGTTAT TACCAGCCGT GGCTGGGCCG ATGGTTAAGC
GCCGACCCCG CAGGCACTAT AGACGGGCTG AATTTATACC GAATGGTGAG GAATAACCCA
ATCAGATTGC GCGATAACAA TGGGCTGTTA ACCGAAGAGC AAATTGATAT GTACGTTAAT
TTGCTTAGTA ATATTGGATT AAAAAATGAT GATGAATTAA AGAGTGAACT ATTAAGATAT
GGTTTGAGCG AAGAAGAGCA AGACCAGATA TATTTAAATA TGCCAATGTT TACGCAGTCT
GGGTCATCAA GCTCATCAGC CACGTCCTTT TCTGAGAATA GTTCCAGTTC TGGGAGCACG
CAAAGTGCTG ATTCAGGTTA TCTCAGTCCA GTAAGAAACT ATCATTTTTT TGAAGATATT
AATTTAGTGA CAATGCACCG CCCCTATCCA ACAGAAAAAG TCTCTAGTGA AGAAATAATA
TCTGCTGCTG CAGAGCTAAA AGAAGCTAGC CCTATAGAAA TTCTAATTGG TTTGGATTTG
ACCAGTGAAA ACCTCCATTC ATATAAGTCA GCGCTTGCCG ATGAAGGAAT TGACTATATC
ACTAAAGAAA AATATGAAAT AAAGGATTTT TTTGAAGAAG GAGATTTATC GACTGAACAA
ATAGATCTGA CAGTAAATGA AATATTAACA TTACAAGAAA GTGCTATTGT TGGGGTTCAT
TGTGGCGCAG GTAATGGAAG AAGTGGAATT ATTGCATCAG CATTAGCCAT TAATAAGCTG
TATGCAACAA ATGAAGTAAG TAATTTTGAT GCAACTCATC CATTGGGAGG AACAATGTTT
GGGAACACAG ATACATATCA GGTTGATATT GTAACAGCCA GTGCGGTTGG GATTATCAGA
AAAACAAATC CTCAAGCAGT AGAGCGTAAT GAGGATGTTT CTGCCTTATA TGACTACTCC
TATTTATTAT ATACAAGGCA ACACTCAACA TCATTGTAA
 
Protein sequence
MPNTSPTDLC ANTPTLTIHD NRGLAIRTLA YNRRSHNETV DELISRNRYN ASGQLIASRD 
PRLEVDNFRY QYSLSGVPLR TDSVDSGSTL QLADSAGRTV LTRDAHHTRR WVEYETGEHS
LGRPLSYREQ AKGGLKTVTD RFFYGENSEQ DKGCNLNGQC VRHYDSAGLQ ALISQSIIDV
PLQQQRRLLT ETKGPVDWFG EKENWGARLS ESPFVSHCTT NALGQLITQT DAKGHIQRMA
YNRAGQLIGS WLTVKNGVEQ PIVYSLTYSA AGQKLREESG NGVITEYRYE PQTQRLIGIK
TTRPAKKDRP TRLQDLRYDY DPVGNILAIH NDAEATRFYR NQKIVPETTY RYDALYQLIE
ATGREADTNG IQNSQLPTLA SLNDSNQFVN YTRHYHYDRA GNLLKIQHTG ASQYSTYITV
SDSSNHGIQQ QDGITAHDVR SQFDTAGNQR QLQPGQLLHW NSRNQLQQVE PVPRNDGISD
SENYLYDGSG SRVVKISLHK THNAIQTRSV IYLAGLELRS QYNGNNLTED FQVMTVGAAG
RAQVRVLHWE RGQPVDIVND QLRYSFDNHL GSALIELDSD GDIISQEEYY PFGGTAVLAS
RNTVEAKYKT VRYSGKERDT TGLYYYGYRY YQPWLGRWLS ADPAGTIDGL NLYRMVRNNP
IRLRDNNGLL TEEQIDMYVN LLSNIGLKND DELKSELLRY GLSEEEQDQI YLNMPMFTQS
GSSSSSATSF SENSSSSGST QSADSGYLSP VRNYHFFEDI NLVTMHRPYP TEKVSSEEII
SAAAELKEAS PIEILIGLDL TSENLHSYKS ALADEGIDYI TKEKYEIKDF FEEGDLSTEQ
IDLTVNEILT LQESAIVGVH CGAGNGRSGI IASALAINKL YATNEVSNFD ATHPLGGTMF
GNTDTYQVDI VTASAVGIIR KTNPQAVERN EDVSALYDYS YLLYTRQHST SL