Gene YPK_2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_2303 
Symbol 
ID6087337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2547984 
End bp2550899 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content54% 
IMG OID641597365 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_001721033 
Protein GI170024528 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.841337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATA AGAACCTCCG TTTGCGGGTT TCCTTAAGTG CCATAGATAA AATCACCCGG 
CCATTTAAAT CTATGTTGGC CAGCAATAAA ACGCTGGCTG CATCCATCAA AACGACGAAA
GACCAGCTTA AGCAACTCAA TGGCCAGGCG GCCAAAATTG AGGGTTTTCG TCAGAATAAA
GCCGCTGTTG ATCGTGCCGC ACAGGCGTTG ACTGCCGCCC GCAATAAAGC GCGTCAACTC
GCCACTGAAT TAAAAAACAG CGCAGCGCCT ACAGCTAAGC AGGCGAGAGA GTTTAAGCGT
GCCAGTGAAG AGGCCGCAAA ACTCAAGCAA AAGTACAATG ACTTACGTAC CGCACTCCAC
ACCCAGCGTG CCGCCTTACA AAGCAGCGGC GTTGCCACTA ATCGACTGGG ACAAGCACAG
CGAACCCTTA AAGCCAGCAT CACCAGCACC ACCGCCGCGC TGGCCGCACA ACAACGCCGG
TTAGCGCAAC AAGCCCAACA ACAGCAACGC CTGAATGCCG CCCGCAATCG CTTTGATGCC
AGCAATCAGC GCAAAGCGGT AGCTGCCGGA TTAGGGTATA CCTCGCTTGC TACCGGCCGC
GCCATGGGTC GGGGGATAAC CAAAACGTTG GGTGTGGGTT ATGAATTTGA CGCGATGATG
AGTGGAACTC AGGCAGTAAC GCGCATTGCA GATAAGAATT CTCCCGACAT GCAAGCCATG
CGCCAGCAGG CACGCACCCT TCCGCTCTCA TCCAAATTTA CCGATTTGGA AGTTGCGCAG
GGTCAGTATT ATCTTGGCCG CACTGGCTAT AGCCCTAAAC AGGTTGTCGG GGCGATGCCC
GGCCTGGTGA ATCTGGCAGC GGCGGGTGAT ATCGATCTCG GTATCACCGC CGATATTGCT
TCCAATATTC AAACCGCGAT GCGTATTCCG GCGGAGAAAA TGGATCACGT TGCCGATGTG
CTAACAGCAC TCTTCATCCG GAATAATGTC GATATCCCGA TGCTGGGTGA ATCGCTTAAG
TACTCTGCAG GTGTGGGTCG CGAATACGGC CAGAGTTTTG AAACCGTCGC CGCTTCTACC
GCCATGCTGG GGAGTGCGGG TATTCAAGGC AGTCAGGCCG GTACGGCTAT GCGCAGTATC
TTGAGCCGCA TCGGTGGCTC TAAAACCGTG AAGGATTTGG GCGTCAAAAC TGCCGATAAA
GACGGCAATA TGCGTGACCT GGTTGATATC CTCAAAGATA TCAATGAAAA AACCGGAAAG
ATGGGTAACG TTAAGCGCGG TGCAATCTAT AAAAGCATTG CTGGCCAATA TGCCGTTACC
GGCTTTGGCG TACTGATGAA CGCAGCCAGT AATGGTTCAC TGGAAAAAAT GCGCGGTAAG
CCCGGCGAAT ATGATGGTGA AGCGGCTCGT GTTGCGGCTA TAAAGCTGGA CAATATGAAG
GGCGATATGA CCATTCTCCA TGCCGCCATG GAGAATATCA GTGTTGAGTT ATTTGAGAAG
AATAACGACT GGTTACGTTC GGCGGCAAAA GGCATCAGTG AATTTATGCA CGGCGTGGCT
GAATTCCTTA AGGCCCATCC CGGCGTGAGT ACTGCGATTG TAAAAGTGGG TACCGTTGTC
GCCATTGCAA CCGCCGCATT CGGGGCGCTG GCGATTGCTG CCGTGGGTAT TTTAGGCCCC
TTCGCCCTGC TCCGTTTTAC TACCTCAGTG CTGGGGATCC GCTTATTGCC GCGCTTGTCG
TTGAGTCTGT TTCGACTGGC AAGTATCACC CCCATTACAG GTGCGCAAAT TGGCAACTTT
AGCCGCTCCC TGCTCGTGAT GTCTCAACAG GGCGGCCGCT CAGCCATCGC CAGTTTAAAA
GGGTTGGGTC AAGGTCTGGT GAACGTGGCC CGCTCGCCAG TGAAATCGGC CGTCAGTGGC
TTTACGTTAC TCGGTAATGG TATTAGCTGG CTGGCTAAAT CCCCGCTTAG GTTCCTGCGT
TTCGCGCTCG GTGGCCTGGG TAGCATGTTG GGTATCCTGA TCAGCCCGAT TGGGTTAATT
GCCGCGGCAA TCGTGGGTGC TGGCTTATTG ATTTACAAGT ACTGGCAACC GATTAAAGCG
TTCCTTGGGG GTGTGGTAGA GGGCTTTATG CAGGCCGCCG CACCGATTAA AGAGGCGCTT
AAACCGCTGG GGCCGGTGTT TGACTGGATT GGTGATGCAG TCAAAAACGT GTGGAACTGG
TTTAAAAAGT TACTGGAACC GGTGCAATCA ACCACGGCCG ATTTAAACCG CGCCGCTAAT
GCCGGTAAGG CCTTTGGTCA GTTTTTGGCT GACGGCATTG GACTGGCCAT GATACCGATA
AATGCGTTGA TCTCATCCAT TAAATGGGTA CTTGAAAAAC TGGATGAAGT AAAGCAACGC
TCTGACAAAA CCCAGGCACT GGCGCAGGCA AGCCCAGCCA CTGCCGCGGG CCCAGGTAAC
TACGGCGTGG CGTGGAAGCC AGCGCAAACG AAAACCTCCT ATATCGAAAG TAAATATACC
GGGGCATATG ATAACGGCGG CACCATCCCG CTGGGGAAAT TTGGTGTGGT AGGTGAATAT
GGCCCGGAAA TCATCAACGG CCCGGCACAA GTCACCAGCC GCCGCAACAC CGCCGCTATG
GCGGTTGCGG CTTCCATGCT ATTCAGTGGC TACCCGGCCA GCGCCGCGCC GCTCCATCCT
TACAGTTTAC CGGCGGCACA GTACCGCAGT AGCAACGGTC AGACAAATAA TCATCAGCAA
AATCAAACCA GCCATGCTGC GCCAGTTATC AATATTTACC CGACGCCGCA GCAGGATGCG
CAGGATATTG CCCGCGAGGT GGCCCGCCAA CTGGCCGCCC ACAACAGCAG GGAACAGAGC
AAATCAAACC GCAGTTATCA AGACCATGAC GACTAA
 
Protein sequence
MSDKNLRLRV SLSAIDKITR PFKSMLASNK TLAASIKTTK DQLKQLNGQA AKIEGFRQNK 
AAVDRAAQAL TAARNKARQL ATELKNSAAP TAKQAREFKR ASEEAAKLKQ KYNDLRTALH
TQRAALQSSG VATNRLGQAQ RTLKASITST TAALAAQQRR LAQQAQQQQR LNAARNRFDA
SNQRKAVAAG LGYTSLATGR AMGRGITKTL GVGYEFDAMM SGTQAVTRIA DKNSPDMQAM
RQQARTLPLS SKFTDLEVAQ GQYYLGRTGY SPKQVVGAMP GLVNLAAAGD IDLGITADIA
SNIQTAMRIP AEKMDHVADV LTALFIRNNV DIPMLGESLK YSAGVGREYG QSFETVAAST
AMLGSAGIQG SQAGTAMRSI LSRIGGSKTV KDLGVKTADK DGNMRDLVDI LKDINEKTGK
MGNVKRGAIY KSIAGQYAVT GFGVLMNAAS NGSLEKMRGK PGEYDGEAAR VAAIKLDNMK
GDMTILHAAM ENISVELFEK NNDWLRSAAK GISEFMHGVA EFLKAHPGVS TAIVKVGTVV
AIATAAFGAL AIAAVGILGP FALLRFTTSV LGIRLLPRLS LSLFRLASIT PITGAQIGNF
SRSLLVMSQQ GGRSAIASLK GLGQGLVNVA RSPVKSAVSG FTLLGNGISW LAKSPLRFLR
FALGGLGSML GILISPIGLI AAAIVGAGLL IYKYWQPIKA FLGGVVEGFM QAAAPIKEAL
KPLGPVFDWI GDAVKNVWNW FKKLLEPVQS TTADLNRAAN AGKAFGQFLA DGIGLAMIPI
NALISSIKWV LEKLDEVKQR SDKTQALAQA SPATAAGPGN YGVAWKPAQT KTSYIESKYT
GAYDNGGTIP LGKFGVVGEY GPEIINGPAQ VTSRRNTAAM AVAASMLFSG YPASAAPLHP
YSLPAAQYRS SNGQTNNHQQ NQTSHAAPVI NIYPTPQQDA QDIAREVARQ LAAHNSREQS
KSNRSYQDHD D