Gene YPK_2350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_2350 
Symbol 
ID6088891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2582080 
End bp2584995 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content53% 
IMG OID641597412 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_001721080 
Protein GI170024575 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.5162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATA AGAACCTCCG TTTGCGGGTT TCCTTAAGTG CCATAGATAA AATCACCCGG 
CCATTTAAAT CTATGTTGGC CAGCAATAAA ACGCTGGCTG CATCCATCAA AACGACGAAA
GACCAGCTTA AGCAACTCAA TAGCCAGGCG GCCAAAATTG AGGGTTTTCG TCAGAATAAA
GCCGCTGTTG ATCGTGCCGC ACAGGCGTTG ACTGCCGCCC GCGATAAAGC GCGTCAACTC
GCCACTGAAT TAAAAAACAG CGCAGCGCCT ACAGCTAAGC AGGCGAGAGA GTTTAAGCGT
GCCAGTGAAG AGGCCGCAAA ACTCAAGCAA AAGTACAATG ACTTACGTAC CGCACTCCAC
ACCCAGCGTG CCGCCTTACA AAGCAGCGGC GTTGCCACTA ATCGACTGGG ACAAGCACAG
CGAACCCTTA AAGCCAGCAT CACCAGCACC ACCGCCGCGC TGGCCGCACA ACAACGCCGG
TTAGCGCAAC AAGCCCAACA ACAGCAACGC CTGAATGCCG CCCGCAATCG CTTTGATGCC
AGCAATCAGC GCAAAGCGGT AGCCGCCGGA TTAGGGTATA CCTCGCTTGC TACCGGCCGC
GCCATGGGTC GGGGGATAAC CAAAACGTTG GGTGTGGGTT ATGAATTTGA CGCGATGATG
AGCAAAACCC AGGCCGTTAC CCGCATTCCG GATAAAAACG CGGAGGATAT GCAGGCGATG
CGTCACCAGG CCCGTACCCT GCCACTCTCA TCCAAGTTTA CCGATCTGGA AGTGGCTGAA
GGCCAATACT TTCTTGGCCG CACTGGCTAT AGCCCGAAAC AGGTTATGGG GGCAATGCCC
GGTATGCTTA ACCTCGCCGC TGCCGGAGGG ATTGATCTTG CTACTACTGC CGATATTGCT
TCCAATATTC AAACCGCCGC GGGCATCCCT GCAGAAAAGA TGGACCATGT TGCGGATGTC
CTTACGGCGT TATTCACTCG AAATAACGTC GATATTCCTA TGTTGGGCGA GTCTCTTAAA
TATTCGGCGG GGATAGGTCG TCAATATGGT CAATCGTTAG AAACTACAGC CGCCGCTACC
GCAATTATGG GGAGTGCCGG TATTCAGGGC AGCCAAGCAG GTACAACGCT AAAATCGGTT
CTTTTAAGAA TCGGCACATC AAAAGCTGTT TCTGATTTAG GCGTAAAAAC AACCGATAAA
AACGGTAACA TGCGCGATTT GGTTGATATC CTAAAAGATA TTGATAAAAA AACGTCCCAA
ATGGGCAACA TCGAAAGTGG CGCTATTTTT GAAAAGATAG CAGGAAAATA TGCCGTTACC
GGATTTGGGG AGTTAATGCG AGCAACTTCC AGCGGCAAAT TGGAACAGAT GCGCGGTAAG
CCTGGTGAAT ATGATGGTGA AGCGGCGCGT GTAGCTTCAA CCATGCTGGA CAATATGAAG
GGCGATATGA CCATTCTCCA TGCCGCCATG GAGAATATCA GTGTTGAGTT ATTTGAGAAG
AATAACGACT GGTTACGTTC GGCGGCAAAA GGCATCAGTG AATTTATGCA CGGCGTGGCT
GAATTCCTTA AGGCCCATCC CGGCGTGAGT ACTGCGATTG TAAAAGTGGG TACCGTTGTC
GCCATTGCAA CCGCCGCATT CGGGGCGCTG GCGATTGCTG CCGTGGGTAT TTTAGGCCCC
TTCGCCCTGC TCCGTTTCAC TACCTCAGTG CTGGGGATCC GCTTATTGCC GCGCTTGTCG
TTGAGTCTGT TTCGACTGGC AAGTATCACC CCCATTACAG GTGCGCAAAT TGGCAACTTT
AGTCGCTCAC TGCTCGTGAT GTCTCAACAG GGCGGCCGCT CAGCCATCGC CAGTTTAAAA
GGGTTGGGTC AAGGTCTGGT GAACGTGGCC CGCTCGCCAG TGAAATCGGC CGTCAGTGGC
TTTACGTTAC TCGGTAATGG TATTAGCTGG CTGGCTAAAT CCCCGCTTAG GTTCCTGCGT
TTCGCGCTCG GTGGCCTGGG TAGCATGTTG GGTATCCTGA TCAGCCCGAT TGGGTTAATT
GCCGCAGCTA TCGTGGGTGC TGGCTTATTG ATTTACAAGT ACTGGCAACC GATTAAAGCG
TTCCTTGGGG GTGTGGTAGA GGGCTTTATG CAGGCCGCCG CACCGATTAA AGAGGCGCTT
AAACCGCTGG GGCCGGTGTT TGACTGGATT GGTGATGCAG TCAAAAACGT GTGGAACTGG
TTTAAAAAGT TACTGGAACC GGTGCAATCG ACCACGGCCG ATTTAAACCG CGCCGCTAAT
GCCGGTAAGG CCTTTGGTCA GTTTTTGGCT GACGGCATTG GACTGGCCAT GATACCGATA
AATGCGTTGA TCTCATCCAT TAAATGGGTA CTTGAAAAAC TGGATGAAGT AAAGCAACGC
TCTGACAAAA CCCAGGCACT GGCGCAGGCA AGCCCAGCCA CTGCCGCGGG CCCAGGTAAC
TACGGCGTGG CGTGGAAGCC AGCGCAAACG AAAAGCACCT ATATCGAAAG TAAATATACC
GGGGCATATG ATAACGGCGG TACCATCCCG CTGGGAAAAT TTGGTGTGGT AGGTGAATAT
GGCCCGGAAA TCATCAACGG CCCGGCACAG GTCACCAGCC GCCGCAACAC CGCCGCTATG
GCGGTTGCGG CTTCCATGCT ATTCAGTGGC TACCCGGCCA GCGCCGCGCC GCTCCATCCT
TACAGTTTAC CGGCGGCACA GTACCGCAGT AGCAACGGTC AGACAAATAA TCATCAGCAA
AATCAAACCA GCCATGCTGC GCCAGTTATC AATATTTACC CGACGCCGCA GCAGGATGCG
CAGGATATTG CCCGCGAGGT GGCCCGCCAA CTGGCCGCCC ACAACCGCAG GGAACAGAGC
AAATCAAACC GCAGTTATCA AGACCATGAC GACTAA
 
Protein sequence
MSDKNLRLRV SLSAIDKITR PFKSMLASNK TLAASIKTTK DQLKQLNSQA AKIEGFRQNK 
AAVDRAAQAL TAARDKARQL ATELKNSAAP TAKQAREFKR ASEEAAKLKQ KYNDLRTALH
TQRAALQSSG VATNRLGQAQ RTLKASITST TAALAAQQRR LAQQAQQQQR LNAARNRFDA
SNQRKAVAAG LGYTSLATGR AMGRGITKTL GVGYEFDAMM SKTQAVTRIP DKNAEDMQAM
RHQARTLPLS SKFTDLEVAE GQYFLGRTGY SPKQVMGAMP GMLNLAAAGG IDLATTADIA
SNIQTAAGIP AEKMDHVADV LTALFTRNNV DIPMLGESLK YSAGIGRQYG QSLETTAAAT
AIMGSAGIQG SQAGTTLKSV LLRIGTSKAV SDLGVKTTDK NGNMRDLVDI LKDIDKKTSQ
MGNIESGAIF EKIAGKYAVT GFGELMRATS SGKLEQMRGK PGEYDGEAAR VASTMLDNMK
GDMTILHAAM ENISVELFEK NNDWLRSAAK GISEFMHGVA EFLKAHPGVS TAIVKVGTVV
AIATAAFGAL AIAAVGILGP FALLRFTTSV LGIRLLPRLS LSLFRLASIT PITGAQIGNF
SRSLLVMSQQ GGRSAIASLK GLGQGLVNVA RSPVKSAVSG FTLLGNGISW LAKSPLRFLR
FALGGLGSML GILISPIGLI AAAIVGAGLL IYKYWQPIKA FLGGVVEGFM QAAAPIKEAL
KPLGPVFDWI GDAVKNVWNW FKKLLEPVQS TTADLNRAAN AGKAFGQFLA DGIGLAMIPI
NALISSIKWV LEKLDEVKQR SDKTQALAQA SPATAAGPGN YGVAWKPAQT KSTYIESKYT
GAYDNGGTIP LGKFGVVGEY GPEIINGPAQ VTSRRNTAAM AVAASMLFSG YPASAAPLHP
YSLPAAQYRS SNGQTNNHQQ NQTSHAAPVI NIYPTPQQDA QDIAREVARQ LAAHNRREQS
KSNRSYQDHD D