Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_2303 |
Symbol | |
ID | 6087337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | + |
Start bp | 2547984 |
End bp | 2550899 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641597365 |
Product | TP901 family phage tail tape measure protein |
Protein accession | YP_001721033 |
Protein GI | 170024528 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.841337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATA AGAACCTCCG TTTGCGGGTT TCCTTAAGTG CCATAGATAA AATCACCCGG CCATTTAAAT CTATGTTGGC CAGCAATAAA ACGCTGGCTG CATCCATCAA AACGACGAAA GACCAGCTTA AGCAACTCAA TGGCCAGGCG GCCAAAATTG AGGGTTTTCG TCAGAATAAA GCCGCTGTTG ATCGTGCCGC ACAGGCGTTG ACTGCCGCCC GCAATAAAGC GCGTCAACTC GCCACTGAAT TAAAAAACAG CGCAGCGCCT ACAGCTAAGC AGGCGAGAGA GTTTAAGCGT GCCAGTGAAG AGGCCGCAAA ACTCAAGCAA AAGTACAATG ACTTACGTAC CGCACTCCAC ACCCAGCGTG CCGCCTTACA AAGCAGCGGC GTTGCCACTA ATCGACTGGG ACAAGCACAG CGAACCCTTA AAGCCAGCAT CACCAGCACC ACCGCCGCGC TGGCCGCACA ACAACGCCGG TTAGCGCAAC AAGCCCAACA ACAGCAACGC CTGAATGCCG CCCGCAATCG CTTTGATGCC AGCAATCAGC GCAAAGCGGT AGCTGCCGGA TTAGGGTATA CCTCGCTTGC TACCGGCCGC GCCATGGGTC GGGGGATAAC CAAAACGTTG GGTGTGGGTT ATGAATTTGA CGCGATGATG AGTGGAACTC AGGCAGTAAC GCGCATTGCA GATAAGAATT CTCCCGACAT GCAAGCCATG CGCCAGCAGG CACGCACCCT TCCGCTCTCA TCCAAATTTA CCGATTTGGA AGTTGCGCAG GGTCAGTATT ATCTTGGCCG CACTGGCTAT AGCCCTAAAC AGGTTGTCGG GGCGATGCCC GGCCTGGTGA ATCTGGCAGC GGCGGGTGAT ATCGATCTCG GTATCACCGC CGATATTGCT TCCAATATTC AAACCGCGAT GCGTATTCCG GCGGAGAAAA TGGATCACGT TGCCGATGTG CTAACAGCAC TCTTCATCCG GAATAATGTC GATATCCCGA TGCTGGGTGA ATCGCTTAAG TACTCTGCAG GTGTGGGTCG CGAATACGGC CAGAGTTTTG AAACCGTCGC CGCTTCTACC GCCATGCTGG GGAGTGCGGG TATTCAAGGC AGTCAGGCCG GTACGGCTAT GCGCAGTATC TTGAGCCGCA TCGGTGGCTC TAAAACCGTG AAGGATTTGG GCGTCAAAAC TGCCGATAAA GACGGCAATA TGCGTGACCT GGTTGATATC CTCAAAGATA TCAATGAAAA AACCGGAAAG ATGGGTAACG TTAAGCGCGG TGCAATCTAT AAAAGCATTG CTGGCCAATA TGCCGTTACC GGCTTTGGCG TACTGATGAA CGCAGCCAGT AATGGTTCAC TGGAAAAAAT GCGCGGTAAG CCCGGCGAAT ATGATGGTGA AGCGGCTCGT GTTGCGGCTA TAAAGCTGGA CAATATGAAG GGCGATATGA CCATTCTCCA TGCCGCCATG GAGAATATCA GTGTTGAGTT ATTTGAGAAG AATAACGACT GGTTACGTTC GGCGGCAAAA GGCATCAGTG AATTTATGCA CGGCGTGGCT GAATTCCTTA AGGCCCATCC CGGCGTGAGT ACTGCGATTG TAAAAGTGGG TACCGTTGTC GCCATTGCAA CCGCCGCATT CGGGGCGCTG GCGATTGCTG CCGTGGGTAT TTTAGGCCCC TTCGCCCTGC TCCGTTTTAC TACCTCAGTG CTGGGGATCC GCTTATTGCC GCGCTTGTCG TTGAGTCTGT TTCGACTGGC AAGTATCACC CCCATTACAG GTGCGCAAAT TGGCAACTTT AGCCGCTCCC TGCTCGTGAT GTCTCAACAG GGCGGCCGCT CAGCCATCGC CAGTTTAAAA GGGTTGGGTC AAGGTCTGGT GAACGTGGCC CGCTCGCCAG TGAAATCGGC CGTCAGTGGC TTTACGTTAC TCGGTAATGG TATTAGCTGG CTGGCTAAAT CCCCGCTTAG GTTCCTGCGT TTCGCGCTCG GTGGCCTGGG TAGCATGTTG GGTATCCTGA TCAGCCCGAT TGGGTTAATT GCCGCGGCAA TCGTGGGTGC TGGCTTATTG ATTTACAAGT ACTGGCAACC GATTAAAGCG TTCCTTGGGG GTGTGGTAGA GGGCTTTATG CAGGCCGCCG CACCGATTAA AGAGGCGCTT AAACCGCTGG GGCCGGTGTT TGACTGGATT GGTGATGCAG TCAAAAACGT GTGGAACTGG TTTAAAAAGT TACTGGAACC GGTGCAATCA ACCACGGCCG ATTTAAACCG CGCCGCTAAT GCCGGTAAGG CCTTTGGTCA GTTTTTGGCT GACGGCATTG GACTGGCCAT GATACCGATA AATGCGTTGA TCTCATCCAT TAAATGGGTA CTTGAAAAAC TGGATGAAGT AAAGCAACGC TCTGACAAAA CCCAGGCACT GGCGCAGGCA AGCCCAGCCA CTGCCGCGGG CCCAGGTAAC TACGGCGTGG CGTGGAAGCC AGCGCAAACG AAAACCTCCT ATATCGAAAG TAAATATACC GGGGCATATG ATAACGGCGG CACCATCCCG CTGGGGAAAT TTGGTGTGGT AGGTGAATAT GGCCCGGAAA TCATCAACGG CCCGGCACAA GTCACCAGCC GCCGCAACAC CGCCGCTATG GCGGTTGCGG CTTCCATGCT ATTCAGTGGC TACCCGGCCA GCGCCGCGCC GCTCCATCCT TACAGTTTAC CGGCGGCACA GTACCGCAGT AGCAACGGTC AGACAAATAA TCATCAGCAA AATCAAACCA GCCATGCTGC GCCAGTTATC AATATTTACC CGACGCCGCA GCAGGATGCG CAGGATATTG CCCGCGAGGT GGCCCGCCAA CTGGCCGCCC ACAACAGCAG GGAACAGAGC AAATCAAACC GCAGTTATCA AGACCATGAC GACTAA
|
Protein sequence | MSDKNLRLRV SLSAIDKITR PFKSMLASNK TLAASIKTTK DQLKQLNGQA AKIEGFRQNK AAVDRAAQAL TAARNKARQL ATELKNSAAP TAKQAREFKR ASEEAAKLKQ KYNDLRTALH TQRAALQSSG VATNRLGQAQ RTLKASITST TAALAAQQRR LAQQAQQQQR LNAARNRFDA SNQRKAVAAG LGYTSLATGR AMGRGITKTL GVGYEFDAMM SGTQAVTRIA DKNSPDMQAM RQQARTLPLS SKFTDLEVAQ GQYYLGRTGY SPKQVVGAMP GLVNLAAAGD IDLGITADIA SNIQTAMRIP AEKMDHVADV LTALFIRNNV DIPMLGESLK YSAGVGREYG QSFETVAAST AMLGSAGIQG SQAGTAMRSI LSRIGGSKTV KDLGVKTADK DGNMRDLVDI LKDINEKTGK MGNVKRGAIY KSIAGQYAVT GFGVLMNAAS NGSLEKMRGK PGEYDGEAAR VAAIKLDNMK GDMTILHAAM ENISVELFEK NNDWLRSAAK GISEFMHGVA EFLKAHPGVS TAIVKVGTVV AIATAAFGAL AIAAVGILGP FALLRFTTSV LGIRLLPRLS LSLFRLASIT PITGAQIGNF SRSLLVMSQQ GGRSAIASLK GLGQGLVNVA RSPVKSAVSG FTLLGNGISW LAKSPLRFLR FALGGLGSML GILISPIGLI AAAIVGAGLL IYKYWQPIKA FLGGVVEGFM QAAAPIKEAL KPLGPVFDWI GDAVKNVWNW FKKLLEPVQS TTADLNRAAN AGKAFGQFLA DGIGLAMIPI NALISSIKWV LEKLDEVKQR SDKTQALAQA SPATAAGPGN YGVAWKPAQT KTSYIESKYT GAYDNGGTIP LGKFGVVGEY GPEIINGPAQ VTSRRNTAAM AVAASMLFSG YPASAAPLHP YSLPAAQYRS SNGQTNNHQQ NQTSHAAPVI NIYPTPQQDA QDIAREVARQ LAAHNSREQS KSNRSYQDHD D
|
| |