Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_2350 |
Symbol | |
ID | 6088891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | + |
Start bp | 2582080 |
End bp | 2584995 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641597412 |
Product | TP901 family phage tail tape measure protein |
Protein accession | YP_001721080 |
Protein GI | 170024575 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.5162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATA AGAACCTCCG TTTGCGGGTT TCCTTAAGTG CCATAGATAA AATCACCCGG CCATTTAAAT CTATGTTGGC CAGCAATAAA ACGCTGGCTG CATCCATCAA AACGACGAAA GACCAGCTTA AGCAACTCAA TAGCCAGGCG GCCAAAATTG AGGGTTTTCG TCAGAATAAA GCCGCTGTTG ATCGTGCCGC ACAGGCGTTG ACTGCCGCCC GCGATAAAGC GCGTCAACTC GCCACTGAAT TAAAAAACAG CGCAGCGCCT ACAGCTAAGC AGGCGAGAGA GTTTAAGCGT GCCAGTGAAG AGGCCGCAAA ACTCAAGCAA AAGTACAATG ACTTACGTAC CGCACTCCAC ACCCAGCGTG CCGCCTTACA AAGCAGCGGC GTTGCCACTA ATCGACTGGG ACAAGCACAG CGAACCCTTA AAGCCAGCAT CACCAGCACC ACCGCCGCGC TGGCCGCACA ACAACGCCGG TTAGCGCAAC AAGCCCAACA ACAGCAACGC CTGAATGCCG CCCGCAATCG CTTTGATGCC AGCAATCAGC GCAAAGCGGT AGCCGCCGGA TTAGGGTATA CCTCGCTTGC TACCGGCCGC GCCATGGGTC GGGGGATAAC CAAAACGTTG GGTGTGGGTT ATGAATTTGA CGCGATGATG AGCAAAACCC AGGCCGTTAC CCGCATTCCG GATAAAAACG CGGAGGATAT GCAGGCGATG CGTCACCAGG CCCGTACCCT GCCACTCTCA TCCAAGTTTA CCGATCTGGA AGTGGCTGAA GGCCAATACT TTCTTGGCCG CACTGGCTAT AGCCCGAAAC AGGTTATGGG GGCAATGCCC GGTATGCTTA ACCTCGCCGC TGCCGGAGGG ATTGATCTTG CTACTACTGC CGATATTGCT TCCAATATTC AAACCGCCGC GGGCATCCCT GCAGAAAAGA TGGACCATGT TGCGGATGTC CTTACGGCGT TATTCACTCG AAATAACGTC GATATTCCTA TGTTGGGCGA GTCTCTTAAA TATTCGGCGG GGATAGGTCG TCAATATGGT CAATCGTTAG AAACTACAGC CGCCGCTACC GCAATTATGG GGAGTGCCGG TATTCAGGGC AGCCAAGCAG GTACAACGCT AAAATCGGTT CTTTTAAGAA TCGGCACATC AAAAGCTGTT TCTGATTTAG GCGTAAAAAC AACCGATAAA AACGGTAACA TGCGCGATTT GGTTGATATC CTAAAAGATA TTGATAAAAA AACGTCCCAA ATGGGCAACA TCGAAAGTGG CGCTATTTTT GAAAAGATAG CAGGAAAATA TGCCGTTACC GGATTTGGGG AGTTAATGCG AGCAACTTCC AGCGGCAAAT TGGAACAGAT GCGCGGTAAG CCTGGTGAAT ATGATGGTGA AGCGGCGCGT GTAGCTTCAA CCATGCTGGA CAATATGAAG GGCGATATGA CCATTCTCCA TGCCGCCATG GAGAATATCA GTGTTGAGTT ATTTGAGAAG AATAACGACT GGTTACGTTC GGCGGCAAAA GGCATCAGTG AATTTATGCA CGGCGTGGCT GAATTCCTTA AGGCCCATCC CGGCGTGAGT ACTGCGATTG TAAAAGTGGG TACCGTTGTC GCCATTGCAA CCGCCGCATT CGGGGCGCTG GCGATTGCTG CCGTGGGTAT TTTAGGCCCC TTCGCCCTGC TCCGTTTCAC TACCTCAGTG CTGGGGATCC GCTTATTGCC GCGCTTGTCG TTGAGTCTGT TTCGACTGGC AAGTATCACC CCCATTACAG GTGCGCAAAT TGGCAACTTT AGTCGCTCAC TGCTCGTGAT GTCTCAACAG GGCGGCCGCT CAGCCATCGC CAGTTTAAAA GGGTTGGGTC AAGGTCTGGT GAACGTGGCC CGCTCGCCAG TGAAATCGGC CGTCAGTGGC TTTACGTTAC TCGGTAATGG TATTAGCTGG CTGGCTAAAT CCCCGCTTAG GTTCCTGCGT TTCGCGCTCG GTGGCCTGGG TAGCATGTTG GGTATCCTGA TCAGCCCGAT TGGGTTAATT GCCGCAGCTA TCGTGGGTGC TGGCTTATTG ATTTACAAGT ACTGGCAACC GATTAAAGCG TTCCTTGGGG GTGTGGTAGA GGGCTTTATG CAGGCCGCCG CACCGATTAA AGAGGCGCTT AAACCGCTGG GGCCGGTGTT TGACTGGATT GGTGATGCAG TCAAAAACGT GTGGAACTGG TTTAAAAAGT TACTGGAACC GGTGCAATCG ACCACGGCCG ATTTAAACCG CGCCGCTAAT GCCGGTAAGG CCTTTGGTCA GTTTTTGGCT GACGGCATTG GACTGGCCAT GATACCGATA AATGCGTTGA TCTCATCCAT TAAATGGGTA CTTGAAAAAC TGGATGAAGT AAAGCAACGC TCTGACAAAA CCCAGGCACT GGCGCAGGCA AGCCCAGCCA CTGCCGCGGG CCCAGGTAAC TACGGCGTGG CGTGGAAGCC AGCGCAAACG AAAAGCACCT ATATCGAAAG TAAATATACC GGGGCATATG ATAACGGCGG TACCATCCCG CTGGGAAAAT TTGGTGTGGT AGGTGAATAT GGCCCGGAAA TCATCAACGG CCCGGCACAG GTCACCAGCC GCCGCAACAC CGCCGCTATG GCGGTTGCGG CTTCCATGCT ATTCAGTGGC TACCCGGCCA GCGCCGCGCC GCTCCATCCT TACAGTTTAC CGGCGGCACA GTACCGCAGT AGCAACGGTC AGACAAATAA TCATCAGCAA AATCAAACCA GCCATGCTGC GCCAGTTATC AATATTTACC CGACGCCGCA GCAGGATGCG CAGGATATTG CCCGCGAGGT GGCCCGCCAA CTGGCCGCCC ACAACCGCAG GGAACAGAGC AAATCAAACC GCAGTTATCA AGACCATGAC GACTAA
|
Protein sequence | MSDKNLRLRV SLSAIDKITR PFKSMLASNK TLAASIKTTK DQLKQLNSQA AKIEGFRQNK AAVDRAAQAL TAARDKARQL ATELKNSAAP TAKQAREFKR ASEEAAKLKQ KYNDLRTALH TQRAALQSSG VATNRLGQAQ RTLKASITST TAALAAQQRR LAQQAQQQQR LNAARNRFDA SNQRKAVAAG LGYTSLATGR AMGRGITKTL GVGYEFDAMM SKTQAVTRIP DKNAEDMQAM RHQARTLPLS SKFTDLEVAE GQYFLGRTGY SPKQVMGAMP GMLNLAAAGG IDLATTADIA SNIQTAAGIP AEKMDHVADV LTALFTRNNV DIPMLGESLK YSAGIGRQYG QSLETTAAAT AIMGSAGIQG SQAGTTLKSV LLRIGTSKAV SDLGVKTTDK NGNMRDLVDI LKDIDKKTSQ MGNIESGAIF EKIAGKYAVT GFGELMRATS SGKLEQMRGK PGEYDGEAAR VASTMLDNMK GDMTILHAAM ENISVELFEK NNDWLRSAAK GISEFMHGVA EFLKAHPGVS TAIVKVGTVV AIATAAFGAL AIAAVGILGP FALLRFTTSV LGIRLLPRLS LSLFRLASIT PITGAQIGNF SRSLLVMSQQ GGRSAIASLK GLGQGLVNVA RSPVKSAVSG FTLLGNGISW LAKSPLRFLR FALGGLGSML GILISPIGLI AAAIVGAGLL IYKYWQPIKA FLGGVVEGFM QAAAPIKEAL KPLGPVFDWI GDAVKNVWNW FKKLLEPVQS TTADLNRAAN AGKAFGQFLA DGIGLAMIPI NALISSIKWV LEKLDEVKQR SDKTQALAQA SPATAAGPGN YGVAWKPAQT KSTYIESKYT GAYDNGGTIP LGKFGVVGEY GPEIINGPAQ VTSRRNTAAM AVAASMLFSG YPASAAPLHP YSLPAAQYRS SNGQTNNHQQ NQTSHAAPVI NIYPTPQQDA QDIAREVARQ LAAHNRREQS KSNRSYQDHD D
|
| |