Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PC1_3196 |
Symbol | |
ID | 8134163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pectobacterium carotovorum subsp. carotovorum PC1 |
Kingdom | Bacteria |
Replicon accession | NC_012917 |
Strand | + |
Start bp | 3603740 |
End bp | 3606619 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644866491 |
Product | phage tail tape measure protein, TP901 family |
Protein accession | YP_003018755 |
Protein GI | 253689565 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.112717 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAACA CTCTACAGTT AAGCGTTTTG CTGAAAGCCG TGGACAGGGC GACCCGCCCG TTTAAGGCGG TGCAAACCGA AAGTAAAAAG CTGTCGGGCG ATATCCGCGA TTCACAGACC CAGCTCAAAG ACCTGAACAC GCAGGCCGGG CGTATCGACG GTTTCCGCAA GGCAAGAAGC CAGCTCAGCG AAACAGGCGC AGCACTCCAG CAGGCGCAGG CGAAAGCCGC CGAACTGTCG GCAGCACTGC GCAATAGCGA AAATCCCACC AAACGGCAGG CGCAGGCGCT GGAGCGGGCG AAACGTCAGG CCGCTACGCT AAAAACGGAG TATGGCCAAC TGCGCCAGTC GGTACAGCGC CAGCGTACCG AGTTAGAGCA GGCAGGCATC AGCACGCGCA ACCTGTCAGG CGCGGAGCGT AAATTACGCA CGAATATTAC TCAGACAACG ACGCAGCTTG ACCAGCAACG CGCCGCGCTT TCCCGCGTCA GTCAGCAACA GGAAAAACTG AACGCCGTCA GACAGCGCTA TGAGAAAGGC AAGGAAATTA CCGCCGGGGT GCGTAATACC AGTGCGGCCG CGTTTGGCCT CGGTTCCGCT GCGCTGTATG CCGAAAGCCG CCTGATTGCA CCGTCAGTGC AGGCCGACGG ACACGGGGCG CGCATCGCCG CACAGACAGG CGGAAATGCT GCCGACGGCG AACAGTACAC CCGCGTTATC AAAGATATTA ACGCTTCGGG TGTGAGTAAC GATCTCACTC AGATAGCGGA CGCGGTGGCC GCTGTACGCA GCACATTGGG GGCGATGGGG GATGTAGGCG AAACCGAGCT GGCGCGCATA TCGCGTAAGG CGCTGGACAT ACAAACGGCG CTCGGCGGCG ATGCAACCGA GAGTATCCAG ATAGCCGCTA TCATGATGAA AAACGGCCTG GCGAAGAACA GCGACGAGGC GTTTGACTTG ATGGTGTCCG GGATGCAGCG CGTATCTGCA CAGATGCGCG GCGAACTGCC GGAAATCCTG CACGAATATT CGACCCACTT CCGCAACATG GGATTCAGCG GATCGGAAGC CATGACACTA TTAGTGGACA TGGCGCAGCA AGGCAAGTTT GCGCTGGACA AGACAGGCGA CGCGGTGAAG GAATTCTCAA TCCGTGGGTC GGACATGTCC AAAGCCAGCA TTGAAGCCTA TGACGCTGCC GGACTCAATG CCGCCAAAAT GTCTACCGCC ATTGCCAGCG GCGGCGATAA GGCGCGGGCG GCAATGCAGA AAACCGCCAA CGGGCTGCTG AAAATCAAAG ACCCGGCAGA ACGGGCAAAC GCGGCCATTG CCCTGTTTGG TACGCCGATT GAAGACCTGT CGATTGACCA GATACCGAAA TTTCTGGCCG CGCTGGCCGG AGCCGAAAAC AAGCTGGGTG ATGTGTCCGG GGCGGCTGAC CGCATGGGTG ATACCCTGCG CGATAACCTC GAAGGGGATA TCGGGCGGCT ACAGGGCGCG ATGTCCAGCC TGCGCTTTAA CCTGTTCAAT GACGATGACG GCGCGCTGCG CAAACTGACG CAGGCCGCGA CGGAATGGTT AACCCACGTC AATGAATGGG TCAAGGCTAA CCCGGAGCTG ACGCGGCAGA TAGTCATGGT AGGCGGAGCC GCCACGGCGT TAATTACGGC GCTGGGCGGG ATCGGGCTGG TTGCGTGGCC TGTTATGAGC GGGATTAATA CATTAGTCGG CGGAGCGGGT TTACTGAGTG CCGGATTCGG TAAGGTCGGT GGAAAGGGAT TACCCAAGTT ATCTATGGGA TTAAACCGCC TCGGCGGCAT GATCGGCTGG CTGGCAAAGT CGCCGCTGAT GCTGCTACGT GCCGGAGCCT CGGCGCTGAC CTCGGTATTC GGGGCGGTCA GTAACCCGCT GACCATTATC CGGGGCGCGA TGTCAGGGTT TGGCCGGGTG CTGATGTGGC TGTTTACCTC ACCGCTGGCA CTGCTGCGCA CAGGCATTAC GCTGGTTGGC AGTGCGTTAG GCGTCCTGCT GTCTCCCGTC GGGCTGGCCG TCGCGGCGAT TGTTGGCGGC GCGCTGCTTA TCTGGAAATA CTGGGAGCCC ATCAAAGCCT TTATCGGCGG CGTGGTGGAG GGGTTTGTTG CAGCCAGTGC GCCGATTATT GCGGCGTTTG AGCCGCTTCA GCCTGTCTTT ACGTGGATAG GCGACAAAAT AAAGGCGCTG TTTGGCTGGT TTGGTGACCT GCTGACGCCC GTCAAATCCA CTGCCGCCGA GCTGGACAGT GCGGCCAGTA TGGGGAAACG CTTTGGTGAG GCGCTGGCTA GCGGGCTGAA TATCATCATG AACCCGCTGG AGTCGCTGAA AAAAGGCGTG TCGTGGCTGC TGGAAAAGCT GGGGGTGGTT GACGATAAAT CGAAAAAGCT GCCGACGGCT GAAAATATTA TACCGCCGAA GGACGCTGCC GCGCTTAAGG CGGGGGTCAG CACGGCCCCG GTTCCGCAAG GCAACGACGC AAAGGCAATT GCAGCCCGAT ACAGTGGTGC GTATGACAAC GGCGGACGCA TCCCGCTGGG GAAATTTGCT GTTGTTGGTG AACATGGCCC GGAAATCGTT GAGGGTCCGG TCAATGTCAC CAGCCGCCAG AACACCGCCG CTATGGCGTC CGCTGCCATG AACATGTCAG CCTATCGCCC GATAGCGCCA ACGGTGCAGG CCAGCGCGGC ATCATCCCCG GTCAGCATTC ACGCGCCCAT CAGTATTGTT GCCCAGCCCG GCCAGAGTGC GCAGGACATC GCGCAGGAAG TCACACGCCA GCTTGAGCAG CGGGAACGGG CGGCACGGTC ACGCGCATTC AGTCAGTACA GTTATCAGGG AGGCGAATAA
|
Protein sequence | MSNTLQLSVL LKAVDRATRP FKAVQTESKK LSGDIRDSQT QLKDLNTQAG RIDGFRKARS QLSETGAALQ QAQAKAAELS AALRNSENPT KRQAQALERA KRQAATLKTE YGQLRQSVQR QRTELEQAGI STRNLSGAER KLRTNITQTT TQLDQQRAAL SRVSQQQEKL NAVRQRYEKG KEITAGVRNT SAAAFGLGSA ALYAESRLIA PSVQADGHGA RIAAQTGGNA ADGEQYTRVI KDINASGVSN DLTQIADAVA AVRSTLGAMG DVGETELARI SRKALDIQTA LGGDATESIQ IAAIMMKNGL AKNSDEAFDL MVSGMQRVSA QMRGELPEIL HEYSTHFRNM GFSGSEAMTL LVDMAQQGKF ALDKTGDAVK EFSIRGSDMS KASIEAYDAA GLNAAKMSTA IASGGDKARA AMQKTANGLL KIKDPAERAN AAIALFGTPI EDLSIDQIPK FLAALAGAEN KLGDVSGAAD RMGDTLRDNL EGDIGRLQGA MSSLRFNLFN DDDGALRKLT QAATEWLTHV NEWVKANPEL TRQIVMVGGA ATALITALGG IGLVAWPVMS GINTLVGGAG LLSAGFGKVG GKGLPKLSMG LNRLGGMIGW LAKSPLMLLR AGASALTSVF GAVSNPLTII RGAMSGFGRV LMWLFTSPLA LLRTGITLVG SALGVLLSPV GLAVAAIVGG ALLIWKYWEP IKAFIGGVVE GFVAASAPII AAFEPLQPVF TWIGDKIKAL FGWFGDLLTP VKSTAAELDS AASMGKRFGE ALASGLNIIM NPLESLKKGV SWLLEKLGVV DDKSKKLPTA ENIIPPKDAA ALKAGVSTAP VPQGNDAKAI AARYSGAYDN GGRIPLGKFA VVGEHGPEIV EGPVNVTSRQ NTAAMASAAM NMSAYRPIAP TVQASAASSP VSIHAPISIV AQPGQSAQDI AQEVTRQLEQ RERAARSRAF SQYSYQGGE
|
| |