Gene Information |
Locus tag | Apre_0849 |
Symbol | |
ID | 8397633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 923266 |
End bp | 925482 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644995195 |
Product | phage tail tape measure protein, TP901 family |
Protein accession | YP_003152598 |
Protein GI | 257066342 |
COG category | [S] Function unknown |
COG ID | [COG5280] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
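The coordinate and length fields above are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(923266, 925482)
print(length)                  # 2217
print(protein_length(length))  # 738
```

Both printed values match the Gene Length and Protein Length fields of this entry.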
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000280434 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGAGACC AAAGAGAGCT TACCTGGAGG CTTGAAACTG ATACTGGTAA AGCGGTATCG GATATACAAG AAGTCGATAA ACAATTTGAT AAAGTAAAAG AAAGTATGGA GGATGTCGAT AAGAAATCCT CCATTTTTGA TAAGATAGGC GGACCTATGA AGTCTGCTGG CTCTGCTGTA TCTGCTTTTG GTGGAAAAGT AATGAATCTA GGCGGAGGAA TGATGAAGAC AGGAGCGAAG GTTACGGCCT TTACGGCTCC TGTTTCTCTT GCTTTAAAAG AAGGAGTGCA AGGTGCCTTA GAGCTTGATA CGGCGATAAG ACAGGTTACT ACCCTAGCAG ATGAAGACAT ACTACCTGTT AGTAAAATCC AAGATGAAGT TAGAAGGATA TCAGATGCCT CTGGTATTGC ACAGACTGAA ATTAGTAATT CTATGTATGA GGCCTTATCA TCTAGTGTAG GTAGTAGTGA TGTAGTAGGT TTTGTAGACC AAGCGGTAAA GCTAACTAAG GCAGGATTTA CTGATATGCC TACTGTTATT GATGCAACCA CTACTGCCAT GAATGCTTAT GGTCTGAGTG GACAAGAAGC AGTAGGCCAT ATTCAAGATG TATTTGTTAA GACCCAAGAT CTAGGTAAGA TTACGGTTGA TGAATTAGGT AAGTCTATAG GTAGGGTTAT ACCTTTTGCT TCGGCAGCTG GTGTATCTAT AGACCAATTA GGGGCAGGGT ATTCGATACT AACTGCTAAA GGTCAAAATG CCCAAATAGC TACTACTAAC CTAAGCTCTC TAATATCTGA ATTATCTACA AGCGGTACAA AGGCTGATAA AGCCTTGCGA GAGAATTTAG GTGGTTCTTT CAAAGAGTTA ATGGAAAATG GCCAATCTAT GGGCGATGTA TTGCAATCCC TACAGGGTGT AGCTGAAGAA AACGGTGAGT CCTTAGGGGA TATGTTTGGC AACAAGATGG CCACGTCTGC TGCTAATGCT TTGATGTCTG ATGGTGCTGG AGCTTTTAAC GATACTTTAA ACAAGATGGT TAATAGTGGT GGTGCTGTAG ATGCAAACTA CGAAAAGATG ATAGGACCTG CGGAGAAACT ACAAAGAGCC CAGACTAAGC TTAAAAACTC TTTAATAGAG TTAGGGGGAG CTTTAGCTCC AGTCATAGAG AAATTTTCAA ACGGACTATC TAAAATCACG GATAAGTTTA ACTCCTTAAG CGATGAGACT AAAGGAAAAA TAGCAAAGAT AGCTGGAGCT ATAGCTGTAG CTGGTCCTAT TATAGCGGCC GTTGGTGCTG CTTTTATGGT GGTAGGTGGA GTTATAAAGA CTATAGGCTT AGCTATAATG CTATTAGCAA GTCCTATCGG TCTAGTTGCG GCTGGTATAG CAGCTGTGGT TGCCGTTGGA TATCTTTTAT ATGATAATTG GGAGTTAATA AAGCAGAAAG CAAGTGAAGT ATGGGACGGA ATATCAACTA AAATAAGTGA AGTAGCATTA TCGGTAGCTA CTACTATAGG CGAATTTGTC GAGGGGATAA AACTCAAATT TGATGAATTT GTAATGGCGG TAGGAGAGAA GTTTGAAACG GCCAAAGCCG TTATAATCGA GAAATTTACC GCCATGAAAG AGTGGGTAGG TACTATAATC GATGGAATCA AACTTAAAAT TGATACATTT GCTGAAGGTA TGGCAACCGC TATAAGTGGA GCTATAGAAA CGGTCAAAGG AATGTTTGAA GGGTTAAGGT CTAAGGCAGT AGGTGCGATA GAAGGGATTA AGAGTGCGTG GAACGGTCTT 
AAGAACCTAT TATCTAAGCC TATCAATGCG GTTGTAAATG TCGTCAAGAG TGGAGTTGGT AAGATAAAAT CCTTGGCTGG ATTTGCAACT GGTTTGTACC GTGTACCATA TGATGAGTTC CCAGCTATGC TCCATAAAGA CGAAATGGTT GTAAATGCTA GTGGGTCTGA ACAACTTAGG GCTATGGGAG CGACTGAAAA AGGATTTAGT CAAACTCCAA CCAATACAGG CATGGGTGAT GTTAGTAGTG GTGTGATTAA TAATACAGCA AGTACTAATA ATAGTTCATT TAGTCCGCAT ATAACCGTTA ATTATTATGG CCAAGGAAAT GCTCAATCAG ACGGTAACGT TATAGCTGAT ATAGTAGACG AAAGAATAAT GTCCCTATTT AATACAGCTA ATCTACAAAG GGGGTAA
|
Protein sequence | MGDQRELTWR LETDTGKAVS DIQEVDKQFD KVKESMEDVD KKSSIFDKIG GPMKSAGSAV SAFGGKVMNL GGGMMKTGAK VTAFTAPVSL ALKEGVQGAL ELDTAIRQVT TLADEDILPV SKIQDEVRRI SDASGIAQTE ISNSMYEALS SSVGSSDVVG FVDQAVKLTK AGFTDMPTVI DATTTAMNAY GLSGQEAVGH IQDVFVKTQD LGKITVDELG KSIGRVIPFA SAAGVSIDQL GAGYSILTAK GQNAQIATTN LSSLISELST SGTKADKALR ENLGGSFKEL MENGQSMGDV LQSLQGVAEE NGESLGDMFG NKMATSAANA LMSDGAGAFN DTLNKMVNSG GAVDANYEKM IGPAEKLQRA QTKLKNSLIE LGGALAPVIE KFSNGLSKIT DKFNSLSDET KGKIAKIAGA IAVAGPIIAA VGAAFMVVGG VIKTIGLAIM LLASPIGLVA AGIAAVVAVG YLLYDNWELI KQKASEVWDG ISTKISEVAL SVATTIGEFV EGIKLKFDEF VMAVGEKFET AKAVIIEKFT AMKEWVGTII DGIKLKIDTF AEGMATAISG AIETVKGMFE GLRSKAVGAI EGIKSAWNGL KNLLSKPINA VVNVVKSGVG KIKSLAGFAT GLYRVPYDEF PAMLHKDEMV VNASGSEQLR AMGATEKGFS QTPTNTGMGD VSSGVINNTA STNNSSFSPH ITVNYYGQGN AQSDGNVIAD IVDERIMSLF NTANLQRG
|
| |
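The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG can act as an initiation codon and is then read as methionine. The sketch below translates only the first 30 bases of the gene sequence to illustrate this. The codon table is deliberately partial (just the codons needed here), the start-codon set is a subset of those table 11 permits, and the function name is hypothetical.

```python
# Partial codon table: only the codons that occur in the 30-base prefix.
CODONS = {
    "GTG": "V", "GGA": "G", "GAC": "D", "CAA": "Q", "AGA": "R",
    "GAG": "E", "CTT": "L", "ACC": "T", "TGG": "W", "AGG": "R",
}

# Subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str, is_cds_start: bool = True) -> str:
    """Translate a short DNA prefix, reading a start codon at position 0 as M."""
    dna = dna.replace(" ", "").upper()
    residues = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


print(translate_prefix("GTGGGAGACC AAAGAGAGCT TACCTGGAGG"))  # MGDQRELTWR
```

The result matches the first ten residues of the protein sequence listed above. For full-length work, a complete codon table from an established library would be the safer choice.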