Gene Apre_0849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0849 
Symbol 
ID8397633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp923266 
End bp925482 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content41% 
IMG OID644995195 
Productphage tail tape measure protein, TP901 family 
Protein accessionYP_003152598 
Protein GI257066342 
COG category[S] Function unknown 
COG ID[COG5280] Phage-related minor tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000280434 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGAGACC AAAGAGAGCT TACCTGGAGG CTTGAAACTG ATACTGGTAA AGCGGTATCG 
GATATACAAG AAGTCGATAA ACAATTTGAT AAAGTAAAAG AAAGTATGGA GGATGTCGAT
AAGAAATCCT CCATTTTTGA TAAGATAGGC GGACCTATGA AGTCTGCTGG CTCTGCTGTA
TCTGCTTTTG GTGGAAAAGT AATGAATCTA GGCGGAGGAA TGATGAAGAC AGGAGCGAAG
GTTACGGCCT TTACGGCTCC TGTTTCTCTT GCTTTAAAAG AAGGAGTGCA AGGTGCCTTA
GAGCTTGATA CGGCGATAAG ACAGGTTACT ACCCTAGCAG ATGAAGACAT ACTACCTGTT
AGTAAAATCC AAGATGAAGT TAGAAGGATA TCAGATGCCT CTGGTATTGC ACAGACTGAA
ATTAGTAATT CTATGTATGA GGCCTTATCA TCTAGTGTAG GTAGTAGTGA TGTAGTAGGT
TTTGTAGACC AAGCGGTAAA GCTAACTAAG GCAGGATTTA CTGATATGCC TACTGTTATT
GATGCAACCA CTACTGCCAT GAATGCTTAT GGTCTGAGTG GACAAGAAGC AGTAGGCCAT
ATTCAAGATG TATTTGTTAA GACCCAAGAT CTAGGTAAGA TTACGGTTGA TGAATTAGGT
AAGTCTATAG GTAGGGTTAT ACCTTTTGCT TCGGCAGCTG GTGTATCTAT AGACCAATTA
GGGGCAGGGT ATTCGATACT AACTGCTAAA GGTCAAAATG CCCAAATAGC TACTACTAAC
CTAAGCTCTC TAATATCTGA ATTATCTACA AGCGGTACAA AGGCTGATAA AGCCTTGCGA
GAGAATTTAG GTGGTTCTTT CAAAGAGTTA ATGGAAAATG GCCAATCTAT GGGCGATGTA
TTGCAATCCC TACAGGGTGT AGCTGAAGAA AACGGTGAGT CCTTAGGGGA TATGTTTGGC
AACAAGATGG CCACGTCTGC TGCTAATGCT TTGATGTCTG ATGGTGCTGG AGCTTTTAAC
GATACTTTAA ACAAGATGGT TAATAGTGGT GGTGCTGTAG ATGCAAACTA CGAAAAGATG
ATAGGACCTG CGGAGAAACT ACAAAGAGCC CAGACTAAGC TTAAAAACTC TTTAATAGAG
TTAGGGGGAG CTTTAGCTCC AGTCATAGAG AAATTTTCAA ACGGACTATC TAAAATCACG
GATAAGTTTA ACTCCTTAAG CGATGAGACT AAAGGAAAAA TAGCAAAGAT AGCTGGAGCT
ATAGCTGTAG CTGGTCCTAT TATAGCGGCC GTTGGTGCTG CTTTTATGGT GGTAGGTGGA
GTTATAAAGA CTATAGGCTT AGCTATAATG CTATTAGCAA GTCCTATCGG TCTAGTTGCG
GCTGGTATAG CAGCTGTGGT TGCCGTTGGA TATCTTTTAT ATGATAATTG GGAGTTAATA
AAGCAGAAAG CAAGTGAAGT ATGGGACGGA ATATCAACTA AAATAAGTGA AGTAGCATTA
TCGGTAGCTA CTACTATAGG CGAATTTGTC GAGGGGATAA AACTCAAATT TGATGAATTT
GTAATGGCGG TAGGAGAGAA GTTTGAAACG GCCAAAGCCG TTATAATCGA GAAATTTACC
GCCATGAAAG AGTGGGTAGG TACTATAATC GATGGAATCA AACTTAAAAT TGATACATTT
GCTGAAGGTA TGGCAACCGC TATAAGTGGA GCTATAGAAA CGGTCAAAGG AATGTTTGAA
GGGTTAAGGT CTAAGGCAGT AGGTGCGATA GAAGGGATTA AGAGTGCGTG GAACGGTCTT
AAGAACCTAT TATCTAAGCC TATCAATGCG GTTGTAAATG TCGTCAAGAG TGGAGTTGGT
AAGATAAAAT CCTTGGCTGG ATTTGCAACT GGTTTGTACC GTGTACCATA TGATGAGTTC
CCAGCTATGC TCCATAAAGA CGAAATGGTT GTAAATGCTA GTGGGTCTGA ACAACTTAGG
GCTATGGGAG CGACTGAAAA AGGATTTAGT CAAACTCCAA CCAATACAGG CATGGGTGAT
GTTAGTAGTG GTGTGATTAA TAATACAGCA AGTACTAATA ATAGTTCATT TAGTCCGCAT
ATAACCGTTA ATTATTATGG CCAAGGAAAT GCTCAATCAG ACGGTAACGT TATAGCTGAT
ATAGTAGACG AAAGAATAAT GTCCCTATTT AATACAGCTA ATCTACAAAG GGGGTAA
 
Protein sequence
MGDQRELTWR LETDTGKAVS DIQEVDKQFD KVKESMEDVD KKSSIFDKIG GPMKSAGSAV 
SAFGGKVMNL GGGMMKTGAK VTAFTAPVSL ALKEGVQGAL ELDTAIRQVT TLADEDILPV
SKIQDEVRRI SDASGIAQTE ISNSMYEALS SSVGSSDVVG FVDQAVKLTK AGFTDMPTVI
DATTTAMNAY GLSGQEAVGH IQDVFVKTQD LGKITVDELG KSIGRVIPFA SAAGVSIDQL
GAGYSILTAK GQNAQIATTN LSSLISELST SGTKADKALR ENLGGSFKEL MENGQSMGDV
LQSLQGVAEE NGESLGDMFG NKMATSAANA LMSDGAGAFN DTLNKMVNSG GAVDANYEKM
IGPAEKLQRA QTKLKNSLIE LGGALAPVIE KFSNGLSKIT DKFNSLSDET KGKIAKIAGA
IAVAGPIIAA VGAAFMVVGG VIKTIGLAIM LLASPIGLVA AGIAAVVAVG YLLYDNWELI
KQKASEVWDG ISTKISEVAL SVATTIGEFV EGIKLKFDEF VMAVGEKFET AKAVIIEKFT
AMKEWVGTII DGIKLKIDTF AEGMATAISG AIETVKGMFE GLRSKAVGAI EGIKSAWNGL
KNLLSKPINA VVNVVKSGVG KIKSLAGFAT GLYRVPYDEF PAMLHKDEMV VNASGSEQLR
AMGATEKGFS QTPTNTGMGD VSSGVINNTA STNNSSFSPH ITVNYYGQGN AQSDGNVIAD
IVDERIMSLF NTANLQRG