Gene Apre_0834 details

Gene Information

Locus tag: Apre_0834
Symbol:
ID: 8397618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 913728
End bp: 914966
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 36%
IMG OID: 644995180
Product: phage terminase, large subunit, PBSX family
Protein accession: YP_003152583
Protein GI: 257066327
COG category:
COG ID:
TIGRFAM ID: [TIGR01547] phage terminase, large subunit, PBSX family
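
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the reported 1239 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(913728, 914966)
print(length)                     # 1239
print(protein_length_aa(length))  # 412
```

Both results agree with the Gene Length and Protein Length fields of this record.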


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00567279
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAATA TCGATAAAGT TTATACTAAA AAACAACAAG AAATATATAG GGATGTAGGT 
TCTAAAGATT GGTTTATACT AATTCTCCAT GGAGCTAAGA GATCTGGTAA GACTCAATTA
AATAATGATT TGTTTCTTAG GGAGCTTATC AGAGTTAAAA AGATTGCTAA TATGGAGAAG
GTGGATAAGC CTCAATATAT TTTAGCTGGC TTTTCTAAGT CTACTATTTA TCAAAACGTG
CTTATAGAAT TGTCTTCTAA GTATGGGATA GATTTCAAAT TCGATAAGCT AGGCAATTTT
ACTATGCTAG GTGTTTATGT AGTCCAAGTT GGCCATGGTA AGATTGACGG ATTAGGTCGT
ATACGTGGTA TGACTTCATA CGGTGCTTAT GTGAATGAGG CCTCACTTGC TAATGAGCTG
GTTTTTGATG AAATAAGGTC TAGATGTTCA GGTAAGGGGG CAAGAATTAT TTGTGATACT
AACCCCGACA ATCCCGAGCA TTGGCTGAAA AAAGAGTACA TCGATAATCC AACGGATAGG
ATTTTATCTT ATAAGTTTAC TATTTTCGAC AATACCTTTC TTGATAAGAG ATATTTACAA
TCGACTATTG ACACTACACC TGATGGGATG TTTACCGAGC GAAACATATA CGGGAATTGG
GTTAGTGGCG AAGGTGTTGT ATATAAAGGC TTTGACCCTA AGAGACATTA TGTCAAGAGC
TTAGATGGTA TTAGATTTAG TAGTTACATA GCTGGTGTGG ACTGGGGATA TGGACACTAC
GGATCTATTG TAGTATTTGG CATATCTAAT GACGGTAAGT ATTACATGAT AGAAGAGCAT
GCTGAGCAGT ATCAAGAAAT AGATTTTTGG GTAGCTGTAG CGAAAGATAT AGCTAATAGA
TATAAGGGCA TAGTCTTTTA TTGTGATTCG GCAAGGGTGG AACATATAGA CAGGTTTAGT
CGTGAGGGAC TTGTTGCATA TATGGCAGAC AAGGCGGTTA TTCCTGGAAT AGAGGCGGTG
TCTATTTTAT ATAAGACGGA CAAGCTTTTT ATATATGAGT ATATAGCCAA GAGGTTTAAG
GAAGAGATTT ATTCTTATGT ATGGGCTACT AATTACAGGT CAGATGAAGT CAAGAAGGAG
TTTGACGATG TAATGGACTC CATGAGATAT GCTTTATATA GTTATGAACA AGGTTTGGGA
AGTATTAAGA CCATGGATAG AAGTGTTTTA GGATTGTAG
 
Protein sequence
MSNIDKVYTK KQQEIYRDVG SKDWFILILH GAKRSGKTQL NNDLFLRELI RVKKIANMEK 
VDKPQYILAG FSKSTIYQNV LIELSSKYGI DFKFDKLGNF TMLGVYVVQV GHGKIDGLGR
IRGMTSYGAY VNEASLANEL VFDEIRSRCS GKGARIICDT NPDNPEHWLK KEYIDNPTDR
ILSYKFTIFD NTFLDKRYLQ STIDTTPDGM FTERNIYGNW VSGEGVVYKG FDPKRHYVKS
LDGIRFSSYI AGVDWGYGHY GSIVVFGISN DGKYYMIEEH AEQYQEIDFW VAVAKDIANR
YKGIVFYCDS ARVEHIDRFS REGLVAYMAD KAVIPGIEAV SILYKTDKLF IYEYIAKRFK
EEIYSYVWAT NYRSDEVKKE FDDVMDSMRY ALYSYEQGLG SIKTMDRSVL GL
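
The protein sequence can be reproduced from the gene sequence by stripping the spacing between 10-base blocks and translating codon by codon. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string, whose sense-codon assignments are the same for translation table 11 as for table 1. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Table 11's alternative start codons are not handled, which is acceptable here because this gene begins with ATG.

```python
from itertools import product

# NCBI codon order: each base position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates sequence blocks and lines."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAGTAATA TCGATAAAGT TTATACTAAA"))  # MSNIDKVYTK
print(translate("GGATTGTAG"))                         # GL
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field of this record.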