Gene Apre_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1044 
Symbol 
ID8397831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1116211 
End bp1118061 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content35% 
IMG OID644995392 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003152793 
Protein GI257066537 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00420676 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACAAAAA AACAGATTAA AGAGAAATTA AAAGAACTTC CAGACCTACC TGGCGTTTAT 
ATAATGAGGA ATGCCGAAGA TGAAATTATT TATGTAGGCA AGGCCATTTC CCTAAAAAGA
AGAGTACGCC AATACTTTGA CAATAATAAA AACAAGGGGG CTAAGGTCCT TGCCATGGTC
AAAAACATCG ACCATTTCGA ATATATTATT GTTCAAAATG AGGTAGAAGC CTTAGTTCTT
GAGAGCAATT TAATCAAAAA AAATAGACCC AGATATAATA TAGTCCTAAG GGATGATAAG
CAATATCCCT ATATTAAAAT CTGTAAGGAA AAATATCCTA GGATAAAAAA GGTTAGACAA
GTTTTAAAGG ATGGGGGACG CTACTTTGGA CCCTTTCCTG ATGCTTATGC AGTAAATGAT
GCGATCGATT TATTTCACCT CTATTATCCC TTTAGAACTT GCAATCTAAA CTTCGATAAG
GGTGCAAGGC TAGATAGGCC GTGTCTCAAT TATTTCATCC ATAAATGCAA GGGACCTTGT
GTAGACAAGG AGGATGAGAA AAGATATTTA AACCATATAG ACGACGTGGT GAAGTTTTTA
GAGAACAAAT CAGAAAAAAT CCCGAATTGG GTTCTTGCTA AAATGAATGA TGCGAGTAAG
GATTTGAATT TCGAGATGGC TGCAAAATAT AGAGACTACT ATAGGGCCTT ATCTGTGATT
TCTGAAAGAC AGAACGTTAC AGAAACTGGT GGAGATGACC TAGATATAAT AGCCATGAGT
AAGGGAATGA ATTCTATTAT TATCCAGGTT TTCTTTATGC GTATGGGAAA GATTGTCGAT
AGAGAGCACT TCATAATCAA AAACGATTTT ATGGAAACTG ACTCAGATAT TATGTCTTCA
TTCATAAAGC AGTTTTATCT AGATATAATG TATGTACCAA AAGAAATTCT TGTCCAATAT
ATGCCAGCAG ATTTTGATTC TATTAGCGAA TTTTTATCTA GAAAGAAAAA ATCCAAGGTT
TATATACACA ATCCAAAACT CGGAAAGAAA AAAGAGCTTG TAGATATGGC AAGCCGTAAT
GCTCTCGATA TGAGAATTAA ATATGATAAA AGAGTTGAAA GAAAGGAAAG GAAGAAATCT
TCTGGAATCA ACCAGCTCAT GGACATACTT AATCTTAAAG ATGTATTTAG AGTAGAATGT
TATGATATTT CTAACACATC CGGCGTTCTT TCGGTAGGAT CTATGGTTGT CTTTGAGGGA
GGAATACCAA CTCCTAAGGA ATATAGGAAG TTTAAAATAA AGACTGTAGT AGGAAGCGAT
GATTATGCAT CCCATAGGGA AGTTCTTACG AGAAGGCTAA AACGTGGACT AGAAGAAAAA
GAAAAAGGAA ATACCCAGAC AGGGTTTGGA TCTCTACCTG ATTTGATCTT AATGGATGGA
GGAAAAGGGC AGGTCACTAT AGCAAAAGAA GTTATAGATG GCTTGGGTCT ATCCATAGAA
GTAGCAGGAC TTGTTAAGGA TGATAAACAT ACTACAAGAG CAATAGTCTA TAACAATGAG
GAAATCGCCA TTAACAGGAG AGATCCTGTC TATAAGCTCA TATATGAAAT CCAAGAAGAA
GCTCATAGAT TTGCAATAAA CTACCATAGA AAGCTTATGC AAAACACAAT GAAGACTACA
GAACTAGACA ATATAAAAGG GGTTGGCGAG AAAACTAGGA AAAACCTCTA CAAGCATTTT
AAGACAATTT CAAATATTAA AAAGGCAAGC GTTGAAGAGC TCATGGAAGT TCCTCTGGTG
GGGAAAGTTC AAGCCTTAGA AATATACAAA TATTTTAGAC TGAAAGGATA G
 
Protein sequence
MTKKQIKEKL KELPDLPGVY IMRNAEDEII YVGKAISLKR RVRQYFDNNK NKGAKVLAMV 
KNIDHFEYII VQNEVEALVL ESNLIKKNRP RYNIVLRDDK QYPYIKICKE KYPRIKKVRQ
VLKDGGRYFG PFPDAYAVND AIDLFHLYYP FRTCNLNFDK GARLDRPCLN YFIHKCKGPC
VDKEDEKRYL NHIDDVVKFL ENKSEKIPNW VLAKMNDASK DLNFEMAAKY RDYYRALSVI
SERQNVTETG GDDLDIIAMS KGMNSIIIQV FFMRMGKIVD REHFIIKNDF METDSDIMSS
FIKQFYLDIM YVPKEILVQY MPADFDSISE FLSRKKKSKV YIHNPKLGKK KELVDMASRN
ALDMRIKYDK RVERKERKKS SGINQLMDIL NLKDVFRVEC YDISNTSGVL SVGSMVVFEG
GIPTPKEYRK FKIKTVVGSD DYASHREVLT RRLKRGLEEK EKGNTQTGFG SLPDLILMDG
GKGQVTIAKE VIDGLGLSIE VAGLVKDDKH TTRAIVYNNE EIAINRRDPV YKLIYEIQEE
AHRFAINYHR KLMQNTMKTT ELDNIKGVGE KTRKNLYKHF KTISNIKKAS VEELMEVPLV
GKVQALEIYK YFRLKG