Gene Apre_0520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0520 
Symbol 
ID8397297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp597139 
End bp598254 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content37% 
IMG OID644994879 
Productnuclease SbcCD, D subunit 
Protein accessionYP_003152288 
Protein GI257066032 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGACTCC TCCACCTATC AGACCTACAT CTAGGAAAAA ATATCGGTTC GTATTTCCTA 
ATCGAAGAGC AAGGCTTTGC TCTAGCTGAG ATTATCAAAA TTATAAAAGA AGAAGATGTC
GATATAGTGA TGATTGCAGG AGATATCTTC GATACTATTA TTCCAAGTGC GGAGGCCATG
GATCTTTATT CTAACTTTAT CGAAGAGATA GTTTTTGATT TAGGAAAGAA GGTCCTAGCT
GTTTCTGGCA ATCACGATTC ATCTAAGAGA CTTGATATCA ACAAGAGATT CTACAGGTCC
AATAATTATT ATCTAGTAAG CGAATATGAC AAAGATCCTA TTAGCTTTGA GGATGATTTT
GGGAAAGTTA ACTTCTATCT CATTCCCTTT ATTTCCATAA ATAAGGCGAA AACAATCTTT
GATTCATCAA TAGATAATTT CACCGATGTC TATAAATATG CCCTAGAAGC TATTGACTAT
AGGGATAGGA ATGTGCTTAT TACTCATTGC TACGCTTCAA ATATGAGTTC ATTTGACAAA
GAAGTCTATG ACGAAGGTCA AAAGCCTCTT ACTATCGGAG GAACTGACGC CATGGATGCA
AGTTTATTTG AAGGCTTCGA CTATGTAGCC CTGGGCCATC TTCATAGGGC TCACTACGTC
TTAGACCCTA AGATCAGATA TTCAGGGACC TTTATGAAAT ATTCCTTCGA TGAGGAAAAT
CTTACAAAAA CTGTAAGCCT AGTTGACCTT AAAGATAAGG CAGAAATAAG AAAAATCGAA
ATCCCCTTCT TGAGGGACTT TGTTACAAAA AGGGGAATGT TTGAAGAAAT CTTAAAGGAA
GAAAAGTCAG AGGATTATAT AAAATTTATC CTAGAAGATT CCTATATTCA CGAAAATGCC
ATGGCAAGGT TAAAGGAGAA ATTCCCTAGG GCTGTCTCAA TCACTTACGC CAACAAGGCT
GTATTTGAGA GGGAAGATAG TTACGATGTG GACATAGATG ACAAGAATTT GCTAGAGCTT
TTTGCAGAAT TTTATCACTT CAAGATGGAT GAAGACCTTA AACAAAAAGA CACCCAACTT
ATACAAAGGA TAGGCTTATG CGACCAAGAA GACTAA
 
Protein sequence
MRLLHLSDLH LGKNIGSYFL IEEQGFALAE IIKIIKEEDV DIVMIAGDIF DTIIPSAEAM 
DLYSNFIEEI VFDLGKKVLA VSGNHDSSKR LDINKRFYRS NNYYLVSEYD KDPISFEDDF
GKVNFYLIPF ISINKAKTIF DSSIDNFTDV YKYALEAIDY RDRNVLITHC YASNMSSFDK
EVYDEGQKPL TIGGTDAMDA SLFEGFDYVA LGHLHRAHYV LDPKIRYSGT FMKYSFDEEN
LTKTVSLVDL KDKAEIRKIE IPFLRDFVTK RGMFEEILKE EKSEDYIKFI LEDSYIHENA
MARLKEKFPR AVSITYANKA VFEREDSYDV DIDDKNLLEL FAEFYHFKMD EDLKQKDTQL
IQRIGLCDQE D