Gene Apar_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0641 
Symbol 
ID8413501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp718900 
End bp720249 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content42% 
IMG OID645022219 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionYP_003179662 
Protein GI257784445 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.782635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGA TTCAAGAGAA AGAAGATTCT GTTCTTTCAG TATCTGATGC AGTGCTTTTT 
GCAAAAAATG AAGTTGCCTC AATGCCGTCT ATGACGGTAG TTGGAGAGGT TTCTGGATTT
AGAGGTCCAA ACTACAAATC TGGTCATTGC TATTTTGACG TAAAAGACCC AGTTAGCTCT
ATGTCGGTTA TTGTTTGGTC AAAGATTTAT GCTGCTAGTG GTATTACGTT GCAAGACGGC
ATGAAGATTC AGATGAAGGG AAAGTTTGAT ATTTACGCTT CAAGCGGCAA GCTTTCTTTT
CATGCTCGCA CTATCCAGAT TGCTGGAGAA GGAGACCTCC GTCAGAAGGT GGCACAGCTT
GCTAAAAAGC TAGAAAAAGA AGGTCTTACT GATATTTCTA GAAAGCGCTC TATTCCAGAG
TTTTGTACGC GCATTGGTGT GGTTACGTCA CTTTCTGGTA GCGTAATTGA TGACGTTAAA
CGTACGCTTG CACGTAGAAA CCCCCTTGTA GAGCTTGAGG TTTCCGGATG TGCTGTTCAG
GGCACTCATG CGCCCGCAAC TATCATCAGA GCGCTTCAGG TGGCTGCAAC CACGCGTCCA
GACGCAATCT TGCTGGTACG TGGTGGTGGC TCATTTGAAG ACCTTATGTG TTTCAATGAT
GAGGGAGTAG CGCGTGCTAT TGCTGCATGC CCTATTCCTG TTATTACAGG TATTGGACAT
GAGCCAGATA CTACAATTGC AGATATGGTT TCTGATAGAA GAACTTCAAC GCCAACGGCT
GCTGCAGAGT CTGTGGCTCC CGCAATTAGC GAGTTACAGA CTACGTTTAA TAACAGAATT
TCTCGCCTTG GAAACGCTAC AAGAAAGATT TTGCAGTTCT ATAAAACGGG TCTTATTACT
GTTGAACAGC GTTCTAACTT GGCAATGAAG AATGTTTTAT CAAGTAAAAA ATATCAGCTA
GATAAGTTGT CTGATCGTCC TTGTTTGAAG GATCCTATTT ATATTATTTC TACAAGAACA
GAAGATCTCA ATCAGACTGA GCAAAGGTTA ATGGATGCTT TTCCTGCACT TCTTTCAAGA
AAAAAAGAGA TGCTTTTTGT TGAGTCACAA AGACTTACCA GAGCTAGTGC CGCCATGTTT
CATCCACATC AGGCTTTGCT GTCTTCGCTT GCAGGAAGGC TAGATGCACT TTCTCCTTTA
AAAGTTCTTA CAAGGGGATA TGCATACGTT CAAGATGAAA GAGGATCTGT TATTAAAACG
GCCAAAGAGG CCTCTGTTGG AGATTCCTTG CTGGTCAAAA TGAGGGATGG AACAATTAAG
GCAACTGTTT CTGACATTGA ACTTTCATAA
 
Protein sequence
MNQIQEKEDS VLSVSDAVLF AKNEVASMPS MTVVGEVSGF RGPNYKSGHC YFDVKDPVSS 
MSVIVWSKIY AASGITLQDG MKIQMKGKFD IYASSGKLSF HARTIQIAGE GDLRQKVAQL
AKKLEKEGLT DISRKRSIPE FCTRIGVVTS LSGSVIDDVK RTLARRNPLV ELEVSGCAVQ
GTHAPATIIR ALQVAATTRP DAILLVRGGG SFEDLMCFND EGVARAIAAC PIPVITGIGH
EPDTTIADMV SDRRTSTPTA AAESVAPAIS ELQTTFNNRI SRLGNATRKI LQFYKTGLIT
VEQRSNLAMK NVLSSKKYQL DKLSDRPCLK DPIYIISTRT EDLNQTEQRL MDAFPALLSR
KKEMLFVESQ RLTRASAAMF HPHQALLSSL AGRLDALSPL KVLTRGYAYV QDERGSVIKT
AKEASVGDSL LVKMRDGTIK ATVSDIELS