Gene Apar_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0121 
Symbol 
ID8412965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp135534 
End bp136535 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content50% 
IMG OID645021689 
ProductHhH-GPD family protein 
Protein accessionYP_003179148 
Protein GI257783931 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000756832 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.180452 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA AAATTTCACA TACAAAAACA AACGAAGACC TGGATACGGT GTCTGACATT 
TCAACATCTT TATCCGATGA GCTTGCAGTT ACATTTAAAA AGACAGAGCT TACGACAGAG
CTACGTGCGT TTGTCGAGTC TGTGGCCAAA AAAGGCCGCG AGCTGTATCG CGATTTGCCT
TGGCGTCGCA CGTACGATCC ATATGCCATT TGGATTTCTG AGGTCATGCT TCAGCAGACG
CAGGTTAGTC GCGTAGATGG TCGCTGGCAG AGATGGTTGG AACACTTTCC AACGGTTGAT
GCGCTGGCTG CTGCCGCGCC TTCAGATGTA CTTGAAGAAT GGCAGGGCCT GGGCTACAAC
CGTCGAGCTT TGTCTGTACA TCGAGCTGCT CAAGCAATTT CTGAAGCAGG CGGAGTCTTT
CCACAAGATC AAAAGGAGCT CGTAAAGCTT CCAGGCATTG GTCCCGCTAC TGCAGCAGGT
ATTCGCGCGT TTGCGTTCAA TCTGCATGGC GTTTATTTGG AGACTAACGT TCGTACGGTT
TTCTTGCATG AGCTTTACCC GCAGGCAGAA GGAGTGCCAG ACTCTGAGCT TATTCCTCTT
GTTGAGCTGA CGTGCCCTGC GAGTGTTTCT ACCGCAGCGG GCACTGACAC AGCAAACGCT
GCTACAACGG AACTCACGCC GCGTAGCTGG TACTACGCCC TTCTCGACTA TGGCGCGTAC
CTGAAGAAAA CTATTCCCAA TCCTTCACGA AGGTCTAAAA GCCACGTCAA ACAGTCTCGC
TTTGAGGGCT CTCATCGGCA GAAGCGTGCT GAGCTTTTAC GCGTTCTTCT TGCCCACAAA
GATGAGGGTG GAGCAGAGTT TGAGACACTT CATCAGGAAC TCTGTCAGAT TGAGGTCCAT
GCCGGGCGAG AAACCCTTGA TGAGCAGGTT ACCCTTGGCT TACTTGAAGA ACTTGCGAAG
GAGGGCTTCT GTCAGAAAAA TGATGAATAT TGGTTGCCAT AA
 
Protein sequence
MKKKISHTKT NEDLDTVSDI STSLSDELAV TFKKTELTTE LRAFVESVAK KGRELYRDLP 
WRRTYDPYAI WISEVMLQQT QVSRVDGRWQ RWLEHFPTVD ALAAAAPSDV LEEWQGLGYN
RRALSVHRAA QAISEAGGVF PQDQKELVKL PGIGPATAAG IRAFAFNLHG VYLETNVRTV
FLHELYPQAE GVPDSELIPL VELTCPASVS TAAGTDTANA ATTELTPRSW YYALLDYGAY
LKKTIPNPSR RSKSHVKQSR FEGSHRQKRA ELLRVLLAHK DEGGAEFETL HQELCQIEVH
AGRETLDEQV TLGLLEELAK EGFCQKNDEY WLP