Gene Dole_1519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1519 
Symbol 
ID5694356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1812018 
End bp1813100 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content61% 
IMG OID641264114 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_001529400 
Protein GI158521530 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR00586] mutator mutT protein
[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.570552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTGT TTTCCGCCGG GCCGTTCCAG CGCCGGCTGC TGCGCTGGTA TACGGCCCAT 
CAGCGGGACC TGCCGTGGCG AAGATCGAAA AACCCTTATC ACATCTGGGT TTCCGAGGTG
ATGCTGCAAC AGACCCAGGT TGCCACGGTG GTTGATTACT ACCGGCGGTT TCTGCAGGCC
TTTCCCGATA TCGGGACCCT GGCGGTCGCC GAGCTTCAGG ATGTTTTAAA GCTGTGGGAG
GGTCTGGGCT ACTATGCCCG GGCAGCCAAC CTTCACAAGG CCGCGCGGCA AATCGTTGCC
GGCGGTAAAA AGCGCGTTCC CCGCACCCCT GAAACCTTTG GCCGGCTGCC CGGCGTGGGG
GACTATATTA ACGCGGCGGT CTCCAGCATC GCCTTCGGCC ATCCGCTGCC GGTGGTCGAC
GGCAATGTCA AGCGGGTCCT GGCAAGGCTT TTTCTTTTGG ACGAGCCGGT CAACCGGCCC
TCAAACCACA GGGTGTTTCT TGAAAAGGCC CGCCTGCTGC TGGCTTTCAA AGATCCCGGC
ACCTTTAATC AGGCGATGAT GGAGCTGGGC GCCCTGGTGT GCAAACCGGG CCGGCCCCTG
TGCGACCAAT GTCCTGTAGC ATCGTTCTGT GGGGCCCATC AGGCCGGGCG CGTCACCGAT
TTCCCCAGGC GGCTGGCCGC CAGAAAAAAC CCTCACCATC ATCTGGCTGT GGGGCTGGTA
AAAAAAGGAA ACCGGTTTCT TATTGTGCGT CGGCCGGCAA CCGGTCTTCT GGCCGGGCTG
TGGGAGATGC CCGGGGGCCG AGTAGAAAAG CCTGAAAACC CGGCCGATGC CTGTTGCCGG
GCCGTTCTGG AGAGCGTCGG TCTCACGGTT TTCCCCGGCC CGCGCCTTGC CCGGGTTGCC
CATGCCTACA CCCATTTTAA AATCACCATG GACCTGTTTG CCTGCGACGT TGTCTCCGGC
CGAGTAAAAA GAAACGGGTA CCAGGCCCAC CACTGGATCC GCATGAAAGA TATTGGCCAA
TATCCTTTTC ACAGGGCCAT GCACAAGGCC TTTGCCGCAC TGGCGGGCGC CCTCCCTCCT
TGA
 
Protein sequence
MTLFSAGPFQ RRLLRWYTAH QRDLPWRRSK NPYHIWVSEV MLQQTQVATV VDYYRRFLQA 
FPDIGTLAVA ELQDVLKLWE GLGYYARAAN LHKAARQIVA GGKKRVPRTP ETFGRLPGVG
DYINAAVSSI AFGHPLPVVD GNVKRVLARL FLLDEPVNRP SNHRVFLEKA RLLLAFKDPG
TFNQAMMELG ALVCKPGRPL CDQCPVASFC GAHQAGRVTD FPRRLAARKN PHHHLAVGLV
KKGNRFLIVR RPATGLLAGL WEMPGGRVEK PENPADACCR AVLESVGLTV FPGPRLARVA
HAYTHFKITM DLFACDVVSG RVKRNGYQAH HWIRMKDIGQ YPFHRAMHKA FAALAGALPP