Gene Dole_3097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_3097 
Symbol 
ID5695957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3711364 
End bp3713577 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content58% 
IMG OID641265714 
Productadenine-specific DNA methylase containing a Zn-ribbon-like protein 
Protein accessionYP_001530977 
Protein GI158523107 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.622238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAAA AACGTTTTGA CATATCCTTT ATCGCCGACC TGGCCCTGCG GGAGAAGCAG 
ATTCAGCAGA ACTACCGGCC TGTTATTGCC GTGCATAAGT GGTTTGCCCG GCGCCCGGGA
ACCCTGTTTC GGGGCCTTCT GCTCTCCGAG TTTGTCGATT CGCCGCTTCG GGATGTGTTT
TACAGGGCCA ACGACCTGGA CGGCAAAACC GTGGCAGACC CCTTCATGGG AGGCGGCATT
CCGGTGCTGG AGGCCAACCG GCTGGGGTGT GATGTGACCG GGTTTGACAT CAACCCCATG
TCCTACTGGA TCGTCAAGCA GGAGATCGAG CACCTGGATC TGAAAGCTTA TGAACGGGCC
GCGACCGTCC TTTGTCAAAC CCTGGAAAAG GAGGTCGGCC CGTTTTACCG GACCCGGTGC
GAAGTGTGCG GTTCCGATGA TGCCCATGTG AAATATTTCC TCTGGGTCAA GACCATTCCG
TGTCAAGGAT GCGGAAAAAC CGTAGACCTG TTTCCCGGCT ATCTTGTTTC CGCCGACGCT
CGTCATCCTT TAAATGTGTT TGTGTGCCCG GCGTGTGGGG ACCTGACAGA AACAAAGAGC
CGGACCTCGC CGGGGAATTG CGACACCTGT TCCGCCGCGT TGACCATGGC CGGCCCTGCC
GGGCGAAGCC GGTGCAAGTG TCCGGCCTGC GGGGTGGATA ACACCTACCC CGATGCAGCC
GCAGGCCCGC CGGATCACCG CCTGTTTGCC ATCGAGTATC ACTGCCCGGC CTGCAAGCCT
TCCCATGCGG GCCGGTTTTT TAAAAAGCCG GATGCACGGG ACCTGGCCGG GATGGGGACA
GTTGAATCCC GATGGAAAAA AATGCGACCC CGGTATGTGC CTACGGATCC GATTCCGGGC
GGCGACGAAA CCGACAGGCT GCACCGGTGG GGATACCGGT TTTACCGTCA GATGTTCAAC
AGCCGCCAGC TTCTGGGCCT GGAACTGTCG GCCCGCATTA TTGCGGGCAT CGAAGCAGAG
CGGGTTCGAA ACGCTTTGGC CACCAACCTT TCCGACCTGC TGCGGTATCA GAATATGCTG
TGCCGGTATG ACACACGGGC CTTAAAATCG CTGGACATTT TTTCGGTACA CGGCTTTCCC
GTGGGGCTGA TCCAGTGCGA GTCCAATTTT CTGGGTATTC GTGCCCAGGG CCGCGGCATG
TGTATTGGCA GTGGAGGGTG GGCCAACATC ATTGAAAAAT TTAAAAAAGC CAAAGCCTAT
TGCGACCACC CCTTTGAGAT TCGTCACCAG GGCCGGGCAA AAAAAGTGGT GCCCATTGCC
GGTGAATGGA TCGGGGACAG GAGAAACGGT CATGACGGCC CACCGGAAAG AAAGGTCGAT
CTGTCCTGCC GGGACGCGGC CGCAGCCGCC TTGCCCGGCG GAACATTGGA TGCCGTGCTT
ACAGACCCGC CCTATTTCGG CAATGTGCAG TATGCCGAAC TGATGGATTT CTGTTACACC
TGGTTGCGCC GGCTGGCAGG CTCCACGGCC GCGCCCTTTG ATACGGTATC TACCCGAAAC
CCCCATGAGC TGACCGGCAA CCTGGACATG GGCCGGGACC TGGCTCATTT CACCGAGGGG
CTTTCAGCAG TGTTCCGCCG AATGGCAACG GCTTTAAAGC CGGGAGCCCC GCTGGTGTTC
ACCTATCATC ACAACACCAT TGAGGCATAT TATCCCGTGG CCGTGGCCAT GCTGGATGCA
GGTCTGACCT GCTCGGCGTC ACTGCCCTGC CCGGCGGAGA TGGGGGCCTC TATTCATATC
AACGGAACCG GCTCTTCCAT CATCGACACG GTGTTTGTCT GTCGGGCCAC CGGCGTCATG
CAGCGCAAAT GGCTGGCCGA TTCAATCGGC GGGGTCGCGA AAATCGTTGA AGCGGATCTG
GCTCTGCTGC GGGCCGGCAA AGTGAACCCC ACTCCCGGCG ATACCCGGTG CATCACTTAT
GGCCACCTGA TTCGGCTGGC TGTGTGGTCG TTGCGAACCG ATTGGAATCA AGATGCCCCT
GCGGCCCTGC GTATCGCAAA GGTGGCAAGC TGGTTCGTAC ATTTTGGCGG CTGGCCCGAT
GTGAAAAAAT ACATGGGTCA CGACAGAAAA GCAATCATGC GGGAAGTTCC TTTGCTCGCG
ATTTCAGAAA CCAGCGCGGA ATATGGAGTC GAATATGCCG AAATACCCTT TTGA
 
Protein sequence
MIEKRFDISF IADLALREKQ IQQNYRPVIA VHKWFARRPG TLFRGLLLSE FVDSPLRDVF 
YRANDLDGKT VADPFMGGGI PVLEANRLGC DVTGFDINPM SYWIVKQEIE HLDLKAYERA
ATVLCQTLEK EVGPFYRTRC EVCGSDDAHV KYFLWVKTIP CQGCGKTVDL FPGYLVSADA
RHPLNVFVCP ACGDLTETKS RTSPGNCDTC SAALTMAGPA GRSRCKCPAC GVDNTYPDAA
AGPPDHRLFA IEYHCPACKP SHAGRFFKKP DARDLAGMGT VESRWKKMRP RYVPTDPIPG
GDETDRLHRW GYRFYRQMFN SRQLLGLELS ARIIAGIEAE RVRNALATNL SDLLRYQNML
CRYDTRALKS LDIFSVHGFP VGLIQCESNF LGIRAQGRGM CIGSGGWANI IEKFKKAKAY
CDHPFEIRHQ GRAKKVVPIA GEWIGDRRNG HDGPPERKVD LSCRDAAAAA LPGGTLDAVL
TDPPYFGNVQ YAELMDFCYT WLRRLAGSTA APFDTVSTRN PHELTGNLDM GRDLAHFTEG
LSAVFRRMAT ALKPGAPLVF TYHHNTIEAY YPVAVAMLDA GLTCSASLPC PAEMGASIHI
NGTGSSIIDT VFVCRATGVM QRKWLADSIG GVAKIVEADL ALLRAGKVNP TPGDTRCITY
GHLIRLAVWS LRTDWNQDAP AALRIAKVAS WFVHFGGWPD VKKYMGHDRK AIMREVPLLA
ISETSAEYGV EYAEIPF