Gene Dole_0249 details

Gene Information

Locus tag: Dole_0249
Symbol:
ID: 5693067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 283450
End bp: 284631
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 49%
IMG OID: 641262829
Product: restriction modification system DNA specificity subunit
Protein accession: YP_001528136
Protein GI: 158520266
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
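
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 283450
END_BP = 284631


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1182 393"
```

Both results agree with the Gene Length (1182 bp) and Protein Length (393 aa) fields.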


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGCA AAGAAAATTC TCAATTCTCA ACTCGCCATT CTCCATCGGT TCCGCCGGGA 
TACAAACAGA CCGAGGTGGG GGTGATTCCG GAGGATTGGG AAGTTAAGCC TCTCGCTTTT
GTTGTGAAAT ACACAAACGG AAAGGCGCAC GAGCAAAGCA TCACGGATTC GGGCAATTTT
GTGGTTGTAA ATTCCAAGTT CATTTCAACT GAAGGTATCA TTCGTAAATT TGCTCAAATG
CGTTTCTGCC CAGCGGAGAA AGGGGATGTG CTCATGGTGA TGAGCGATGT CCCAAACGGA
AGAGCCATTG CAAAATGTTT TTGGGTAGAT TGCGAAGATA CTTACACTGT CAATCAGCGT
ATTTGTGTCC TGAATCCTTG TGGGATAGAT GGCAAACTTC TGTATTACAA ACTCGACCGG
AATCCGTTCT ATTTGACATT TGATGATGGT GCTAAACAGA CGAACCTCCG AAAGGAAGAC
GTCCTTTCTT GCCCTCTGTC AATTCCAAAT ACCGAAGCCG AACAACGCGC CATCGCTGCC
GCCTTGAGCG ACGTGGATGC CCTGCTGGAT GGCCTCGACC GGCTGATCGC CAAAAAGCGC
GACCTCAAAC AGGCCGCCAT GCAGCAACTC CTCACCGGCC AAACCCGCCT GCCGGGGTTT
AAGGGGGAGT GGGAGATTAA ACGGTTGGGG GATGTACTTA TGGTCCGTCA CGGCAAGAGT
CAGCGCGGCA TCTCTGTGTC TGACGGGAAG TACCCGATTC TTGCATCCGG TGGAGAAATT
GGACGAACCA ATACCTGCAT TTACGACAAG CCCTCTGTTT TGATTGGGCG AAAAGGAACG
ATTGATTCAC CACAGTATGT GGACTCTCCC TTTTGGACGG TGGACACGTT GTTTTTTACG
GAAATTTCTA CCGAAGCGAA CGCCAAGTTC ATTTTTTCCA AGTTCTCTAT AATCCCTTGG
AGAACTTACA ACGAGGCTTC GGGTGTGCCC AGCTTAAACG CAAAAACTAT CGAAAATATC
GAGATTTTTT TACCCTCCCC CACCGAACAA ACCGCCATCG CCCAAGTCCT CTCCGACATG
GACGCCGAAA TCGCCGCCCT GGAACAGCGC CGCAACAAAA CCAGAGACAT CAAACAGGCC
ATGATGCAGG AACTTTTAAC TGGAAAGACG AGGCTGGTAT GA
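
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base grouping and line breaks shown here; `gc_percent` is a hypothetical helper name, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small illustrative input; paste the full gene sequence to check the record.
print(gc_percent("ATGC"))  # prints 50.0
```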
 
Protein sequence
MKSKENSQFS TRHSPSVPPG YKQTEVGVIP EDWEVKPLAF VVKYTNGKAH EQSITDSGNF 
VVVNSKFIST EGIIRKFAQM RFCPAEKGDV LMVMSDVPNG RAIAKCFWVD CEDTYTVNQR
ICVLNPCGID GKLLYYKLDR NPFYLTFDDG AKQTNLRKED VLSCPLSIPN TEAEQRAIAA
ALSDVDALLD GLDRLIAKKR DLKQAAMQQL LTGQTRLPGF KGEWEIKRLG DVLMVRHGKS
QRGISVSDGK YPILASGGEI GRTNTCIYDK PSVLIGRKGT IDSPQYVDSP FWTVDTLFFT
EISTEANAKF IFSKFSIIPW RTYNEASGVP SLNAKTIENI EIFLPSPTEQ TAIAQVLSDM
DAEIAALEQR RNKTRDIKQA MMQELLTGKT RLV
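
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the standard codon assignments, which translation table 11 shares; table 11 differs only in the set of permitted start codons, and that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and the input is assumed to be an in-frame coding sequence.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame CDS; one trailing stop codon is dropped."""
    dna = "".join(dna.split()).upper()  # remove grouping spaces and newlines
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the gene sequence in this record.
first_line = "ATGAAAAGCA AAGAAAATTC TCAATTCTCA ACTCGCCATT CTCCATCGGT TCCGCCGGGA"
last_line = "ATGATGCAGG AACTTTTAAC TGGAAAGACG AGGCTGGTAT GA"
print(translate(first_line))  # prints "MKSKENSQFSTRHSPSVPPG"
print(translate(last_line))   # prints "MMQELLTGKTRLV"
```

The two outputs match the first 20 and last 13 residues of the protein sequence listed above.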