Gene Dole_2237 details

Gene Information

Locus tag: Dole_2237
Symbol:
ID: 5695085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2708897
End bp: 2710135
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 50%
IMG OID: 641264843
Product: restriction modification system DNA specificity subunit
Protein accession: YP_001530118
Protein GI: 158522248
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
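
The start, end, gene length and protein length fields listed above are consistent with one another, as the following minimal sketch shows. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2708897, 2710135)
print(length_bp)                  # 1239
print(protein_length(length_bp))  # 412
```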


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCTTC ATGAAGTGTG TGATCTGATT GTTGATTGTG AGCATAAGAC CGCACCAACC 
CAAGCAGAAG GCTACCCCTC TATTAGAACC CCAAACATTG GTCGAGGATA TTTTCTTTTG
GACGGAGTGA ATCGTGTTTC AGAAGAAACT TACCGGTCAT GGACAAGACG AGCCGAACCA
AAACCTGGCG ACTTAATAAT GGCCCGAGAG GCTCCGGTTG GGAATGTTGC TATGGTCCCT
GCGGGTCTTC GCCCCTGCCT TGGACAAAGG ACCTTGCTGA TACGACCAAT GAGGTCAAAG
GTTTTCCCAC GCTACCTCGC GTATTTGCTG ATTGGCGACC AAATCCAGAA TATTATCCAT
GCCATGACGA ATGGAGTCAC CGTACCTCAT TTAAACATGA AGGATGTGAG GTCGCTTCCC
CTACCACCGC TTCCCCCCCT TCCCACCCAG CGCAAAATCG CCGCCATACT TTCGGCCTAT
GACGACCTGA TCGAGAACAA CCTGAGGCGG ATCAAGATTC TGGAGGAGAT GGCGCAGAAC
CTCTACCGCG AGTGGTTCGT CAAGTTCCGC TTCCCCGGCT GGGAGAAAGC CCGCTTTGTG
GATTCGCCGC TGGGGAAGAT TCCGGAGGAG TGGGAGGTGA CAACAATCAA CAAAGTCACC
TCATACATTA ACCGTGGCGT AACTCCTAAA TATGACGCCT CTGCATCGAG TCTTGTTGTA
AATCAAAAAT GTATTCGTGA TCGCAAACTT AACTTGAGCC TTGCGAGACA GCATAAAAGT
CGCGTGATGG ATGACAAATA CGTTGTGTTT GGCGATATTT TGATCAATTC CACTGGTGTT
GGAACTTTAG GTCGTGTGGC CCAGGTGTAT GAAGATTTGA ACGATGTGAC AGTTGATACG
CATGTGTCGA TTGTTCGCCC TTCAAACGGA GATGGCATTG ATTTCTTGGG CCTCGCCTTG
ATTGATTTAG AGCCTCATTT TGAGTCGCTC GGAGCGGGTG CCACCGGTCA AACCGAGCTT
CGTCGTGATA GGATTGGTGA AACCGAAATC GTTTTACCAC CGGTTAAAAT GCGGAAGCAG
TTTTCAGAAA AGGTAACTTC GCTTCGAAAA TTGGTCCTTA ATCTGGCAGC TCGAAACGAA
ACCCTGCGCC GCACCCGCGA CCTGCTTCTC CCCAAACTCA TATCCGGCGA GGTGGATGTG
TCGGAACTGG ACATCGCTAT TCCTGAGGAG GCTGCATGA
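
A GC content figure such as the 50% reported for this gene can be recomputed from the sequence text. The sketch below is one way to do it, assuming the sequence is pasted in the 10-base blocked layout used here; the function names are hypothetical, and how the source database rounds the value is not stated.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks of a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_content(blocked: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Usage on the first block of the gene sequence only:
print(round(gc_content("ATGGACCTTC") * 100))  # 50
```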
 
Protein sequence
MDLHEVCDLI VDCEHKTAPT QAEGYPSIRT PNIGRGYFLL DGVNRVSEET YRSWTRRAEP 
KPGDLIMARE APVGNVAMVP AGLRPCLGQR TLLIRPMRSK VFPRYLAYLL IGDQIQNIIH
AMTNGVTVPH LNMKDVRSLP LPPLPPLPTQ RKIAAILSAY DDLIENNLRR IKILEEMAQN
LYREWFVKFR FPGWEKARFV DSPLGKIPEE WEVTTINKVT SYINRGVTPK YDASASSLVV
NQKCIRDRKL NLSLARQHKS RVMDDKYVVF GDILINSTGV GTLGRVAQVY EDLNDVTVDT
HVSIVRPSNG DGIDFLGLAL IDLEPHFESL GAGATGQTEL RRDRIGETEI VLPPVKMRKQ
FSEKVTSLRK LVLNLAARNE TLRRTRDLLL PKLISGEVDV SELDIAIPEE AA
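
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows. It relies on table 11 using the same codon-to-amino-acid assignments as the standard code for internal codons; alternative start codons, which table 11 permits, are translated literally here rather than as methionine. The function name is hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the product
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence, followed by its stop codon:
print(translate("ATGGACCTTC ATGAAGTG TGA"))  # MDLHEV
```

Applied to the full gene sequence, the result can be compared against the protein sequence after removing its spaces and line breaks.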