Gene Dole_0824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0824 
Symbol 
ID5693659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp954240 
End bp956591 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content53% 
IMG OID641263421 
ProductEcoEI R domain-containing protein 
Protein accessionYP_001528711 
Protein GI158520841 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATAAAA GCGAAAAGAA AAAACTCAGC GAAACAGATA TCTGTGATCT GTTCATCACC 
CCGGCGATTA AAAATGCAGG GTGGGACCCT ATTCGGCAGA TTCGTCGGGA GGTAACGCTC
TCTCCCGGTC CGGTTGTTGT GCGGGGCAAC ATGTCCTCGC GCAACAAGAA AAAGAAAAAA
TTTGCCGACT ATGTCCTAGC CTGGGAGCCG GGCGTTCCCG TGGCGGTCGT GGAAGCCAAA
GCAAACGATC ACACCGTCAG CCAGGGCCTA CAACAGGCCC TGGGCTACGC CGAGATATTA
CAGGTGCCCA GCGCCTTCAG TTCCAACGGC GATGCCTTTG CCTCCCACAA CAAAGTTCCG
GCCACCGGTG AAGAAATTGA AACCCAGCTT CCCCTTGACC AGTTCCCGCC GCCGCCCACT
CTCTGGCAGC GCTATAAAAC CTATCGCAAT ATTGAAGACG CCTCGGACGA CTTGCTTCTG
CAGCCCTATC ATTTCGACGC CACGGAAAAA GAACCCCGCT ATTATCAGGT CGAAGCCATC
AACCGGGTCA TTGAAGCCGT TGCCAAAGGC AACCGGCGGA TGCTGCTGGT CATGGCCACC
GGCACCGGCA AAACCTACAC CACCTTTCAG ATCATCTGGC GGCTGTGGCA GGCCAGGGCC
GTAAAGCGCG TCCTGTTTCT GGTGGACCGA AATATCCTGG CCGACCAAAC CCTGGTCAAC
GACTTCAAAC CCTTCGGCTC GGTCATGACC AAGGTGAAAA ACCGAAAAAT TGATCCAGCC
TATGAAATTC ATCTGGCCCT TTACCAAGCC ATCACCGGCC CGGACGAAGC AGACAAAATT
TTTAAAAGCG TTACCCCTGA CTTTTTCGAC ATGGTCGTTA TTGATGAATG CCATCGTGGC
AGCGCCGCTG AAGACTCCGA CTGGCATGAA ATCCTCAATT ACTTTTCCGG CGCCATCCAA
CTGGGCCTTA CCGCCACCCC CCGGGAAACA AAATATGTCT CCAATATCTC CTATTTCGGT
GAGCCGATTT ACACCTACAG CTTAAAGCAG GGCATTCAGG ACGGCTTTTT AGCGCCCTAT
AAAGTGGTGC GCATCGACAT AGACAAAGAC ATCCGTGGCT GGACGCCGCC GCCCGGTATG
GTAGACGATC TCGGCCAGGC AATTGAACAT CGCACTTACA ACCAGAAGGA CATGGATCGC
ATTCTCGTGC TCAACCAGCG CACCAAGCTG GTTGCCAAGC GCGTTATGCA GCTTCTTCGC
GCCACCGATC CGTTTTCCAA AACCATCATC TTTTGTGAAG ACATCGACCA TGCCGAGCGC
ATGCGCAAGG CCATTGTTAA TGCCGCCGGC CAACTGGCCA TCGACAACGC CAAATATGTT
ATGCGGATTA CCGGCGACAG CCCCGAAGGT AAGGCCGAGC TGGACAACTT TATCGACCCG
GAAAACCCTT TCCCCGTTAT TGCCACCACC TCGGACCTTC TCACTACCGG GGTAGACGCC
AAGACCTGCA AACTCATCGT ATTAGACAAA ACCATCCACT CCATGACCAC TTTTAAGCAG
ATTATCGGCC GGGGCACACG CATCGATGAA GACAACAATA AATGGTTTTT CACCATCATG
GATTTTAAAA AAGCCACCAA ACTGTTTGCC GATCCGGAAT TCGACGGTGA GCCGGTGGTG
ATTTACGAGC CCGACGATGA CGACCCCCCG GTCCCGCCTG ATCCAAAACC GGATGATGAC
GATGACGGTA CCATCGAAGA CCCCGGGCCG GGCATCCAAA AGTTTGTGGT AAACGGCGTA
CCGGTCAGCA TCATTGCCGA GCGCGTCGAA TACTACGGCC CGGACGGCGA CCTCATCACA
GAGTCCTACC GCGACTTTAC CCGCAAGCAG ATCCACCAGG AATTTACTTC GTTAGATGAA
TTTCTTGCGC GCTGGAACGC GGCTGAAAAA AAGCAGGCCA TTATCGATCT GCTGGAAGAA
CACGGCATTA TTCTGGAAAA CCTGGCCGAA GAAGTGGGTA AGGATTTCGG CGACTTTGAC
CTTATCTGCC ATATCGCCTT TGACCAACCG CCGCTTACCC GTAAAGAACG GGCCAACAAT
GTAAAAAAGC GAAACTACTT CACCAAATAT GGCGAGCAGG TCCGGGCCGT GCTGGCCGCC
CTGTTGGACA AATACGCCGA CGAAGGCATC CGCACGCTTG AGAACGCCAA GGTGCTTAAA
ATGAAGCCCT TCAGCGACAT GGGCACCCCC ATGGAGATCA TCAACACTGT TTTTGGCGGC
AAAGCCAATT ATGACAATGC CATTGCCGAA CTGGAAAAAG AACTGTTTAT TAATACGGAG
CAGCGAGCAT GA
 
Protein sequence
MDKSEKKKLS ETDICDLFIT PAIKNAGWDP IRQIRREVTL SPGPVVVRGN MSSRNKKKKK 
FADYVLAWEP GVPVAVVEAK ANDHTVSQGL QQALGYAEIL QVPSAFSSNG DAFASHNKVP
ATGEEIETQL PLDQFPPPPT LWQRYKTYRN IEDASDDLLL QPYHFDATEK EPRYYQVEAI
NRVIEAVAKG NRRMLLVMAT GTGKTYTTFQ IIWRLWQARA VKRVLFLVDR NILADQTLVN
DFKPFGSVMT KVKNRKIDPA YEIHLALYQA ITGPDEADKI FKSVTPDFFD MVVIDECHRG
SAAEDSDWHE ILNYFSGAIQ LGLTATPRET KYVSNISYFG EPIYTYSLKQ GIQDGFLAPY
KVVRIDIDKD IRGWTPPPGM VDDLGQAIEH RTYNQKDMDR ILVLNQRTKL VAKRVMQLLR
ATDPFSKTII FCEDIDHAER MRKAIVNAAG QLAIDNAKYV MRITGDSPEG KAELDNFIDP
ENPFPVIATT SDLLTTGVDA KTCKLIVLDK TIHSMTTFKQ IIGRGTRIDE DNNKWFFTIM
DFKKATKLFA DPEFDGEPVV IYEPDDDDPP VPPDPKPDDD DDGTIEDPGP GIQKFVVNGV
PVSIIAERVE YYGPDGDLIT ESYRDFTRKQ IHQEFTSLDE FLARWNAAEK KQAIIDLLEE
HGIILENLAE EVGKDFGDFD LICHIAFDQP PLTRKERANN VKKRNYFTKY GEQVRAVLAA
LLDKYADEGI RTLENAKVLK MKPFSDMGTP MEIINTVFGG KANYDNAIAE LEKELFINTE
QRA