Gene Dole_2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2093 
Symbol 
ID5694936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2542111 
End bp2543544 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content44% 
IMG OID641264694 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001529974 
Protein GI158522104 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGAGC TGCTGCCGGA GGGATGGGTT GCGGCCCCTC TTCAGAAAAT TTCTCAGATT 
GTATATGGCA AGGGCCTTCC AAAAAATAAG TTTAACAAAC AAGGTCTGTA TCCCGTATTT
GGCGCCAACT CAATAATTGG CTATTATGAT TCGTTTTTAT ATGAAGATCC CCAAGTTCTA
ATATCTTGCC GGGGGGCTAA TAGCGGAACT ATTAATATTT CTCCCCCGAA ATGTTTTGTC
ACTTCGAACT CATTGGTTGT CCAATTGCCC AACACTCTTC ATCAAAGTTT CAAATATTTG
TACTATGCAC TCGAATCAAG CGACAAAGAA AAAATTGTTA CCGGCACGGC ACAGCCGCAA
GTAACAATAG ATAATTTAAA AAGTTTTTGT GTTCCCCTCC CGCCCTTTAA CGAACAAAAG
CGCATTGTCG CCCGGCTGGA CCAAATCATT CCCCGCATTG ACAAATTAAA AACCCGGCTG
GACAAAATCC CCACCATCAT CAAACGCTTC CGTCAGTCTG TTTTAACCGC CGCCGTCACC
GGCCGCCTCA CCGAAAAATG GCGGGAAGAC CATCCGGATG TGGAGGGTGC GGAGGCTACT
GTTCAATCGA TATATTATAG ACGTCTTGAT GAAAGCCAAA CAAACCAACA AAAGAACAAA
ATTGAAAAGT TGTTTGCTGA AGTTGAGACC GAAGACAATG GGTTGCTCCC AGAAACATGG
AAGTATACTT TTCTGAATAA GATTTGCGAA TCATTTCAAT ATGGAACATC CAGCAAATCA
AGCAAAAAAG GAGACATTCC TGTCCTCAGG ATGGGCAATT TGCAAAATGG GGCAATCGAC
TGGAGCAATC TTGTTTATTC TTCCAATAAG AAAGAAATAG AAAAATATAA ATTAGAAAAA
AATACGGTTT TATTCAATCG TACCAATAGC CCTGAATTGG TCGGCAAAAC AGCAATTTAT
TTGGGAGAAC GAGCCGCTAT TTTTGCAGGT TATCTTATTA GAATCAATAA TATGGATATT
CTCGATTCCC ACTATCTCAA TTATTCTTTA AATACGGATT ATGCTAAAGC CTTTTGTAAT
AGAGAAAAAA CGGATGGTGT GAATCAGTCG AACATTAATG CACAAAAACT TGGCCGCTTT
GAGATCCCCT TCCCGCCCCT TGAAGAACAA AAAGAGATCG TCCGGCAAGT GGAGCGGTCG
TTTGCCCTGG CCGACAAGCT GGAGGCCCAT TATCAAAACG CCCGAGCCCG GGTGGATAAG
CTGGCCCGGT CGGTGCTGGC CAAGGCCTTT CGCGGTGAAC TGACGCCTCA GGACCCAAAC
GACGAGCCCG CCGAAAAGCT GCTGGAACGC ATTCTGGCGG AAAAAGAAAA AATGGCAGCA
GCCGTCAAAA AAACCCGGAA ACAAGCAAAG CGGAAAAGTC GCACGACAAC TTAA
 
Protein sequence
MKELLPEGWV AAPLQKISQI VYGKGLPKNK FNKQGLYPVF GANSIIGYYD SFLYEDPQVL 
ISCRGANSGT INISPPKCFV TSNSLVVQLP NTLHQSFKYL YYALESSDKE KIVTGTAQPQ
VTIDNLKSFC VPLPPFNEQK RIVARLDQII PRIDKLKTRL DKIPTIIKRF RQSVLTAAVT
GRLTEKWRED HPDVEGAEAT VQSIYYRRLD ESQTNQQKNK IEKLFAEVET EDNGLLPETW
KYTFLNKICE SFQYGTSSKS SKKGDIPVLR MGNLQNGAID WSNLVYSSNK KEIEKYKLEK
NTVLFNRTNS PELVGKTAIY LGERAAIFAG YLIRINNMDI LDSHYLNYSL NTDYAKAFCN
REKTDGVNQS NINAQKLGRF EIPFPPLEEQ KEIVRQVERS FALADKLEAH YQNARARVDK
LARSVLAKAF RGELTPQDPN DEPAEKLLER ILAEKEKMAA AVKKTRKQAK RKSRTTT