Gene Dole_0103 details


Gene Information

Locus tag: Dole_0103
Symbol:
ID: 5692918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 115053
End bp: 116078
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 62%
IMG OID: 641262680
Product: single-stranded nucleic acid binding R3H domain-containing protein
Protein accession: YP_001527990
Protein GI: 158520120
COG category: [R] General function prediction only
COG ID: [COG1847] Predicted RNA-binding protein
TIGRFAM ID:
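
The coordinates and lengths in this table can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes a single terminal stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 115053
end_bp = 116078
reported_gene_length = 1026    # bp
reported_protein_length = 341  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one trailing stop codon is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under those assumptions, both comparisons agree with the reported values.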


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGACA CAAAAGAGTT TGAAGGCAAA AATATTGACG CGGCCCTGGA AAAGGCCAGC 
AGCGCCCTTA ACATGACCAA AGACCAGTTG CGCTACGAGG TGGTCTGTAC CGGGTCCAGC
GGCATTTTCG GGCTGGTGGG GGTTAAAAAC GCCCGCATTC GAATTCTCAA TTCGAAAAAA
GGGTCGGGGG GCGCCGGAGC AGGGGGCAGG CGGGACGTGC TGGACGAGGA CCGGCAGGAG
ATTCTCTCCA TGCTGGACGA GGCTTTTGCC GAACCGGCCC CTGAACCCGA AAGCCGGCCC
AGGCCCGAGG CTAAAGCAGC GCCCAGGGGA GAGCCCAGGG CTGAGTCGAA AAAAGCGCCC
AGGGCCAAAC CCAAATCCCG GCCCCGGACG GAAAAAGGGG CTCCTGCCCC GGCTGAACGG
TCAAGGGCGG CCGGCCGGCC GCCGGCAAAC GGGGCCAAGC CCCCTGCACC GGAGGAGCGC
CCAGAGACAC CGCCGGCCGC TTCAGAGGAA CTGCCGGAAT CGCCGCCGGC ACCGCCTGTT
GAGGTGAAAG AGGCGGACGT GGTTCTGGCC CGGGACGTTT TGAGTAAAAT ACTGGACAAC
ATCACCGACG AAGCCACGGT GAAAGTGGCG TTCGAGCCGG GCCGGGTCGG TTTTTCCATG
GAGGGCGGCA ACTCATCGGT GCTCATCGGC AAGCGCGGCA AAACCCTGAA GGCCATGCAG
CATATTGTTG AAAAGGTGGT TAAAAAATCG ATGGGTGAGG CGGTGGAGGT GCAGGTGGAC
GTGGAAGGGT ATCTGGAAAA ACGGGCATCC TCCCTGACAA CCCTTGCCTC CCGTCTGGCT
GAAAAGGCCC GGCAGACCGG CAAGCCCACC ACCATCAGCC GAATGGACGC CTACGAGCGG
AAGATCATTC ATGACGCCCT GCGGACCGAC AGAAGCGTGA AAACCCGCAG CGTGGGAAAC
GGGGACATTC GAAACGTGGT GATCCATCCC GGACGGCGGA CCAGCCGTAA AAAAACGGCG
CCATAA
 
Protein sequence
MTDTKEFEGK NIDAALEKAS SALNMTKDQL RYEVVCTGSS GIFGLVGVKN ARIRILNSKK 
GSGGAGAGGR RDVLDEDRQE ILSMLDEAFA EPAPEPESRP RPEAKAAPRG EPRAESKKAP
RAKPKSRPRT EKGAPAPAER SRAAGRPPAN GAKPPAPEER PETPPAASEE LPESPPAPPV
EVKEADVVLA RDVLSKILDN ITDEATVKVA FEPGRVGFSM EGGNSSVLIG KRGKTLKAMQ
HIVEKVVKKS MGEAVEVQVD VEGYLEKRAS SLTTLASRLA EKARQTGKPT TISRMDAYER
KIIHDALRTD RSVKTRSVGN GDIRNVVIHP GRRTSRKKTA P
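
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first 60 bases of the gene sequence listed above. It uses a hand-built codon table with the standard amino-acid assignments, which translation table 11 shares for codon-to-amino-acid mapping. The `translate` helper and its names are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses the same assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCGACA CAAAAGAGTT TGAAGGCAAA AATATTGACG CGGCCCTGGA AAAGGCCAGC"
print(translate(first_line))  # MTDTKEFEGKNIDAALEKAS
```

The output matches the first 20 residues of the protein sequence listed above. Running the same helper on the full gene sequence would be the natural next check.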