Gene Dole_2404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2404 
Symbol 
ID5695252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2894699 
End bp2896000 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content59% 
IMG OID641265010 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001530285 
Protein GI158522415 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCA CCCAGCATTA CGACGCCATT ATCGTCGGGT CAGGCCCCGG CGGGGCAACC 
GTGGCCAGGG AACTGACAAA GCAGGGCAAA AAGGTCCTGA TTCTGGAATG GGGCAGCAAC
GCGCCGATAA AGGGGTCCAT GTTCCAGATG GCCCTGAATG CCGGCATGCC CGGCAAAAGC
GTGCTGTTTA CCAACAAAAA GATGCTGGCC ATGGTGAGAG GGATCTGCAC CGGCGGCAGT
TCTGTTTTTT ACTGCGGTAC CGCCTTTGAT CCGCCGTATG AAATGATGCG GTCCCACGGC
ATTGAACTTG AAGAGGAAAC GGCGGCGTTA AAAAAGGAGC TGCCTATTGC CCCGGCCGGT
GACGCCATTT TCGGCCCGGG CGCCCGCCGC ATGATGGAAA GCGCCCAGGA AATGGGCTAT
GACTGGAAGC CACTCAACAA GTTCATCTAC CAGGACAAGT GCAAGCCCGA CTGCTGGAAG
TGCAGTTACG GATGCCCGGA AGGTGCCAAG TGGAGCGCCC GCATGTTCGT GGAAGAGGCC
GTCACCGATG GCGCTGAACT GATCAACGGC GCAAAGGTGA CCCGGGTGCT GTTTGACGGC
AACACCGCCA CCGGCGTGGA ATACAAAAAG AACCTGGGCA CCCACAAGGT CACCGCCGAC
CGGGTCGTCA TCTCCGCCGG CGGGGTGGGG TCTCCCACCA TTCTCCGGGC CAGCGGCATT
TCCCGGGCCG GCTACGACTT TTTCTTTGAC CCCCTGATCA TGGTATTCGG CACGGTAAAA
AACCTCAAGG GCAAAGGCGA AATCCAGATG GCGGCCGGTG CCCACATGGC CGACGAGGGG
TACCTGATGG TGGACCTGGA TTTTCCCTGG CCCATGTACA TGGTGCAGAG CGCGCCCAAG
CTGCGGCTGC ACAAACTTCT CTCCAGGCGC GATACCCTGA TGCTGATGAT CAAGATCAAG
GATGACCTGG GGGGCCGCAT CACCGACGGC GGTGGGGTCC GCAAGGACAT CACGAAAAAC
GACAAGGCCA AACTGCAAAA AGGATATGAA CGGGCAAAAG GCATTCTGCA GAACGCCGGA
GCTAAAGGGG TGTTTTCCGG CTGGACCGTG GCGGCCCACC CCGGCGGCAC GGTCAAGATC
GGTGACGTGG TGGATTCGAA CCTGAAAACC GAAAAGGAGA ATCTCTACGT GTGCGACTGT
TCGGTGATGC CGGATGCCTG GGGCATTCCC CCCACCCTCA CCCTGCTGGC CCTGGGTAAG
CGGCTGGCAA AGCATTTGGG AGAGGAAATG GACGCAAAAT AA
 
Protein sequence
MNTTQHYDAI IVGSGPGGAT VARELTKQGK KVLILEWGSN APIKGSMFQM ALNAGMPGKS 
VLFTNKKMLA MVRGICTGGS SVFYCGTAFD PPYEMMRSHG IELEEETAAL KKELPIAPAG
DAIFGPGARR MMESAQEMGY DWKPLNKFIY QDKCKPDCWK CSYGCPEGAK WSARMFVEEA
VTDGAELING AKVTRVLFDG NTATGVEYKK NLGTHKVTAD RVVISAGGVG SPTILRASGI
SRAGYDFFFD PLIMVFGTVK NLKGKGEIQM AAGAHMADEG YLMVDLDFPW PMYMVQSAPK
LRLHKLLSRR DTLMLMIKIK DDLGGRITDG GGVRKDITKN DKAKLQKGYE RAKGILQNAG
AKGVFSGWTV AAHPGGTVKI GDVVDSNLKT EKENLYVCDC SVMPDAWGIP PTLTLLALGK
RLAKHLGEEM DAK