Gene Dole_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1643 
Symbol 
ID5694480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1951350 
End bp1952498 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content61% 
IMG OID641264238 
Productsaccharopine dehydrogenase 
Protein accessionYP_001529524 
Protein GI158521654 
COG category[S] Function unknown 
COG ID[COG3268] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGATC TGGGAGGCAT CATGGACAAT CAGAAAATCG TACTTTTTGG CGCTACCGGC 
TATACCGGAA AGCAGGTGGC CCAGGAGCTG GTCAGGCGGG GCCTTTTGCC GATTCTGTGC
GGCCGCAGCC GTGAAAAGCT GGAGTCTGTG GCCGCGGAAC TGGGCGGCCT GAAAACCGCG
GTTGTCGATG TTGCCGACCC GGCCGGTCTG GCGGCCCTGG TGGGGAAAGG GGATATTCTG
GTCTCCACGG TGGGGCCGTT TGCAAAGTAC GGCACAACCG CGGTTTCCGT TGCCGCTGAA
AAAGGCGCTG TTTATATCGA CTCCACCGGT GAACCCTCCT TTATCGCCCG GGTGTTTGAA
ACTTACGGAC CGCAGGCCCG TTCCACCGGC GCCACTCTGC TGACCGCCTG CGGGTACGAC
TATATTCCGG GCAACTGCGC GGCCGGCATT GCCTTGAGCG CATCGGGCAA GAAGGCGGTG
CGGGTGGACG TGGGCTATTA TTCAAAAAAG AAGGGCAGGG TCCAGCCCCT TGACATGAGC
CAGGGGACCG CCTCTTCCCT TCGTCTGGCC ATGATCGATC CCGTCAAGGT ATGGCAGTCG
GGGAAGCTGG TGGAGCAGAC CGGCGGTATC CGCACGCGCA CTTTTGATCT GGACGGCCAA
CCCCATCCCG GTCTGACCGT GTCGTGTACC GAACATTTCT CCCTGCCGCG GGTTTTTCCC
GAGCTGCGGG AGATCAATAC CTACCTGGGA TGGTTTGCCG GCAAAACTTA TATCATGCAG
AAGGCGGCCC TGTTCCAGTC GGTTGCCGGA AAAATCCCAG GATACCGTTC ACTGGCAAGG
GCCGCGCTGT CCATGCTGCC GGAGAGCACA GGCAAAGGAC CGTCACCGGA AATACTGCAA
CAGCACCAGA CCCACGTGGT GGCTGAAACC TTTGACGAAA AAGGCCGCCT GCTGGCCCGG
GCCGATCTTG TGGGTGTTGA CGGTTACTCG TTTACGGCGA AAATGATGGC CTGGGCCGCT
CACCGGGCCG CCGTCCAGGG GTTTCGGGCC ACCGGCGCTG TCGGCCCCAT CGAAGCCTTT
GACCTGGACG GCCTGATCGA GGGGTGTGAG GCATGCGGAT TGACGCCGTC GGTGCATCTG
GGAAAATAA
 
Protein sequence
MKDLGGIMDN QKIVLFGATG YTGKQVAQEL VRRGLLPILC GRSREKLESV AAELGGLKTA 
VVDVADPAGL AALVGKGDIL VSTVGPFAKY GTTAVSVAAE KGAVYIDSTG EPSFIARVFE
TYGPQARSTG ATLLTACGYD YIPGNCAAGI ALSASGKKAV RVDVGYYSKK KGRVQPLDMS
QGTASSLRLA MIDPVKVWQS GKLVEQTGGI RTRTFDLDGQ PHPGLTVSCT EHFSLPRVFP
ELREINTYLG WFAGKTYIMQ KAALFQSVAG KIPGYRSLAR AALSMLPEST GKGPSPEILQ
QHQTHVVAET FDEKGRLLAR ADLVGVDGYS FTAKMMAWAA HRAAVQGFRA TGAVGPIEAF
DLDGLIEGCE ACGLTPSVHL GK