Gene Dole_1208 details


Gene Information

Locus tag: Dole_1208
Symbol:
ID: 5694042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1441112
End bp: 1442386
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 61%
IMG OID: 641263801
Product: homoserine dehydrogenase
Protein accession: YP_001529091
Protein GI: 158521221
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4091] Predicted homoserine dehydrogenase
TIGRFAM ID:
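
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1441112
END_BP = 1442386
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1275
print(expected_protein_length(GENE_LENGTH_BP))  # 424
```

Under these assumptions both derived values agree with the reported gene length and protein length.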


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAGC AAAACGGCTC AGACCCCGCA AAAATCAGGA TCGGCATTAT CGGCATCGGC 
TCCATGGGCA AGGGGCTGGT ATACCAGGCC CACATCACCC CGGGGGTGCG GTGCGTGGCC
GTGTGCGATA CCGATGTCAA ACGGTGCACG GCCGTGCTCA CATGGCTGCA TATACCCCAT
TCGATTGCCA CCAGCCGGGC CGTCATGGAG GATGTGACCC GCCGGGGAGA AGTGGCGGTT
TGCGAAGATG GTTTATGGGT TGCCGAATGC GCGGACGTCG ACGTTGTCAT TGAGGCGTCC
AGCGCCATTC TTCCAGCGGC AGAATTCGCC CTGGCCACCC TGAACAGCGG CAAGCACCTG
GTTCTGATGA ACTCTGAGAT CGATCTGTTG TTCGGCCCGC TGCTGGCGGA CATCGCCCGT
AAAAACGGCG TGGTCTGCAC CAGCTGCGAC GGCGACCAGT ACGGCGTGCT CAAGCACCAG
ATCGACGACC TGGCGTTATG GGGGCTGGAC CTGGTCATGG CCGGCAACAT CAAGGGATTC
CTGGACCGCT CGGCCAACCC CACCTCCATC GTTCCCGAAG CGGACATCCG CAACCTTGAC
TACCGCATGT GCACCTCCTA CACGGACGGC ACAAAACTCA ATATCGAGAT GGCCATCATT
GCCAACGCCT GCGGCCTGAT CACCACAACG CCGGGCATGC ACGGGCCCCG GGCCGCCCAT
GTCCAGGACG TGTTCAATTG CTTTGATTTT GACGCCCTGT GGAAGGACCG CCGCCCCTTT
GTGGATTACA TCCTGGGGGC CGAGCCCGGC GGCGGGGTGT TTGTGATCGG CCATTGCGAC
AATCCCTATC AGCGGGAGAT GCTGGCCTAC TACAAGATGG GGCCCGGCCC GTTCTACCTG
TTTTACCGGC CCTACCACCT GTGCCATATC GAGGCCATGG GAACCGTCCT TCAGGCAGCA
CGGCGGCAAA CGCCCTTCCT TGTTCCGGAT TACGGGTTCC AGACCCAGGT GTATGCCTAT
GCCAAACGCG ACCTGAAAGC CGGTGAAGTG CTGGACGGCA TCGGTGGCTA CTGCTGCTAC
GGCCTGATTG AAAATTTTAA GGAAAACCAC GCCTCACCCG GCCTGCCCAT CGGCCTGGCC
GATAACGTGG CCCTGAGACG CGATGTGCCG GAACAGGGGC GAATTTCCCT GGATGACGTG
AGTTACGATC CCGCACGCCT GGATTTTGCG CTTTTTGACC GGGCCTTCGG GCTTCCTGCC
AATGCGGCGG TATGA
 
Protein sequence
MQKQNGSDPA KIRIGIIGIG SMGKGLVYQA HITPGVRCVA VCDTDVKRCT AVLTWLHIPH 
SIATSRAVME DVTRRGEVAV CEDGLWVAEC ADVDVVIEAS SAILPAAEFA LATLNSGKHL
VLMNSEIDLL FGPLLADIAR KNGVVCTSCD GDQYGVLKHQ IDDLALWGLD LVMAGNIKGF
LDRSANPTSI VPEADIRNLD YRMCTSYTDG TKLNIEMAII ANACGLITTT PGMHGPRAAH
VQDVFNCFDF DALWKDRRPF VDYILGAEPG GGVFVIGHCD NPYQREMLAY YKMGPGPFYL
FYRPYHLCHI EAMGTVLQAA RRQTPFLVPD YGFQTQVYAY AKRDLKAGEV LDGIGGYCCY
GLIENFKENH ASPGLPIGLA DNVALRRDVP EQGRISLDDV SYDPARLDFA LFDRAFGLPA
NAAV
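
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons with the amino-acid assignments of translation table 11 (the table named in this record). The sketch below is not the database's own pipeline: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the sequence is read in frame from its first base and that the start codon is ATG (table 11 also permits alternative start codons, which this sketch does not handle specially).

```python
# Amino-acid assignments for translation table 11, listed in the usual
# TCAG codon order (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks and the final line of the gene sequence.
first_blocks = "ATGCAAAAGC AAAACGGCTC AGACCCCGCA"
last_line = "AATGCGGCGG TATGA"

print(translate(first_blocks))  # MQKQNGSDPA
print(translate(last_line))     # NAAV*
```

The two printed fragments correspond to the first ten residues and the last four residues of the protein sequence, followed by the stop codon TGA.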