Gene Dtox_2114 details

Gene Information

Locus tag: Dtox_2114
Symbol:
ID: 8429096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2289352
End bp: 2290659
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 43%
IMG OID: 645034435
Product: homoserine dehydrogenase
Protein accession: YP_003191566
Protein GI: 258515344
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
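
The coordinates and lengths listed above are mutually consistent if Start bp and End bp are read as 1-based inclusive positions. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the arithmetic follows; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2289352
END_BP = 2290659


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1308 435
```

Both results match the Gene Length and Protein Length fields of the record.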


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0831513
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAA GTCTAATCAA AGTTGCTTTG CTGGGTGCAG GTACAATTGG TGGTGGAGTA 
TATAAGCTGC TTAATGCCAA CAGTAAGATT ATTGAACAAA GAATTGGCTC GCAAATCGTA
GTGTCAAAGG TTCTGGAACG TAACGCCAGG CGTTGTCAGG AACTTGGTAT GGCCAACAAT
TTGATTGCCA ATTCTATTGA TGAGATTATT GATGACCCGG AAATAGATAT TGTTATAGAG
TTAATGGGGG GCATAGATCC GGCCAGGGAA TTTGTGATCA AAGCTTTAGA AAAAAAGAAG
AGTGTGGTAA CTGCCAATAA AGACATGGTG GCACTGCACG GTAAGGAATT GTTTGCTGCC
GCTATTAAAA ACAAGGTGGA TTTACTGTTT GAAGCCAGTG TGGGCGGTGG TATACCTATC
ATTCGTCCTT TAAAACAATG TTTGGCTGCC AATCATATAC AGGAAATAAT GGGTATCATC
AACGGCACCA CTAACTATAT GTTAGAGAAA ATGAGTCAGG AAGGTCTGGA TTTTGATCTT
GTTTTAAAAG AAGCCCAATC CAAAGGCTAT GCGGAAGCCA ATCCCAGTTC AGATGTAGAG
GGCTTTGACG CAGCCAGAAA AATTGCTATT TTAGCTTCTA TTGCTTTCAA TACCAGAGTT
ACTGTGAACG ATGTTTATGT AGAAGGAATT TCTCGCATTA CTGCAGAAGA TATTGCTTAT
GCCCGTGAAT TGCATTATGT CGTCAAGCTT CTGGGTATTG CCAAGGAAGC GCCGGAAGGT
ATTGAGGTAA GAGTACATCC GGTTTTTATT CCCGAGCAGC ATCCTCTGGC ATCTGTGGGA
GATGTTTTTA ATGCTATATT TGTAAAAGGC GATGCGGTTG GTGAAACCAT GTTCTATGGC
CGTGGTGCAG GAGAAATGCC TACCGCCAGC GCAGTTGCAG CCGATGTCAT GGATGCTGCG
CGCAACCTGA ATAATAATGT ACGGGGTCTT GTAGGTTGTA CCTGTTTTGA GGATAAACCC
ATTAAGCCAA TCGGGCTTAC CATCAGCAAA TATTATATAC GTATGCAGGT TGCTGACCGA
CCGGGTGTTT TGGCTTCCAT TGCTTACGAG TTTGGCCGCT ATAATGTCAG CCTGGCTTCT
GTACTGCAAA AGAATACCTT GGGAGATTCG GCCGAACTGG TGCTGGTAAC TCACCGGGTC
AAGGAGCAAA ACCTAAGAGA TTCACTGGAT AAAATAAAGC TTCTGGACGC AATCGTTCAT
GAGGTTTCCA ATGTAATCAG GGTTGAAGGT GAAGATGTAA AAGCTTAG
 
Protein sequence
MQKSLIKVAL LGAGTIGGGV YKLLNANSKI IEQRIGSQIV VSKVLERNAR RCQELGMANN 
LIANSIDEII DDPEIDIVIE LMGGIDPARE FVIKALEKKK SVVTANKDMV ALHGKELFAA
AIKNKVDLLF EASVGGGIPI IRPLKQCLAA NHIQEIMGII NGTTNYMLEK MSQEGLDFDL
VLKEAQSKGY AEANPSSDVE GFDAARKIAI LASIAFNTRV TVNDVYVEGI SRITAEDIAY
ARELHYVVKL LGIAKEAPEG IEVRVHPVFI PEQHPLASVG DVFNAIFVKG DAVGETMFYG
RGAGEMPTAS AVAADVMDAA RNLNNNVRGL VGCTCFEDKP IKPIGLTISK YYIRMQVADR
PGVLASIAYE FGRYNVSLAS VLQKNTLGDS AELVLVTHRV KEQNLRDSLD KIKLLDAIVH
EVSNVIRVEG EDVKA
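
Translation table 11 assigns the same amino acid to each codon as the standard genetic code, so a small hand-built codon table is enough to spot-check that the gene and protein sequences agree. The Python sketch below is illustrative only. The `translate` helper is an assumption of this example, not the tool that produced the record, and only the first and last lines of the gene sequence are checked.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAAAAAA GTCTAATCAA AGTTGCTTTG CTGGGTGCAG GTACAATTGG TGGTGGAGTA"
last_line = "GAGGTTTCCA ATGTAATCAG GGTTGAAGGT GAAGATGTAA AAGCTTAG"

print(translate(first_line))  # prints: MQKSLIKVALLGAGTIGGGV
print(translate(last_line))   # prints: EVSNVIRVEGEDVKA
```

Both outputs match the corresponding stretches of the protein sequence. Because every full line of the gene sequence holds 60 bases, each line starts in frame and can be checked the same way.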