Gene Daro_2386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2386 
Symbol 
ID3568600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2571595 
End bp2572905 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content58% 
IMG OID637680853 
Producthomoserine dehydrogenase 
Protein accessionYP_285592 
Protein GI71908005 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCA TCAATGTTGG CCTTATCGGC ATCGGCACCG TCGGCGGTGG CACCTGGACC 
GTTCTTAAGC GCAACGAGGA AGAGATTACC CGTCGTGCCG GTCGGCCGAT TCGCATCACC
GCTGTGGCCG ACAAGAACGT TGAACTCGCC AAGCAGATTA CTGGTGGTGC AGCCCGTGTT
ACCGACGATG CCTTTTCGTT GGTCAATGAC CCGGAAATCG ATATCATCGT CGAACTGATC
GGCGGTTATG GTGTTGCCAA GGAAGTGGTC ATGCAGGCCA TCGCCAACGG CAAACATGTC
GTGACGGCCA ACAAGGCACT GCTTGCCGTG CATGGCACTG AAATCTTTAC GGCCGCTCAA
CAAAAGGGCG TGATGGTCGC TTTCGAAGCG GCAGTAGCCG GCGGCATTCC CATCATCAAG
GCATTGCGCG AGGGCCTGAC GGCCAATCGC ATCGAGTGGG CGGCCGGCAT TATCAATGGT
ACGACCAACT TCATCCTGTC TGAAATGCGC GACAAGGGCC TGTCCTTTGG CGATGTGCTG
AAAGAAGCTC AGCGTCTTGG TTATGCCGAA GCAGATCCGA CTTTCGACAT CGAAGGTGTC
GATGCGGCCC ACAAGGCGAC ATTGATTGCT TCGATTGCCT ATGGCATTCC GGTCCAGTTC
GACAAGGCTT ACATCGAAGG CATCACCAAG CTGGAAGCCT CCGATATCAA GTATGCCGAA
CAGCTCGGTT ACCGCATCAA GCTGCTTGGC ATCGCCAAGC GTCGCGAGAA GGGTATCGAG
TTGCGCGTCC ATCCGACCCT GATCCCGGCA AAGCGCCTGC TGGCCAATGT CGAAGGTGCG
ATGAATGCCG TCATGGTCAA GGGCGACGCG GTTGGCATCA CCCTGTATTA CGGCAAGGGG
GCTGGTGCTG AGCCGACCGC TTCCGCCGTG GTGGCTGATC TGGTCGATGT TGCTCGTCTG
GCCACCGCTG ATGCCGCGCA CCGCGTGCCG CACCTGGCCT TCCAGCCGGA CGCCATGTCC
AACCTGCCGA TCCTGCCAAT GAGCGAAGTC GAGACTGGTT ATTACCTGCG TTTGCGCGTT
GAAGACAAGC CTGGGGTACT GGCAGATATC ACTCGCATCC TGGCCGATCA GGGTATTTCT
ATCGATGCCA TGCTGCAGCG CGAGCCGGAA GAGGGTGAGG GCGAGACCGA CATTATCATC
CTGACCCACA TCTGCAAGGA AAGTGCGGCC GATGCGGCGA TTGCCAAGAT CGAAGGCTTG
TCTGCGCAAA AGGGCAAGGT CAAGCGTATT CGCCTGGAAG AGCTGCAATA A
 
Protein sequence
MKPINVGLIG IGTVGGGTWT VLKRNEEEIT RRAGRPIRIT AVADKNVELA KQITGGAARV 
TDDAFSLVND PEIDIIVELI GGYGVAKEVV MQAIANGKHV VTANKALLAV HGTEIFTAAQ
QKGVMVAFEA AVAGGIPIIK ALREGLTANR IEWAAGIING TTNFILSEMR DKGLSFGDVL
KEAQRLGYAE ADPTFDIEGV DAAHKATLIA SIAYGIPVQF DKAYIEGITK LEASDIKYAE
QLGYRIKLLG IAKRREKGIE LRVHPTLIPA KRLLANVEGA MNAVMVKGDA VGITLYYGKG
AGAEPTASAV VADLVDVARL ATADAAHRVP HLAFQPDAMS NLPILPMSEV ETGYYLRLRV
EDKPGVLADI TRILADQGIS IDAMLQREPE EGEGETDIII LTHICKESAA DAAIAKIEGL
SAQKGKVKRI RLEELQ