Gene Daro_3387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3387 
Symbol 
ID3567117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3640204 
End bp3641505 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content61% 
IMG OID637681859 
Producthistidinol dehydrogenase 
Protein accessionYP_286586 
Protein GI71908999 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCCA TTAAACGTCT TGCGACGGTC GACGCCGATT TCAAGGCGCA AATGGATGCG 
CTGCTCGCTT TCGAGGCAGC GCAGGATGAA GGCATCGAAC GCACCGTCAT CGGCATTTTG
GCCGATGTGA AGGCGCGCGG TGATGCCGCA GTAGTCGAAT ACAGCAACAA GTTCGACCGT
CTGACTGCCA GTAGCATGGC CGACCTCGAG TTGTCGAAGG CCGAAATGCA GAAGGCACTC
GACGGCTTGC CCGCTGATCA GCGTCAGGCG CTGGAGGCGG CCGCTCACCG CGTTCGCGTT
TATCACGAAA AGCAGCGGAT GGAAGGCTGG TCCTATACTG AAGCCGACGG CACCATGCTC
GGTCAGATGA TCACTCCGCT CGACCGCGTC GGCCTCTATG TGCCGGGCGG CAAGGCGGCT
TACCCTTCTT CCGTGCTGAT GAATGCGATT CCGGCCAAGG TGGCAGGCGT CAAGGAACTG
ATCATGGTTG TCCCGACGCC GGGTGGTGAG CACAACCAGT TGGTGCTGGC GGCTGCTTGT
CTGGCCGGCG TCGACCGTGT TTTCACCATC GGTGGGGCGC AGGCCGTTGG CGCGCTGGCC
TACGGCACCG AGGCTGTGCC GCAGGTCGAC AAGATTGTTG GTCCTGGCAA TGCGTATGTG
GCCTGTGCCA AGCGTCGGGT GTTTGGTATC GTCGGCATCG ATATGATTGC CGGTCCGTCG
GAGATTCTGG TTGTGGCTGA TGGCAGTAGC GATCCTGACT GGGTGGCGAT GGACCTCTTC
TCGCAGGCCG AGCATGATGA ACTGGCGCAA TCGATCCTGA TCTGCACTGA TGCCGCCTAT
ATCGACCGCG TGCAGGCCAG CATTGAAAAA CTGCTGCCGA CCATGCCGCG TCGCGAAGTG
ATCGAAACCT CGCTGACCAA CCGCGGGGCG CTGATCCTCG TGCGTGATCT CGAAGAAGCC
TGCGCCATTG CCAACCGCGT GGCACCGGAA CACCTCGAGC TGTCGCTGGC CGATCCAGAT
CCCTGGGTTG CCAAAATTCA CCACGCCGGT GCCATCTTCA TCGGTCACTA CACCTCCGAG
TCGCTTGGCG ACTACTGTGC CGGCCCGAAC CACGTACTCC CGACGTCCGG CAGTGCGCGC
TTCTCGTCTC CGCTGGGTGT CTATGACTTC CAGAAGCGAA CCAGTCTGAT CAAGGTGTCC
AAGGCTGGTG CGCAGACCTT GGGCAAGATC GCCTCGACGC TGGCCCATGG CGAAGGACTG
CCGGCGCACG CCAAGTCGGC AGAGTTCCGG CTCGAAAATT GA
 
Protein sequence
MVAIKRLATV DADFKAQMDA LLAFEAAQDE GIERTVIGIL ADVKARGDAA VVEYSNKFDR 
LTASSMADLE LSKAEMQKAL DGLPADQRQA LEAAAHRVRV YHEKQRMEGW SYTEADGTML
GQMITPLDRV GLYVPGGKAA YPSSVLMNAI PAKVAGVKEL IMVVPTPGGE HNQLVLAAAC
LAGVDRVFTI GGAQAVGALA YGTEAVPQVD KIVGPGNAYV ACAKRRVFGI VGIDMIAGPS
EILVVADGSS DPDWVAMDLF SQAEHDELAQ SILICTDAAY IDRVQASIEK LLPTMPRREV
IETSLTNRGA LILVRDLEEA CAIANRVAPE HLELSLADPD PWVAKIHHAG AIFIGHYTSE
SLGDYCAGPN HVLPTSGSAR FSSPLGVYDF QKRTSLIKVS KAGAQTLGKI ASTLAHGEGL
PAHAKSAEFR LEN