Gene Daro_0302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0302 
Symbol 
ID3569752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp344356 
End bp346212 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content62% 
IMG OID637678740 
Productdihydroxy-acid dehydratase 
Protein accessionYP_283531 
Protein GI71905944 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000000297284 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.110934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCAAT ACCGCTCCCA CACCTCGACC CACGGCCGCA ACATGGCTGG TGCCCGCTCC 
TTGTGGCGCG CCACCGGCAT GAAGGATGGT GACTTCGGCA AGCCGATCAT CGCCGTGGTC
AATAGCTTCA CCCAGTTCGT GCCGGGCCAC GTCCATCTGA AGGACATGGG GCAGCTGGTG
GCGCGCGAAA TCGAAGCCGC TGGCGGCGTC GCCAAGGAAT TCAACACCAT CGCCATCGAC
GACGGCATCG CCATGGGCCA CAGCGGTATG CTGTATTCGC TGCCCAGCCG CGACCTGATC
GCCGACTCGG TCGAATACAT GGTCAACGCC CACTGTGCCG ACGCCATGGT CTGCATCTCC
AACTGCGACA AGATCACCCC GGGCATGCTG ATGGCCGCCA TGCGCCTCAA CATCCCGGTC
ATCTTCGTGT CCGGTGGCCC GATGGAAGCC GGCAAGGTCA AGATTCAGGG GCAAGTTATC
CATCTCGACC TCATTGATGC CATGGTCAAG GCTGCAGATC ACTCGGTTTC CCAAGCTGAA
CTGGATGATG TCGAGCGTTC GGCCTGCCCG ACCTGTGGCT CCTGCTCCGG CATGTTTACC
GCCAATTCGA TGAACTGCCT GACCGAAGCC TTTGGCCTCA GCCTGCCCGG CAACGGTACC
GTCGTCGCCA CGCACGCCGA CCGCAAGCAA CTCTTCCTGC GTGCCGGTCG CCAGATCGTC
GAGCTGTGCA AGCGCTACTA CGAGCAGGAT GATGCCTCGG TCCTGCCGCG TGCGATCGCC
ACCAAGGCCG CCTTCGAAAA TGCCATGACC CTCGATGTCG CCATGGGTGG CTCGACCAAC
ACCGTGCTGC ATATCCTGGC TACGGCGCAG GAGGCCGGTG TTGACTTCAC GATGGCCGAC
ATCGACCGCA TTTCGCGCTC GGTGCCGTGC CTGTGCAAGG TCGCGCCGAT GACTGACAAG
TATCACATTG AAGACGTCCA TCGCGCTGGT GGCATCATGG GCATCCTCGG CGAACTCGAC
CGGGCTGGTC TGATCAATCG TGACGTCCCG AACGTCTATG CAAAAAACCT CGGTGAAGCA
ATCGACCGTT GGGATGTGGT GCGCCAGCAC GACGTCAAGG TGCACGAATT CTTCAAGGCA
GCGCCCGGTG GCGTGCCAAC GCAAGTCGCT TTCTCGCAGG ATCGCCGCTT CAACGAACTG
GATATTGATC GTACCCACGG CTGTATCCGC AACAAGGCCA ACGCCTATTC GCAGGAAGGT
GGTCTGGCTG TGCTCTACGG CAACATCGCG CTGGATGGTT GTATCGTCAA GACAGCCGGT
GTCGACGAGT CGATCTGGAA ATTCACCGGC AAGGCCCGCG TTTTTGAAAG CCAGGATGCT
GCGGTCGAAG CCATCCTGGG CGAGAAAATC GTCGCCGGCG ACGTTGTCGT TATCCGCTAC
GAAGGCCCGA AGGGCGGCCC CGGCATGCAG GAAATGCTCT ACCCGACGTC TTATCTGAAA
TCGATGGGTC TCGGCAAGGA ATGTGCGCTG CTCACCGATG GTCGTTTCTC CGGGGGCACT
TCCGGTCTCT CCATTGGTCA TGCCTCGCCC GAGGCGGCCG ATGGCGGCGC CATCGGTCTG
GTGGAGGAGG GCGATACCAT CGAAATCGAC ATCCCGAATC GTCGCATCCA TCTGGCGGTC
ACCGATGGCG AGCTGGCCCA GCGTCGCGCC GCGATGGAAG CCAAGGGCGA AGCCGCCTGG
CAGCCGGTTA GTCGCGAACG GGTCATTTCG CCTGCCTTGC AAGCCTACGC CCTGATGGCG
ACGTCGGCCG ACAAGGGTGC CGTGCGCGAC GTCAAACAGA TCCAGCGCCG CAAGTGA
 
Protein sequence
MPQYRSHTST HGRNMAGARS LWRATGMKDG DFGKPIIAVV NSFTQFVPGH VHLKDMGQLV 
AREIEAAGGV AKEFNTIAID DGIAMGHSGM LYSLPSRDLI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAAMRLNIPV IFVSGGPMEA GKVKIQGQVI HLDLIDAMVK AADHSVSQAE
LDDVERSACP TCGSCSGMFT ANSMNCLTEA FGLSLPGNGT VVATHADRKQ LFLRAGRQIV
ELCKRYYEQD DASVLPRAIA TKAAFENAMT LDVAMGGSTN TVLHILATAQ EAGVDFTMAD
IDRISRSVPC LCKVAPMTDK YHIEDVHRAG GIMGILGELD RAGLINRDVP NVYAKNLGEA
IDRWDVVRQH DVKVHEFFKA APGGVPTQVA FSQDRRFNEL DIDRTHGCIR NKANAYSQEG
GLAVLYGNIA LDGCIVKTAG VDESIWKFTG KARVFESQDA AVEAILGEKI VAGDVVVIRY
EGPKGGPGMQ EMLYPTSYLK SMGLGKECAL LTDGRFSGGT SGLSIGHASP EAADGGAIGL
VEEGDTIEID IPNRRIHLAV TDGELAQRRA AMEAKGEAAW QPVSRERVIS PALQAYALMA
TSADKGAVRD VKQIQRRK