Gene Daro_1232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1232 
Symbol 
ID3569416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1338605 
End bp1339696 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content59% 
IMG OID637679699 
Productprephenate dehydratase 
Protein accessionYP_284458 
Protein GI71906871 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.964016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG ATCTGCAAAA GGCCTTGGCT GGCGTTCGTA CCGATATTGA TCGCATTGAC 
GGGGAACTGC TCAAGCTGCT GAACGAACGC GCTCGCTGTG CGCAGCGGGT TGGCGAGATC
AAGGCTGAAC ACGGCGAAGC GGGGCATATC TACCGTCCGG AACGTGAAGC TCAGGTTTTG
CGCCGCTTGC AGGAGGCCAA CCCGGGGCCG CTCCCTGGCG AGAACATCAC CTTCTTCTTC
CGTGAAGTGA TGTCGGCCTG CTTGTCGCTG GAAGAGCCCT TGGGTATTGC CTTTCTCGGG
CCGCTGGGCA CCTTCTCCGA ATCTGCGGCC ACCAAGCATT TTGGTCATGC CGCACGCTTG
CTGCCGCAAT CGTCGATCGA CGACGTCTTC CGCGAGGTTG AGTCCGGCCA CGCCCATTAT
GCCGTCGTCC CGGTCGAGAA TTCGACTGAA GGTGCGGTCG GTCGGACCAT GGATTTGTTG
CTGGCCACAC CGCTGAAAAT CTGCGGCGAA GTCGTGCTGC GTATCCACCA GAACCTGCTG
ACCAACGAAA CCGACCTGGG CAAGATTACC AAGGTCTATT CGCATGCCCA GTCGTTGGCT
CAATGCCACG AGTGGCTGAA CCGAGTCTTG CCCAAGGCGC AGCGTATCTC CGTTGGCAGC
AATGCCCAGG CCGCCCAGAA CGCAGCCGCT GAGCCAGGTA CGGCGGCGAT TGCCGGTGAA
GCTGCCGCAG CGCGCTATAA CTTGCCGAAA TTGGTCGAGA ACATCGAGGA TGAGCCGAAC
AATACGACCC GTTTCCTGGT CCTTGGCAAG CATGACTCCG GTATTTCCGG GCGTGACAAG
ACGTCACTGA TCATGTCGGC ACCCAACCGG ACCGGCGCCT TGCATGAGTT GCTGCTCCCG
CTGTCTACCG CCGGGGTTTC GATGTGTCGT CTTGAATCCC GTCCAGCCCG CAATGCGCTG
TGGGAATACG TCTTCTACGT TGATATCGAG GGCCATCGCG ACGAACCAGC GATCAAGGCG
GCGCTCGAAA AACTGGCTGG CTATGCCGCC TATCTGAAAA TCCTCGGGTC CTACCCGGTT
GCCGTTTATT GA
 
Protein sequence
MSDDLQKALA GVRTDIDRID GELLKLLNER ARCAQRVGEI KAEHGEAGHI YRPEREAQVL 
RRLQEANPGP LPGENITFFF REVMSACLSL EEPLGIAFLG PLGTFSESAA TKHFGHAARL
LPQSSIDDVF REVESGHAHY AVVPVENSTE GAVGRTMDLL LATPLKICGE VVLRIHQNLL
TNETDLGKIT KVYSHAQSLA QCHEWLNRVL PKAQRISVGS NAQAAQNAAA EPGTAAIAGE
AAAARYNLPK LVENIEDEPN NTTRFLVLGK HDSGISGRDK TSLIMSAPNR TGALHELLLP
LSTAGVSMCR LESRPARNAL WEYVFYVDIE GHRDEPAIKA ALEKLAGYAA YLKILGSYPV
AVY