Gene Daro_0907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0907 
Symbol 
ID3570064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp981608 
End bp982639 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content62% 
IMG OID637679365 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_284133 
Protein GI71906546 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.0284448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTGC GCGGCAAGAA CGTCACCGTC CACGACATGA CCCTGCGGGA TGGCATGCAT 
CCCAAGCGTC ACCTGATGAC CCTCGACCAG ATGGTCAGCA TCGCCACCGG CCTCGACGAA
GCCGGTATTC CGCTGATCGA AGTCACCCAC GGCGATGGTC TCGGTGGTTC CTCGGTTAAC
TACGGCTTCC CGGCCCATAC CGATGAAGAG TATCTCGGCA CCGTCATCCC GAAGATGAAG
AATGCCAAGA TCTCGGCCTT GCTGTTGCCG GGTATCGGGA CTGTCGATCA CCTGAAGATG
GCACGTGACC TCGGCGTGCA CACCATTCGC GTCGCCACGC ACTGTACTGA GGCTGATGTC
TCCGAACAGC ACATCACCAT GGCCCGCAAA CTGGACATGG ACACCGTCGG CTTCCTGATG
ATGAGCCACA TGAACGGTGC CGAAGGTCTG GTCAAGCAAG CCAAGCTGAT GGAAGGCTAC
GGCGCCAACT GTATCTACGT CACCGACTCG GCCGGCCACC TGCTGCCGGA AGGCGTCAAG
GAACGTCTCG GTGCCGTCAG AAAAGCCCTG AAGCCGGAAA CCGAACTCGG TTTCCATGGC
CACCACAACC TGGCCATGGG CGTCGCCAAC TCGATCGCCG CCATCGAAGT CGGGGCCAAC
CGCATCGACG CAGCGGCGGC CGGCCTTGGC GCCGGCGCAG GCAACACGCC GATGGAAGTG
CTGATTGCCG TGTGCAGCCT GATGGGCATC GAGACTGGGG TTGATGTCGC CAAGATCACC
GACGTGGCCG AAGACCTGGT GGTGCCGATG ATGGACTTCC CGATCCGCAT TGACCGCGAT
GCACTGACGC TCGGCTATGC CGGCGTCTAT GGTTCCTTCC TGCTCTTTGC CAAGCGCGCT
TCCGCCAAGT ACGGCGTACC GGCCCGCGAC ATTCTGGTCG AGCTGGGCCG GCGCGGCATG
GTCGGTGGGC AGGAGGACAT GATCGAGGAT ACGGCCATCA CCATGGCGCG GGAACGTGGG
CTGAAGGTCT GA
 
Protein sequence
MSLRGKNVTV HDMTLRDGMH PKRHLMTLDQ MVSIATGLDE AGIPLIEVTH GDGLGGSSVN 
YGFPAHTDEE YLGTVIPKMK NAKISALLLP GIGTVDHLKM ARDLGVHTIR VATHCTEADV
SEQHITMARK LDMDTVGFLM MSHMNGAEGL VKQAKLMEGY GANCIYVTDS AGHLLPEGVK
ERLGAVRKAL KPETELGFHG HHNLAMGVAN SIAAIEVGAN RIDAAAAGLG AGAGNTPMEV
LIAVCSLMGI ETGVDVAKIT DVAEDLVVPM MDFPIRIDRD ALTLGYAGVY GSFLLFAKRA
SAKYGVPARD ILVELGRRGM VGGQEDMIED TAITMARERG LKV