Gene Daro_3737 details

Gene Information

Locus tag: Daro_3737
Symbol:
ID: 3567372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4016582
End bp: 4017691
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 65%
IMG OID: 637682211
Product: bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II-like protein
Protein accession: YP_286936
Protein GI: 71909349
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
        [COG0807] GTP cyclohydrolase II
TIGRFAM ID: [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
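
The coordinates and lengths in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record above.
start_bp = 4016582
end_bp = 4017691
gene_length_bp = 1110
protein_length_aa = 369


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 4017691 - 4016582 + 1 gives 1110 bp, and 1110 / 3 - 1 gives 369 aa, which agrees with the listed gene and protein lengths.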


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value: 0.983513
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.19888
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCCCCCC ATCCCGCCAT AGCCAGCACT GCTGAAATCG TCACCGAACT CAAGGCCGGC 
CGCATGGTCA TCCTGGTCGA TGAAGAAGAC CGCGAAAACG AGGGCGACCT CGTCATGGCC
GCCGAGCACA TCACGCCTGA AGCCATCAAT TTCATGGCCA AGTTCGGCCG TGGCCTGGTC
TGCCTGACCC TGACCGAAGC ACGCTGCAAA AAGCTGGGGC TGACCCAGAT GGCGCGCAAC
AACGGTACGG TTTACGGCAC CGCCTTCACC GTTTCCATCG AAGCTGCCGA GGGCGTCACC
ACCGGCATCT CCGCCGCTGA CCGGGCGCGC ACCATCCAGG TCGCCGTCAA CAAGGCATCG
ACGGCTGACG ACATCGTGCA GCCCGGCCAC GTCTTCCCGA TCACCGCCCG CGAAGGTGGT
GTGCTGGTCC GCGCCGGCCA TACCGAAGCC GGCTGCGACC TCGCTGGCAT GGCCGGCCTG
GAGCCATCCT CCGTTATCTG CGAGATCATG AACGACGACG GCACGATGGC CCGTTTGCCG
GAACTGATCG AATTCGCCAA GGAACATGGC CTGAAAATCG GCACCATCGC CGACCTGATC
CATTACCGTG CCTCCTCGGA AACCCTGGTC GAGCGCGTCA CCAGCAAAAC CGTCTCCACG
GCCCACGGCG ATTTCACCCT GCACGCTTAC GTCGATCGGG CCAGCGGCGC CACGCACCTC
GCCATGGTTA AAGGCAGCCT CCCGGCCGGC GGCGAAACCC TGGTTCGCGT CCATGAGCCC
TTATCTGTCC TCGACTTCCT TGATCCGGCC AGCAAACGCC AGGCGTTCTC GATCGACCAG
GCGCAGGCTG CGCTGGCCAA GCACGGCCAC GGCGTCATCG TCCTGATGCA TCGCCCGGAA
GATGGCGCGG CGCTGCTTTC CCGCCTGACC GGCACCGCCC CGACCGCCCC CGCCAAGTGG
GACCCGCGCA GCTACGGCAT CGGCGCCCAG ATTCTGCGTG ACCTCGGCGT GACCAAGATG
CGTCTGCTCT CCAGCCCGCG CAAGATGCCT TCCATGACCG GCTTCGAACT TGAAGTCACC
GGTTTCGTCA CCTCTCCTGC GGAGCTCTAA
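
The GC content value in the record is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as text in the 10-base blocks shown above; `gc_percent` is a hypothetical helper name, and the record's own rounding rule is not stated.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in cleaned)
    return 100.0 * gc / len(cleaned)


# First 10-base block of the gene sequence: 1 G and 7 C out of 10 bases.
print(gc_percent("ATGCCCCCCC"))
```

Passing the full gene sequence to the same function gives the gene-wide figure to compare with the listed GC content.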
 
Protein sequence
MPPHPAIAST AEIVTELKAG RMVILVDEED RENEGDLVMA AEHITPEAIN FMAKFGRGLV 
CLTLTEARCK KLGLTQMARN NGTVYGTAFT VSIEAAEGVT TGISAADRAR TIQVAVNKAS
TADDIVQPGH VFPITAREGG VLVRAGHTEA GCDLAGMAGL EPSSVICEIM NDDGTMARLP
ELIEFAKEHG LKIGTIADLI HYRASSETLV ERVTSKTVST AHGDFTLHAY VDRASGATHL
AMVKGSLPAG GETLVRVHEP LSVLDFLDPA SKRQAFSIDQ AQAALAKHGH GVIVLMHRPE
DGAALLSRLT GTAPTAPAKW DPRSYGIGAQ ILRDLGVTKM RLLSSPRKMP SMTGFELEVT
GFVTSPAEL
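
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a simplified check, not the database's own pipeline: it uses the codon-to-amino-acid assignments that table 11 shares with the standard code and does not model alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI base order (T, C, A, G), first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence listed above.
gene_start = "ATGCCCCCCC ATCCCGCCAT AGCCAGCACT"
gene_end = "GGTTTCGTCA CCTCTCCTGC GGAGCTCTAA"
print(translate(gene_start))  # MPPHPAIAST
print(translate(gene_end))    # GFVTSPAEL
```

The two fragments reproduce the first ten and last nine residues of the listed protein sequence; the final TAA codon is the stop and contributes no residue.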