Gene Daro_3706 details

Gene Information

Locus tag: Daro_3706
Symbol:
ID: 3567918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3984113
End bp: 3985057
Gene Length: 945 bp
Protein Length: 314 aa
Translation table: 11
GC content: 65%
IMG OID: 637682179
Product: hypothetical protein
Protein accession: YP_286905
Protein GI: 71909318
COG category: [F] Nucleotide transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0352] Thiamine monophosphate synthase
        [COG1051] ADP-ribose pyrophosphatase
TIGRFAM ID: [TIGR00586] mutator mutT protein
            [TIGR00693] thiamine-phosphate pyrophosphorylase
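
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start/End coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue. Both are assumptions, although they agree with the reported 945 bp and 314 aa. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3984113
end_bp = 3985057

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```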


Plasmid Coverage information

Num covering plasmid clones: 60
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.461385
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGA TCGTCGAAGT TGCCGCTGCC GTCATGCTGC GTGCCGATGG CCGCGAATTC 
CTGCTCGCCC AGCGCCCGGA AGGCAAGGTT TACGCTGGCT ACTGGGAATT CCCCGGCGGC
AAGGTCGAAC CCGGCGAAAC CGTCCGCCAG GCACTGATCC GCGAACTGCA GGAGGAACTG
GGCATCACGG TCACCGCCTG CTCGCAGTGG CTGACCCGGC AATTTACCTA CCCGCATGCC
ACCGTCCGCC TGAACTTCTG GCGAGTCACC GCCTGGGATG GCGAGATCGG CATCACCGCA
CCGCTCGAAC ATTCGGCAGT CGAGTGGCAA AAAACAGGAG GGGCCGCCAG CGTCGCCCCC
ATCCTGCCGG CCAACGACCC GATCCTGAAA GCCCTGTCGT TGCCGACAAC GATGGCCATC
ACGATGGCTG AAAGCGAAGG CACCGAGCGC CAGCTGGAGC GCCTCGAAGA AGCCCTGAAT
GCCGGCCTGC GCCTGATCCA GATTCGCGAC AAAAGCCTGC CGCCAGCCCA GCGCCTGTGG
TTTGCCGAAA CCGTGCTGCA ACTGGCCCGC AGCCATGGCG CCACGGTTGT CATCAACGAC
GACGAAGCAC TGGCCAGACG CATCGGCGCC GATGGTGTCC ACCTGTCAGC GGCACGCTTG
GCCGCTTGCC AGCAACGCCC GGACTTCACC TGGGTGGGCG CCTCCTGCCA TAGCGCGGAG
GAAATCGTCC GGGCCGGCGA ACTTGGTCTG GATTACGCGC TGCTGGGTCC GGTAATGCCA
ACGCCAACCC ATCCTGAATC AACCGGGCTC GGCTGGACTG AATTCGAAGG GCGACTGGCC
GGCAATACGC TGCCGGTGTT TGCGCTGGGC GGCATGAAGC CGGGAATGCT GGCCGAGGCC
CAAGGCCACG GCGCCCACGG ATTGGCGCTT ATGCGCGGCT GGTAG
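
The GC content reported in the gene information could in principle be recomputed from the gene sequence above. The sketch below shows one way to do so. The helper name `gc_content` is hypothetical and not part of the source database, and the function ignores the spaces and line breaks used to group the bases. It is demonstrated here only on the first ten-base block of the sequence.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Keep only nucleotide letters; grouping spaces and newlines are dropped.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First block of ten bases from the gene sequence above.
print(gc_content("ATGACCAAGA"))
```

Passing the full 945-base sequence as a single string would give the gene-wide value to compare against the rounded figure in the record.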
 
Protein sequence
MTKIVEVAAA VMLRADGREF LLAQRPEGKV YAGYWEFPGG KVEPGETVRQ ALIRELQEEL 
GITVTACSQW LTRQFTYPHA TVRLNFWRVT AWDGEIGITA PLEHSAVEWQ KTGGAASVAP
ILPANDPILK ALSLPTTMAI TMAESEGTER QLERLEEALN AGLRLIQIRD KSLPPAQRLW
FAETVLQLAR SHGATVVIND DEALARRIGA DGVHLSAARL AACQQRPDFT WVGASCHSAE
EIVRAGELGL DYALLGPVMP TPTHPESTGL GWTEFEGRLA GNTLPVFALG GMKPGMLAEA
QGHGAHGLAL MRGW
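
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows, assuming the standard codon assignments, which table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative and not taken from the source database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three ten-base blocks of the gene sequence above.
print(translate("ATGACCAAGA TCGTCGAAGT TGCCGCTGCC"))  # prints MTKIVEVAAA
```

The output matches the first ten residues of the protein sequence listed above. Supplying the full gene sequence would allow the whole translation to be compared the same way.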