Gene Daro_3665 details

Gene Information

Locus tag: Daro_3665
Symbol:
ID: 3567607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3939779
End bp: 3940792
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 61%
IMG OID: 637682138
Product: dihydrouridine synthase TIM-barrel protein nifR3
Protein accession: YP_286864
Protein GI: 71909277
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
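
Consistency check (sketch): the gene and protein lengths listed above follow from the start and end coordinates if the positions are 1-based and inclusive. That convention is an assumption here, since the record does not state it, and the function names below are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3939779, 3940792)
print(length, protein_length(length))  # 1014 337
```

Both values agree with the Gene Length and Protein Length fields of this record.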


Plasmid Coverage information

Num covering plasmid clones: 59
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTTG TCGGTTTCCA GCTTCGCAAC AACCTGTTCG TCGCCCCCAT GGCCGGCGTG 
ACGGATCGTC CTTTCCGCCA GTTGTGCAAG AAGATGGGGG CTGGCCTGGC CGTGTCCGAA
ATGGTCACCT CCAATTCATT GCTTTATGGC AGCGCCAAGA CGCTGCGCCG GGCCAATCAC
GAAGGTGAGG TCGCCCCGAT CTCGGTGCAG ATCGCCGGCG CCGATCCGAA AATGATGGCC
GAGGCGGCCA AACACAACGT CGACAACGGC GCCCAGATCA TCGACATCAA CATGGGTTGC
CCGGCCAAGA AGGTCTGCAA CGTGATGGCC GGCTCGGCGC TGATGCAGGA CGAGCAGCAG
GTCGGACGCA TTCTCGATGC CGTCGTTGCT GCAATCCCGA ACACCCCGGT CACGCTGAAA
TTCCGTACCG GCTGGAATCT GGCCAACAAG AACGCCCCGA CCATCGCGCG CATCGCCGAA
TCGGCGGGCA TCCGTGCCGT CGCCATCCAC GGCCGGACGC GCTGCCAGCA ATACACCGGC
GAGGCGGAAT ACGACACCAT CGCCATGGTC AAGACGCTGA TCAGCATCCC GGTCATCGCC
AACGGCGACA TCACGACCCC GGAAAAGGCC AAGCACGTGC TCGACGTGAC CGGCGCCGAT
GGCGTCATGA TCGGCCGCGC CGCCCAGGGT CGCCCCTGGC TGTTCCGCGA GATCGAACAC
TATCTAAAAA CCGGCGAGCA CCTGCCACCG GCCGAGGTCA TGGAGATTCA CAGCATCCTG
CTGGAGCATC TCGAAGACCT TTACGCTTTC TACGGCCCGG AAACGGGGTT CAAGGTCGCC
CGCAAGCACA TCTCCTGGTA CACCAAGGGG TTGGTTGGCT CGGCGGCCTT CCGCAAGGAA
ATGAACGTCC TGCCCAGCAT CGATCAACAG ATGCAGGCAG TGAACGACTT CTTCAGCCGA
CTGGCGGCTG AGCATCAGCA TTTGAAATAC ACAGAGGAGG CGTTGGCAGC ATGA
 
Protein sequence
MDFVGFQLRN NLFVAPMAGV TDRPFRQLCK KMGAGLAVSE MVTSNSLLYG SAKTLRRANH 
EGEVAPISVQ IAGADPKMMA EAAKHNVDNG AQIIDINMGC PAKKVCNVMA GSALMQDEQQ
VGRILDAVVA AIPNTPVTLK FRTGWNLANK NAPTIARIAE SAGIRAVAIH GRTRCQQYTG
EAEYDTIAMV KTLISIPVIA NGDITTPEKA KHVLDVTGAD GVMIGRAAQG RPWLFREIEH
YLKTGEHLPP AEVMEIHSIL LEHLEDLYAF YGPETGFKVA RKHISWYTKG LVGSAAFRKE
MNVLPSIDQQ MQAVNDFFSR LAAEHQHLKY TEEALAA
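
Working with the sequences (sketch): the gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The minimal Python sketch below strips the blocks, computes a GC fraction, and translates with the amino-acid assignments of NCBI translation table 11. The helper names (clean, gc_fraction, translate) are hypothetical and not part of the source database. The sketch reads every codon by its standard amino acid, which is adequate here because this gene starts with ATG.

```python
from itertools import product

# Amino acids for NCBI translation table 11, with codons enumerated in the
# conventional T, C, A, G base order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and the final blocks of the gene sequence listed above.
print(translate("ATGGATTTTG TCGGTTTCCA"))    # MDFVGF
print(translate("GAGGAGG CGTTGGCAGC ATGA"))  # EEALAA
```

The two fragments reproduce the start (MDFVGF) and the end (EEALAA) of the protein sequence listed above. Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field.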