Gene Daro_3889 details


Gene Information

Locus tag: Daro_3889
Symbol:
ID: 3567734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4180548
End bp: 4181795
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 64%
IMG OID: 637682363
Product: dihydroorotase
Protein accession: YP_287087
Protein GI: 71909500
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
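
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive; the record does not state this, but it matches the listed figures. It also assumes the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record; the helper names are illustrative only.
START_BP = 4180548
END_BP = 4181795
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 4181795 - 4180548 + 1 = 1248,
# and 1248 / 3 - 1 = 415.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```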


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value: 0.238894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATCG TTATTGAAAA TGGCCGTGTC ATCGATCCGA AAAACGGTGT CGACCGCAGA 
GCCTCGCTCT ATGTCGCCGA TGGTAAGGTA GCCGGCATCG GTCAGGTACC TGCCGGGTTT
GTCGCCGACC GGAGCATCGA TGCCGCCGGC TGTGTCGTCT GCCCCGGCTT TATCGACCTT
GGCGCCCGCC TGAACTCAAT CGAAGCGGAA CTAGCCGCTG CAGTTGCTGG TGGTGTGACC
ACCGTCGTCG TGCCGCCGGA TGCCGACCCC CCGCTCGACG AGCCGGAACT GGCCGATCGC
CTGGTTCATC GCGGTGAGGA AATCGGCAAG GCCCGCGTCC TGCCGCTCGG TGCGCTGACC
CTCGGCCTCA AGGGCGAGCG CCTGGCCGAA CTGGCCGGCT TGAAGAAAGC CGGCTGCGTT
GCCTTCTCGC AGGCTAACAA GACGGTGGTC GATACCGAGG CGCTGCTGCG TGCCTTGGAA
TACGCGGCGA CCTTCGATTT CGCCGTCTGG TTGCAGCCGC AGGACTACTG GCTGTCACGC
AATGGCATTG CGCACGAAGG GGAAGTGGCC AGTCGCCTCG GTCTGGCTGG TATTCCCGTC
GCGGCCGAAA CCATCGCCAT CGGTACCATC ATCCAGTTGG TGCGCGACAC CGGTTGCCGC
ATCCATCTGA CCCGCATCTC GTCCGCGGCC GGCATGGCCC TAGTGCACCG TGCCCAGCAC
GATGGCCTGC CAATTTCCTG CGATGTCGGC GTGCATCATT TATTGCTCAC CGAGAACGAC
ATCGGCTTCT TCAATCCGCA TGCCCGTTTC TGCCCACCAT TGCGGGCCCA GACCGACCGC
CAGGCCCTGT CCGACGCCGT CGTTGCCGGC TGGGCGGCCA TCTGTTCCGA CCATACCCCG
GTTGGCGCCG ACGACAAGCT GCTGCCCTTC GGCGAAGCCA AGCCAGGCGC CACCGGCCTT
GAAGTGCTGC TCCCGCTGAC CCTGAAGTGG GCCGACGCCG CCAAGGTCGA TCTGCCGACG
GCTCTGGCCC GTATTACGTC TGCGCCGGCA GCAGTGCTCG GTCTGGCCAG CGGTCAACTG
GCGATCGGTA CAACGGCGGA CATCTGTATT TTCGACCCGG AAGCGAACTG GCAGCTGACA
CCGGACGCCC TGAAGAGCCG CGGTAAAAAT TCGCCTTGGC TGGGCTACGT GATGACAGGG
AAGGTCAAGG CGACGCTGGT TGGTGGCCGC CCGGTCTATC AAGCGTGA
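
The GC content field of a record like this can in principle be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is pasted as text in the grouped layout shown above (blocks of ten bases separated by spaces). `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACATCG TTATTGAAAA TGGCCGTGTC ATCGATCCGA AAAACGGTGT CGACCGCAGA"
sample = clean_sequence(first_line)
print(f"{len(sample)} bases, GC fraction {gc_fraction(sample):.2f}")
```

Passing the full gene sequence instead of `first_line` gives the value to compare with the GC content field.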
 
Protein sequence
MNIVIENGRV IDPKNGVDRR ASLYVADGKV AGIGQVPAGF VADRSIDAAG CVVCPGFIDL 
GARLNSIEAE LAAAVAGGVT TVVVPPDADP PLDEPELADR LVHRGEEIGK ARVLPLGALT
LGLKGERLAE LAGLKKAGCV AFSQANKTVV DTEALLRALE YAATFDFAVW LQPQDYWLSR
NGIAHEGEVA SRLGLAGIPV AAETIAIGTI IQLVRDTGCR IHLTRISSAA GMALVHRAQH
DGLPISCDVG VHHLLLTEND IGFFNPHARF CPPLRAQTDR QALSDAVVAG WAAICSDHTP
VGADDKLLPF GEAKPGATGL EVLLPLTLKW ADAAKVDLPT ALARITSAPA AVLGLASGQL
AIGTTADICI FDPEANWQLT PDALKSRGKN SPWLGYVMTG KVKATLVGGR PVYQA
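
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below rests on two assumptions: the gene sequence is given in the coding orientation, and the codon-to-amino-acid assignments of translation table 11 match the standard code (to my knowledge the two differ only in which start codons are permitted). `CODON_TABLE` and `translate` are illustrative names rather than a library API.

```python
# Build the 64-codon table in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, with the grouping spaces removed.
gene_start = "".join("ATGAACATCG TTATTGAAAA TGGCCGTGTC".split())
print(translate(gene_start))
```

This sketch raises `KeyError` on ambiguous bases such as N; a production pipeline would more likely use an established library than a hand-built table.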