Gene Daci_3988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_3988 
Symbol 
ID5749571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp4393591 
End bp4394637 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content66% 
IMG OID641299086 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001565004 
Protein GI160899422 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.055299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.021613 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTGA TCCCCTACGC CCTGACCCGT CCCTTTCTCT TCGGCATGGA CCCCGAATCC 
GCCCACGATC TCACGATGAA CCTGATGGCA AAGGGGCAGA ACACGCTCCT GCAGCAGGCG
TGGGCACAGC CCATGGTGAG TGACCCCGTC GAGCTTGCCG GCCTCAAGTT CCCCAACCGC
GTGGGCATGG CAGCGGGTCT GGACAAGAAT GCACGCTGCA TCGACGCGCT GGCCGCCATG
GGCTTCGGCT TCGTCGAGGT GGGCACCGTG ACACCCCGCC CGCAGCCGGG CAACCCCAAG
CCGCGCATGT TCCGCATTCC CGAACGCAAT GCGCTGATCA ACCGCCTGGG CTTCAACAAC
GAAGGCCTGG ATGCCTTCCT GAGCAACGTC AAGCGCTCGC AGGCCCGCGC GCAGGGCAAA
CCCATGCTGC TGGGGCTGAA CATCGGCAAG AACGCGACCA CTCCCATCGA AGATGCCACC
AGCGACTATC TCAAGGCGCT GGACGGCGTG TACCCGCATG CCGACTACGT GACGGTGAAC
ATCAGCTCGC CCAACACCAA GAACCTGCGC GCCCTGCAAA GCGACGAAGC GCTGGACGCC
CTGCTGGGTG CGATTGCCGA GCGCCGTGAG CAACTGGCCA CGCAGCATGG CAAGCGGGTG
CCGGTGTTCG TGAAGATCGC ACCCGACCTG GATGAAGAGC AGGTCGGCGT CATCGCCGCC
ACGCTGCAGC GCCATGGCAT GGATGGCGTG ATCGCCACCA ACACCACGAT CAGCCGGGAA
GCCGTCAAGG GCCTCCCCTA CGCGCAGGAA ACGGGCGGCC TGTCCGGTGC GCCGGTGCTG
GAGGCCAGCA ACCAGGTCAT CCGCCAGCTG CGTTCCGCCC TGGGCAGCCG CTACCCCATC
ATCGGCGTGG GCGGCATTCT CAGCGGCGAA GATGCCGTCA GCAAAATTCG CGCAGGCGCC
GACGTGGTCC AGATCTACAG CGGCCTGATC TACCGAGGCC CTGCCCTGGT GCCCGAGACC
GCACGCGCCA TAGCCCAGCT GCGTTGA
 
Protein sequence
MSLIPYALTR PFLFGMDPES AHDLTMNLMA KGQNTLLQQA WAQPMVSDPV ELAGLKFPNR 
VGMAAGLDKN ARCIDALAAM GFGFVEVGTV TPRPQPGNPK PRMFRIPERN ALINRLGFNN
EGLDAFLSNV KRSQARAQGK PMLLGLNIGK NATTPIEDAT SDYLKALDGV YPHADYVTVN
ISSPNTKNLR ALQSDEALDA LLGAIAERRE QLATQHGKRV PVFVKIAPDL DEEQVGVIAA
TLQRHGMDGV IATNTTISRE AVKGLPYAQE TGGLSGAPVL EASNQVIRQL RSALGSRYPI
IGVGGILSGE DAVSKIRAGA DVVQIYSGLI YRGPALVPET ARAIAQLR