Gene Gdia_3102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3102 
Symbol 
ID6976536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3396897 
End bp3398216 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content69% 
IMG OID643392610 
Productdihydroorotase 
Protein accessionYP_002277447 
Protein GI209545218 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.544962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTACG ACCTGATCAT CCGCAATGGC GTCTGCCTTC TGCCCTGGGG CGAGGCCCGT 
ACCGACATCG GCGTGCGCTC GGGCCGGATC GCCAGCCTGA CGGCGGGTGT GGCGGACGAG
GCCGATACGG TCATCGACGC GTCCGGCCTG CATGTGCTGC CCGGGCTGAT CGACCCGCAC
GTGCATCTGC GCGACCCGGG CGACGCCGCG GTCGAGAGCA TTCCCACCGG CACGCGCGCG
GCGGCCCTGG GCGGGGTGAC GACGGTGTTC GACATGCCCA ACACCGCGCC ATCGGTGACC
GACGCCGAGA TGCTGCGCTG GAAGCAGGAA TACGCGTCGC GCGAAAGCTG GGTGGATTTC
GCGCTGTATG TCGGCGCCAC GCGCGGCAAT ACGCCGCGCC TGGGCGAGTA CGAGTGTTTC
GACGGGGTCT GCGCCATCAA GGTCTTTGCC GGATCGTCGA CCGGCGACCT GATGATCGAG
GATGACGAGG GCATCCGGCA GGTGCTGGAG AACGGCCACC GCCGCGTCGC CTTCCATTCC
GAGGACGAAT ACCGCCTGCA GGACCGCAAG AAGCTGCTGA CCGAGGGCAT GTCCTACGAC
AGCCATCCCT TCTGGCGGGA CGAGGAATGC GCCTTCCTGG GCACGCGGCG TATCGTGGAT
CTGGCGCGGC GGGCGGGGCG GCCGGTCCAT ATCCTGCATA CCTCGACCGC CGAGGAACTG
GACTGGCTGC CGGCGCATCG CGACGTCGCG ACCGTCGAGG TGCTGGTGAA CCATCTGACC
CAGGTCGCCC CCGAATGCTA TGAGACGCTG GGCCCGCTTG CGATCATGAA CCCGCCGATC
CGCGACCGCC GGCATTACGA GGCCAGTTGG GAGGCCGTGC GCAACGGCAC CGTGGACACG
ATCGGGTCGG ACCACGCGCC CCATTCCCTG GCGGCGAAGG CGCGGCCCTG GCCCGCCACC
CCGGCGGGGC TGACGGGGGT GCAGACGCTG GTTCCCGTCA TGCTGGACCA CGTCAATGCC
GGGCGTCTGT CGCTGGGCCG GATGGTGGAC CTGATGGCGG CAGGGCCGGC GCGGGTCTAT
GGCCTGCAGG CCAAGGGGCG GATCGCCATG GGCTATGACG CCGATTTCAC CCTGGTGGAC
ATGAAGGCGC GGCGCCGTAT CACCAATGAC TGGATCGCCA CCCCGGCGGG CTGGACGCCG
TTCGACGGGA TGGAGGTGAC GGGCTGGCCG ATGGCGACGA TCGTGCGGGG CCACGCCGTC
ATGCGCGAGG ATACGATCCT GGGCGCGCCG GCCGGCAGGC TTGCGCATTT CGCACCCTGA
 
Protein sequence
MHYDLIIRNG VCLLPWGEAR TDIGVRSGRI ASLTAGVADE ADTVIDASGL HVLPGLIDPH 
VHLRDPGDAA VESIPTGTRA AALGGVTTVF DMPNTAPSVT DAEMLRWKQE YASRESWVDF
ALYVGATRGN TPRLGEYECF DGVCAIKVFA GSSTGDLMIE DDEGIRQVLE NGHRRVAFHS
EDEYRLQDRK KLLTEGMSYD SHPFWRDEEC AFLGTRRIVD LARRAGRPVH ILHTSTAEEL
DWLPAHRDVA TVEVLVNHLT QVAPECYETL GPLAIMNPPI RDRRHYEASW EAVRNGTVDT
IGSDHAPHSL AAKARPWPAT PAGLTGVQTL VPVMLDHVNA GRLSLGRMVD LMAAGPARVY
GLQAKGRIAM GYDADFTLVD MKARRRITND WIATPAGWTP FDGMEVTGWP MATIVRGHAV
MREDTILGAP AGRLAHFAP