Gene Gdia_0931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0931 
Symbol 
ID6974328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1053178 
End bp1054572 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content63% 
IMG OID643390456 
Productdihydropyrimidinase 
Protein accessionYP_002275332 
Protein GI209543103 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.303478 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCG ACATCGTCAT CCGCAACGGC ACCATCGTGA CCGCCAGCGA CGTGTTTACC 
GCCGATATCG GCATTACCGG CGAAACGATC GACACGATCG GGCACGGTCT GACCGGCCAC
CATGTCGTGG ACGCCACCGG GCTGCTGGTC ATGCCGGGCG GCATCGACGG TCATTGCCAT
ATCGAGCAGG AAGAGGCCGA CGGCACGGTG CATGAGGACG ATTTCGCTTC GGCAAGCGCG
TCCGCACTGC TGGGCGGCAC GACGACGGTG GTGTGTTTCG CGTCCCATGT GCCCGGACGC
AACATCGTCA CACAGTTCGA GGATTATCTC CGGCGGGGGC AGAATTCGCG CATCGATTTC
GCCGTCCACC AGATCGTCAC GCGTACCGAC GCGGATACGA TCGAAAAGGG TATTCCCGAA
ATCGCCCGCC GGGGCGTGGC GGGGCTGAAG GTCTTCATGA CCTACGACGA TTTCCATCTG
ACGGACGGAC AGTTCCTACG CGTTCTCGAC GCGGCGAAGG CCAGCGGCCT CGTGGTTTCG
GTCCATTGCG AAAATTACGA CGCGATCCGC CACATGATCG AAGCCCATCT CGCCGCCGGA
CGAACCGACG CCCACGCCCA CGCCACCTCC CGCCCCCCCT GCCTGGAACG CGAGGCCACC
TATCGCGCCA TGACCCTGGC CGAACTCGTC GACCACGACA TCCAGATCTT CCATGTTTCA
TGCCCGGAAG TCGCCGAGGA AATCGCGCGC GCCCGTCACC GCGGCGTCAA GGTCTCCGCG
GAAACCTGCC CGCATTATCT TTCCCTCACG GAAGCCGATC TGAAGCGGAA CGGATTCGAG
GGCGCAAAAT ATATCTGCAG CCCCCCTCTG CGCACCCAGG CCGACCAGGA CGGCATGTGG
TCGTATCTGC GCGATGGAGT GATCGGCATC GTATCGTCCG ATCATTGCGG CTACAGCTTG
GAAGGAACGT CCGGCAAAGC ACGCCATGGC CGCGACGCGG ATTTTTCACT GATCCCCAAC
GGGATGCCCG GGCTGGGAAC CAGAATGGCC GTCCTGTTCG ATGCCGGTGT AAACAGCCAG
AAGATCGGCC TGACGGATTT CGTCCGCCTG ACGGCCACCG CACCGGCCCA GCGATATGGC
CTTTATCCCC GCAAGGGCAC GATCGCCCCG GGCAGCGACG CGGACCTGGT CCTCTGGGAC
CGGACACAGC AGGTCAGGAT CACCAATGAC CTGTTCCAAA GCAGGATCGA CTACACACCA
TTCGAGGGAC GCCTCGTCAC CGGTTGGCCC GCCATGGTGC TTGCACGCGG CAGGATAGCC
GTCGAGAACG GCCAGGTACT CCGCAAACCC ACGATGGGCC GTTTCCTTAA AGCCCGTTCG
GGACCGCAGG ATTGA
 
Protein sequence
MTFDIVIRNG TIVTASDVFT ADIGITGETI DTIGHGLTGH HVVDATGLLV MPGGIDGHCH 
IEQEEADGTV HEDDFASASA SALLGGTTTV VCFASHVPGR NIVTQFEDYL RRGQNSRIDF
AVHQIVTRTD ADTIEKGIPE IARRGVAGLK VFMTYDDFHL TDGQFLRVLD AAKASGLVVS
VHCENYDAIR HMIEAHLAAG RTDAHAHATS RPPCLEREAT YRAMTLAELV DHDIQIFHVS
CPEVAEEIAR ARHRGVKVSA ETCPHYLSLT EADLKRNGFE GAKYICSPPL RTQADQDGMW
SYLRDGVIGI VSSDHCGYSL EGTSGKARHG RDADFSLIPN GMPGLGTRMA VLFDAGVNSQ
KIGLTDFVRL TATAPAQRYG LYPRKGTIAP GSDADLVLWD RTQQVRITND LFQSRIDYTP
FEGRLVTGWP AMVLARGRIA VENGQVLRKP TMGRFLKARS GPQD