Gene Gdia_0227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0227 
Symbol 
ID6973619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp248002 
End bp249429 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content69% 
IMG OID643389758 
Productphenylhydantoinase 
Protein accessionYP_002274639 
Protein GI209542410 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0770073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.255137 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCTGG TGCGTGGTGG CACGGTCGTG ACGGCGGAAT GGTCGCGGCG GGCGGACGTG 
CTGTGCGACG ACGCGGGGCG GATCGCCGCC GTCGGGCCGG CGCTGGACGT GCCCGTGGGC
TGCGACGTCC TCGATGCCGG CGGGCTTCTG GTCATGCCGG GCGGGATCGA CCCGCACACC
CATATGGAAA TGCCGTTCAT GGGATCGGTC TCCAGCGACG ATTTCCAGAC CGGAACCGCC
GCCGGGGTGG CGGGCGGCAC GACGACCATC ATCGATTTCG TGATTCCCGA TCCGGGGACA
TCGCTGCTGG GCGCCTGGAA GGACTGGCGG GCCAAGGCCG AAAAGGCGGT CTCGGACTAT
TCGTTCCATG TCGCGGTCAC GCATTGGGAC CAGCGGGTGC ATGACGAGAT GGGCATCCTG
ACCCGGGATT GCGGGGTCAA TTCCTTCAAG CATTTCATGG CGTACAAGGG GGCGCTGATG
GTCGATGACG GGGTGCTGCT CCGTTCGATC GGCCGCGCGC TGGAACTGGG CGCGCTGTGC
AACGTGCATG CCGAAAACGG TGACGCCGTC GCCTACCTGC AGCAGGATTT GCTGACGCGC
GGCGTGACCG GCCCCGCCGC CCATCCCCGG TCCCGCCCGC CCGCGGTCGA GGGCGAGGCC
GCGCAGCGCG TCATCGCCAT TGCCGGCCTG CTGGGCGCGC CGGTCTATAT CGTGCATGTC
TCGACCGAGG AAGCCGCCGC CGCCATCGCC GCCGCCCGCG CCCGTGGCCA GCGCGTGTAT
GGCGAAGTGC TGGCCCAGCA TCTGGTGATC GATGACGGTG TCTATGCCGA CCCGGACTGG
CTGGGCGCCG CCCGGCATGT CATGAGCCCG CCTTTCCGCC CGAAACATCA CCAGCACGCC
CTGTGGGCGG GCCTGGCCTC GGGCCAGTTG CAGACGACGG CGACCGATCA TTGCTGCTTC
TGCGCCGGCC AGAAGCAGCA GGGGCGCGAC GATTTCTCTC AAATCCCGAA CGGCACCCCG
GGCATCGAGG ACCGGATGAG CGTGCTGTGG CACCACGGCG TGCGTACCGG GCGTCTGACG
CCGGAGGAAT TCGTGGCCGT CACCTCGGCC AATGCCGCGA AGATCTTCAA CATCCATCCC
CGCAAGGGCA CCGTCACGCC GGGCGCCGAT GCCGACCTGG TCCTGTGGGA CGCCGATTCC
AGCCGTACCG TATCGGCCGC CACCCATCAC CAGAACGTCG ATTACAATGT CTATGAAGGC
ATGACCCTGA CCGGCCTGGC GCGTCATACG ATCAGCGGGG GCCGGGTGGT GTGGTCGGAT
GGCGACCTGC GCACGGTGCG CGGTGCCGGC CGTTACGTCG AACGGCCCTG CTTCGCCCCC
GACATGGCGG CCCAGGCCAG GCGGAACGCC GTTGCGGCCG GGCGGTGA
 
Protein sequence
MLLVRGGTVV TAEWSRRADV LCDDAGRIAA VGPALDVPVG CDVLDAGGLL VMPGGIDPHT 
HMEMPFMGSV SSDDFQTGTA AGVAGGTTTI IDFVIPDPGT SLLGAWKDWR AKAEKAVSDY
SFHVAVTHWD QRVHDEMGIL TRDCGVNSFK HFMAYKGALM VDDGVLLRSI GRALELGALC
NVHAENGDAV AYLQQDLLTR GVTGPAAHPR SRPPAVEGEA AQRVIAIAGL LGAPVYIVHV
STEEAAAAIA AARARGQRVY GEVLAQHLVI DDGVYADPDW LGAARHVMSP PFRPKHHQHA
LWAGLASGQL QTTATDHCCF CAGQKQQGRD DFSQIPNGTP GIEDRMSVLW HHGVRTGRLT
PEEFVAVTSA NAAKIFNIHP RKGTVTPGAD ADLVLWDADS SRTVSAATHH QNVDYNVYEG
MTLTGLARHT ISGGRVVWSD GDLRTVRGAG RYVERPCFAP DMAAQARRNA VAAGR