Gene Smed_2367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2367 
Symbol 
ID5323228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2445501 
End bp2446955 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content63% 
IMG OID640791305 
Productphenylhydantoinase 
Protein accessionYP_001328034 
Protein GI150397567 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.040332 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.334928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG TCATAAAGGG TGGAACCATC GTTACGGCCG ACCTGACCTA CAGGGCCGAC 
GTGAAGATCG AGGGCGGCAG GATCGTCGGG ATCGGGCCGA ACCTATCGGC TGCCGAGACA
CTGGACGCTA CCGGCTGCTA CGTCATGCCG GGCGGCATCG ATCCGCACAC CCATCTCGAA
ATGCCCTTCA TGGGCACCTA TTCCTCGGAC GATTTCGAAA GCGGGACGCG GGCAGCGCTT
GCCGGCGGCA CGACGATGGT CGTCGACTTC GCCCTCCCCT CGCCCGGTCA GTCGCTGCTC
GAAGCGCTCA CGATGTGGGA CAACAAGTCG ACGCGCGCCA ATTGCGACTA TTCCTTCCAC
ATGGCGATCA CCTGGTGGAG CGAGCAGGTC TTCAACGAGA TGGAGTCCGT CGTCAAAGAG
AAGGGCATCA ACACCTTCAA GCACTTCATG GCTTATAAAG GCGCTCTGAT GGTGGACGAT
GACGAGATGT TTTCCTCATT CCAGCGCTGC GCGGCACTTG GCGCCCTGCC GCTGGTTCAC
GCCGAGAACG GCGACGTGGT TGCGCAGTTG CAGGCGAAGC TGCTGGCGGA AGGAAATTCC
GGTCCCGAGG CGCATGCCTA TTCGCGGCCG GCAGAAGTCG AGGGCGAGGC CACCAACCGG
GCGATCATGA TCGCCGACAT GGCCGGCTGT CCCGTCTACA TCGTGCACAC CTCCTGCGAA
CAGTCGCATG AGGCCATCCG GCGCGCCCGC GCCAAGGGCA TGCGCGTCTT CGGCGAGCCC
TTGATCCAGC ACCTGACCCT CGACGAGACG GAGTATTTCG ACAAGGACTG GGACCACGCC
GCACGTCGGG TGATGTCACC GCCCTTCCGC AACAAGCTGC ATCAGGACAG CCTCTGGGCG
GGCCTCGCCT CCGGTTCGCT GCAGGTGGTC GCGACCGATC ATTGCGCCTT CACGACCGAG
CAGAAACGCT TCGGCGTCGG CGATTTCACC AGGATCCCGA ACGGCACCGG CGGCCTGGAG
GATCGCATGC CGATGCTCTG GACCCATGGC GTCGCGACTG GCCGCATCAC CATGAACGAA
TTCGTGGCGG TCACCTCGAC CAACATCGCC AAGATCCTCA ACATCTATCC GAAGAAGGGT
GCAATCCTCG TCGGCGCCGA TGCCGACATC GTCGTGTGGG ACCCGAAGCG GTCGAAGACG
ATCTCTGCGA AGACGCAGCA ATCGGCGATC GACTACAACG TCTTCGAAGG CAAGACCGTT
ACCGGTCTTC CGCGCTTCAC GCTGTCGCGT GGCGTGGTTT CTATCGAGGA GGGTTCGGTC
AAGACACAGG AAGGCCACGG CGAATTCGTG CGGCGCGACC CCTTCCCCGC TGTCAGCACC
GCGCTTTCGA CCTGGAAGGA AGTGACGGCA CCTCGCGCAG TACAGCGCAG CGGCATCCCC
GCAAGCGGCG TCTGA
 
Protein sequence
MSTVIKGGTI VTADLTYRAD VKIEGGRIVG IGPNLSAAET LDATGCYVMP GGIDPHTHLE 
MPFMGTYSSD DFESGTRAAL AGGTTMVVDF ALPSPGQSLL EALTMWDNKS TRANCDYSFH
MAITWWSEQV FNEMESVVKE KGINTFKHFM AYKGALMVDD DEMFSSFQRC AALGALPLVH
AENGDVVAQL QAKLLAEGNS GPEAHAYSRP AEVEGEATNR AIMIADMAGC PVYIVHTSCE
QSHEAIRRAR AKGMRVFGEP LIQHLTLDET EYFDKDWDHA ARRVMSPPFR NKLHQDSLWA
GLASGSLQVV ATDHCAFTTE QKRFGVGDFT RIPNGTGGLE DRMPMLWTHG VATGRITMNE
FVAVTSTNIA KILNIYPKKG AILVGADADI VVWDPKRSKT ISAKTQQSAI DYNVFEGKTV
TGLPRFTLSR GVVSIEEGSV KTQEGHGEFV RRDPFPAVST ALSTWKEVTA PRAVQRSGIP
ASGV