Gene Rleg2_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3038 
Symbol 
ID6981783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3097293 
End bp3098747 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content63% 
IMG OID643397748 
Productphenylhydantoinase 
Protein accessionYP_002282531 
Protein GI209550614 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAG TCATCAAAAA CGGCACCATC GTCACCGCCG ACCTGACCTA CAAGGCTGAC 
GTCAAGATCG ACGGCGGCAA GATCGTCGAG ATCGGCCCGA ACCTCTCGGG CGACGAGACG
CTGGACGCCT CCGGCTGTTA TGTCATGCCT GGTGGTATCG ATCCGCACAC CCATCTCGAA
ATGCCCTTCA TGGGCACCTA TTCCTCCGAC GATTTCGAGA GCGGCACGCG GGCGGCCCTT
TCCGGCGGCA CCACCATGGT GGTCGATTTC GCCCTGCCCG CCCCCGGCCA GTCGCTGCTC
GAAGCGCTGG CCATGTGGGA TAACAAGTCG ACGCGGGCCA ATTGCGACTA TTCTTTCCAC
ATGGCGGTTA CCTGGTGGAG CGAGCAGGTC TTCCAGGAGA TGGAGACCAT CGTCCGCGAC
AAGGGCATCA ACACCTTCAA GCACTTCATG GCCTATAAGG GCGCGCTGAT GGTCGACGAC
GACGAGATGT TCGCCTCGTT CCAGCGCTGC GCCGAACTCG GCGCGCTGCC GCTGGTGCAC
GCCGAAAACG GCGATGTCGT CGCCTCGATG TCGGCAAAGC TGCTCGCCGA AGGCAACAAC
GGTCCCGAGG CGCATGCCTA TTCCCGCCCT GCCGAGGTAG AGGGCGAGGC CACCAACCGC
GCCATCATGA TCGCCGACAT GGCCGGCTGC CCGGTCTATA TCGTCCATAC CTCCTGCGAG
CAGGCGCATG AAGCGATCCG CCGGGCCCGT GCCAAGGGCA TGCGTGTCTA CGGCGAACCG
CTGATCCAGC ACCTGACGCT CGACGAGAGC GAATATTCCA ATCCAGACTG GGACCACGCC
GCCCGCCGCG TGATGTCACC GCCGTTCCGC AACAAGCAGC ACCAGGACAG TCTCTGGGCC
GGGCTTGCCT CCGGCTCGCT GCAGGTGGTC GCTACCGACC ATTGCGCCTT CACCACGGCG
CAGAAGCGTT TCGGCGTCGG TGATTTCACC AAGATCCCGA ACGGCACCGG CGGCCTTGAA
GACCGCATGC CGATGCTCTG GACACGCGGC GTCAACACCG GCCGGCTGAC GATGAACGAG
TTCGTCGCCG TGACCTCGAC CAACATCGCC AAGATCCTCA ACATCTATCC GAAGAAAGGC
GCGATCCTCG TCGGCGCCGA TGCCGATATC GTCGTCTGGG ATCCGAAGCG GTCAAAAACC
ATCTCGTCCA AGAGCCAGCA GTCGGCGATC GACTACAACG TCTTCGAAGG CAAGGAAGTG
ACCGGCCTGC CGCGCTACAC GCTGACGCGC GGCGTCGTCG CTATCGAGGA AAGCACCATC
AAGACCCAAG AGGGCCATGG TGAATTCGTC CGGCGCGAAC CGGTGACGGC CGTCAGCAGG
GCACTCTCCA CCTGGAAGGA GATCACCGCC CCGCGCAAGG TGACGCGCAG CGGCATTCCG
GCGACCGGCG TTTGA
 
Protein sequence
MSTVIKNGTI VTADLTYKAD VKIDGGKIVE IGPNLSGDET LDASGCYVMP GGIDPHTHLE 
MPFMGTYSSD DFESGTRAAL SGGTTMVVDF ALPAPGQSLL EALAMWDNKS TRANCDYSFH
MAVTWWSEQV FQEMETIVRD KGINTFKHFM AYKGALMVDD DEMFASFQRC AELGALPLVH
AENGDVVASM SAKLLAEGNN GPEAHAYSRP AEVEGEATNR AIMIADMAGC PVYIVHTSCE
QAHEAIRRAR AKGMRVYGEP LIQHLTLDES EYSNPDWDHA ARRVMSPPFR NKQHQDSLWA
GLASGSLQVV ATDHCAFTTA QKRFGVGDFT KIPNGTGGLE DRMPMLWTRG VNTGRLTMNE
FVAVTSTNIA KILNIYPKKG AILVGADADI VVWDPKRSKT ISSKSQQSAI DYNVFEGKEV
TGLPRYTLTR GVVAIEESTI KTQEGHGEFV RREPVTAVSR ALSTWKEITA PRKVTRSGIP
ATGV