Gene Rleg_4635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4635 
Symbol 
ID8015379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4758326 
End bp4759705 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content64% 
IMG OID644827210 
ProductAllantoinase 
Protein accessionYP_002978410 
Protein GI241207314 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.453471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.124954 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTCG ATCTCGTTCT GCAGGGCACA GTGGTGCTGC CGGACCGCAT TGTCGAAGAG 
GGCTATGTCG CCGTGCGCGA CGGCAAGATC GCCGAAGTCG GCCTCGGCGT GCCGCCTGCG
GGCCGCGAAC GGCATCTGCT CGGAAAAGCG CTGATCCTGC CCGGCGCGAT CGACGCGCAG
GTGCATTCGC TTTCCCAAAA AGACCAGGAG GATTTCCTCT GGTCGACACG ATCGGCAGCT
GCCGGCGGCG TGACAACAAT CGTTGACATG CCCTATGACG AAGGCAATCT CGTCTGCTCG
GCAGCGGCAG TGAAGCGGAA GATCGACCAT GCCGCCCCGC AGGCGCGCGT CGATTTCGCG
CTTTACGGCA CAGTCGATCC GGAAGAAGGC CCGACACGTA TCCGCGAAAT GGTGGAGGCA
GGCGTCGCGG CCTTCAAGTT TTCGACCTTC GGCACCGACC CCAAGCGCTT TCCGCGCATT
CCGCCGGCTC TGCTCGACGC CTGCTTTGCG GCAATCGCGC CGACAGGACT GACGGCGGGC
GTGCACAATG AAGACGACGA GGCGGTGCGC ACTTACACGG AACAGGTGAA GGCGAGTGGC
ATCACCGACT GGCGGGCGCA CGGCCTGTCG CGGCCACCGA TCACCGAACT GCTGGCGATG
CATACGATCT TCGAGACCGG CGCCAATACC GGCTGCCCGG CGCATGTGGT GCACTGCTCG
CTCGGGCGCG GCTACGATAT CGCGCGCGCC TATCGCCGCG ATGGCTTTGC GGCGACTGTG
GAATGCTGCA TCCACTACCT GACGCTCGAC GAGGAAAACG ATGTGAAACG CCTCGGCGGT
AAGGCGAAGA TCAATCCGCC GGTGCGGCCG CGCGCCGAGG TGGAGAGGCT CTGGCGGAAG
GTGGCGGAGG GTGATGTCTG GCTGGTTTCG ACCGATCACG TCAGCTGGTC GGAAAACCGC
AAGACCAATC CCGACATGCT CGCCAACGCC TCCGGCGTTC CCGGCCTCGA GGTGATGGTG
CCGCTTTTCG TGAAAGGTGC CACCGAACGC GGCATTCCGC TGACATGGGC AGCCAGGCTG
ATGGCGGAGA ACCCGGCGAA GCATTTCCGG CTCGACCATA TCAAAGGTGC GCTGACCCCG
GGCAAGGATG CCGATATCGT CGTGCTCGAG CCGCGCGAAA GCGTCTATGA TGCATCGGCA
AGCGGCAACA ACGTCATCGG CTGGAGCCCC TATAACGGCA TCCGCCTGCC CTGGACCGTC
TCCGCCACCT ATCTGCGCGG CGAAAAGATT GCCGAGGGCG CGAAGGTGCT GGCTGAGCCC
GGTACCGGCC GCTTCGTGCG GCCGCTGCCG CGCCAGGTCA TTGCGGGAGC TGAAGCATGA
 
Protein sequence
MDFDLVLQGT VVLPDRIVEE GYVAVRDGKI AEVGLGVPPA GRERHLLGKA LILPGAIDAQ 
VHSLSQKDQE DFLWSTRSAA AGGVTTIVDM PYDEGNLVCS AAAVKRKIDH AAPQARVDFA
LYGTVDPEEG PTRIREMVEA GVAAFKFSTF GTDPKRFPRI PPALLDACFA AIAPTGLTAG
VHNEDDEAVR TYTEQVKASG ITDWRAHGLS RPPITELLAM HTIFETGANT GCPAHVVHCS
LGRGYDIARA YRRDGFAATV ECCIHYLTLD EENDVKRLGG KAKINPPVRP RAEVERLWRK
VAEGDVWLVS TDHVSWSENR KTNPDMLANA SGVPGLEVMV PLFVKGATER GIPLTWAARL
MAENPAKHFR LDHIKGALTP GKDADIVVLE PRESVYDASA SGNNVIGWSP YNGIRLPWTV
SATYLRGEKI AEGAKVLAEP GTGRFVRPLP RQVIAGAEA