Gene Rleg2_4364 details


Gene Information

Locus tag: Rleg2_4364
Symbol:
ID: 6983138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4529930
End bp: 4531309
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 65%
IMG OID: 643399092
Product: Allantoinase
Protein accession: YP_002283848
Protein GI: 209551931
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID:
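
The coordinate and length fields in this record agree with one another, and a short check makes the arithmetic explicit. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(4529930, 4531309)
print(length, protein_length_aa(length))  # prints "1380 459"
```

Both results match the Gene Length and Protein Length fields.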


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00228036
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGACTTCG ATCTCGTTCT GCAGGGCACG GTGGTGCTGC CGGACCGTAT TCTCGAAGAA 
GGCTATGTTG CCGCCCGCGA CGGCAGGATC GCCGAGGTTG GTCTCGGCGT GCCGCCCGCG
GCGCGCGATC GGCATCTGCT CGGCAAAGCG CTGATCCTGC CCGGCGCGAT TGACGCGCAG
GTACATTCGC TCTCGCAGAA GGACCAGGAG GATTTCATCT GGTCGACGCG TTCAGCGGCG
GCCGGCGGCG TAACAACGAT CGTCGACATG CCCTATGACG AGGGCGACCT CGTCTGCTCG
GCCGCAGCGG TCAAACGCAA GATCGAGCAT GCCGGCCAGC AGGCACGCGT CGACTTCGCG
CTTTACGGCA CCGTCGACCC GGAAGAAGGC CCGGCGCGGA TCGGCGAGAT GGTCGAGGCT
GGGGTTGCCG CGTTCAAATT CTCGACCTTC GACACCGACC CCAAGCGTTT TCCACGCATT
CCGCCGGCCC TGCTCGACGC CTGTTTTGCG GCGATCGCGC CGACCGGGCT GACCGCAGGC
GTGCACAATG AGGATGACGA GGCGGTGCGC AGCTATATGG CTGAGGTGAA GGCAAGCGGC
ATCACCGACT GGCGGGCGCA CGGCCTGTCA CGGCCGCCGA TTACCGAACT GCTGGCGATG
CATACGATCT TCGAAACGGG TGCCGCGACC CACTGCCCCT CGCATGTGGT GCACTGCTCG
CTCGGGCGCG GCTATGACAT CGCCCGCGCC TATCGCCGCG ACGGCTTTGC GGCGACCGTC
GAATGCTGCA TCCATTATCT GACGCTCGAC GAGGAAAACG ACGTCAAGCG CCTCGGCGGC
AAGGCGAAGA TCAATCCGCC GCTGCGGCCG CGCGCCGAGG TGGAGACCCT CTGGCGGAAG
GTGGCGGAGG GTGATGTCTG GCTGGTCTCG ACCGATCACG TCAGCTGGTC GGAGAACCGC
AAGACCAATC CCGACATGCT CGCCAACGCC TCAGGCGTTC CAGGCCTCGA GGTGATGGTG
CCGCTGTTCG TCAAAGGCGC TATCGAGCGC GGTATTCCAC TGACATGGGC GGCGAAGCTG
ATGGCGGAAA ATCCGGCCAG GCATTTCCGG CTCGACCATA TCAAGGGCGC GCTGACGCCG
GGCAAAGATG CCGATATCAC CGTGCTCGAA CCGCGGGACA GCGTCTATGA CGCTGCGGCC
AGCGGCAACA ATGTCGTCGG CTGGAGCCCT TATAACGGTG TTCGCCTGCC CTGGACCGTT
TCCGCCACCT ATCTCAGGGG CGAGAAGATC GCCGAAGGCG GCAAGGTGCT GGCCGAGCCC
GGCAGCGGAC GGTTCGTGCG GCCGTTGCCG CGCCAGGTCA TTGCGGGAGC TCCTGCATGA
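
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation such as GC content. The following is a minimal sketch, with hypothetical helper names, that uses only the first line of the sequence as sample input; running it over the full sequence would give a value to compare against the GC content field of this record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only (sample input, not the whole gene).
first_line = "ATGGACTTCG ATCTCGTTCT GCAGGGCACG GTGGTGCTGC CGGACCGTAT TCTCGAAGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))  # prints "60 0.567"
```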
 
Protein sequence
MDFDLVLQGT VVLPDRILEE GYVAARDGRI AEVGLGVPPA ARDRHLLGKA LILPGAIDAQ 
VHSLSQKDQE DFIWSTRSAA AGGVTTIVDM PYDEGDLVCS AAAVKRKIEH AGQQARVDFA
LYGTVDPEEG PARIGEMVEA GVAAFKFSTF DTDPKRFPRI PPALLDACFA AIAPTGLTAG
VHNEDDEAVR SYMAEVKASG ITDWRAHGLS RPPITELLAM HTIFETGAAT HCPSHVVHCS
LGRGYDIARA YRRDGFAATV ECCIHYLTLD EENDVKRLGG KAKINPPLRP RAEVETLWRK
VAEGDVWLVS TDHVSWSENR KTNPDMLANA SGVPGLEVMV PLFVKGAIER GIPLTWAAKL
MAENPARHFR LDHIKGALTP GKDADITVLE PRDSVYDAAA SGNNVVGWSP YNGVRLPWTV
SATYLRGEKI AEGGKVLAEP GSGRFVRPLP RQVIAGAPA
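
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method: it builds the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares (the two tables differ in which codons may act as start codons), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are assumptions made for this example.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block_text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(block_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGACTTCG ATCTCGTTCT GCAGGGCACG"))  # prints "MDFDLVLQGT"
```

The output matches the first block of the protein sequence listed above.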