Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4635 |
Symbol | |
ID | 8015379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4758326 |
End bp | 4759705 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644827210 |
Product | Allantoinase |
Protein accession | YP_002978410 |
Protein GI | 241207314 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.453471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.124954 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTTCG ATCTCGTTCT GCAGGGCACA GTGGTGCTGC CGGACCGCAT TGTCGAAGAG GGCTATGTCG CCGTGCGCGA CGGCAAGATC GCCGAAGTCG GCCTCGGCGT GCCGCCTGCG GGCCGCGAAC GGCATCTGCT CGGAAAAGCG CTGATCCTGC CCGGCGCGAT CGACGCGCAG GTGCATTCGC TTTCCCAAAA AGACCAGGAG GATTTCCTCT GGTCGACACG ATCGGCAGCT GCCGGCGGCG TGACAACAAT CGTTGACATG CCCTATGACG AAGGCAATCT CGTCTGCTCG GCAGCGGCAG TGAAGCGGAA GATCGACCAT GCCGCCCCGC AGGCGCGCGT CGATTTCGCG CTTTACGGCA CAGTCGATCC GGAAGAAGGC CCGACACGTA TCCGCGAAAT GGTGGAGGCA GGCGTCGCGG CCTTCAAGTT TTCGACCTTC GGCACCGACC CCAAGCGCTT TCCGCGCATT CCGCCGGCTC TGCTCGACGC CTGCTTTGCG GCAATCGCGC CGACAGGACT GACGGCGGGC GTGCACAATG AAGACGACGA GGCGGTGCGC ACTTACACGG AACAGGTGAA GGCGAGTGGC ATCACCGACT GGCGGGCGCA CGGCCTGTCG CGGCCACCGA TCACCGAACT GCTGGCGATG CATACGATCT TCGAGACCGG CGCCAATACC GGCTGCCCGG CGCATGTGGT GCACTGCTCG CTCGGGCGCG GCTACGATAT CGCGCGCGCC TATCGCCGCG ATGGCTTTGC GGCGACTGTG GAATGCTGCA TCCACTACCT GACGCTCGAC GAGGAAAACG ATGTGAAACG CCTCGGCGGT AAGGCGAAGA TCAATCCGCC GGTGCGGCCG CGCGCCGAGG TGGAGAGGCT CTGGCGGAAG GTGGCGGAGG GTGATGTCTG GCTGGTTTCG ACCGATCACG TCAGCTGGTC GGAAAACCGC AAGACCAATC CCGACATGCT CGCCAACGCC TCCGGCGTTC CCGGCCTCGA GGTGATGGTG CCGCTTTTCG TGAAAGGTGC CACCGAACGC GGCATTCCGC TGACATGGGC AGCCAGGCTG ATGGCGGAGA ACCCGGCGAA GCATTTCCGG CTCGACCATA TCAAAGGTGC GCTGACCCCG GGCAAGGATG CCGATATCGT CGTGCTCGAG CCGCGCGAAA GCGTCTATGA TGCATCGGCA AGCGGCAACA ACGTCATCGG CTGGAGCCCC TATAACGGCA TCCGCCTGCC CTGGACCGTC TCCGCCACCT ATCTGCGCGG CGAAAAGATT GCCGAGGGCG CGAAGGTGCT GGCTGAGCCC GGTACCGGCC GCTTCGTGCG GCCGCTGCCG CGCCAGGTCA TTGCGGGAGC TGAAGCATGA
|
Protein sequence | MDFDLVLQGT VVLPDRIVEE GYVAVRDGKI AEVGLGVPPA GRERHLLGKA LILPGAIDAQ VHSLSQKDQE DFLWSTRSAA AGGVTTIVDM PYDEGNLVCS AAAVKRKIDH AAPQARVDFA LYGTVDPEEG PTRIREMVEA GVAAFKFSTF GTDPKRFPRI PPALLDACFA AIAPTGLTAG VHNEDDEAVR TYTEQVKASG ITDWRAHGLS RPPITELLAM HTIFETGANT GCPAHVVHCS LGRGYDIARA YRRDGFAATV ECCIHYLTLD EENDVKRLGG KAKINPPVRP RAEVERLWRK VAEGDVWLVS TDHVSWSENR KTNPDMLANA SGVPGLEVMV PLFVKGATER GIPLTWAARL MAENPAKHFR LDHIKGALTP GKDADIVVLE PRESVYDASA SGNNVIGWSP YNGIRLPWTV SATYLRGEKI AEGAKVLAEP GTGRFVRPLP RQVIAGAEA
|
| |