Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4364 |
Symbol | |
ID | 6983138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4529930 |
End bp | 4531309 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643399092 |
Product | Allantoinase |
Protein accession | YP_002283848 |
Protein GI | 209551931 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00228036 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACTTCG ATCTCGTTCT GCAGGGCACG GTGGTGCTGC CGGACCGTAT TCTCGAAGAA GGCTATGTTG CCGCCCGCGA CGGCAGGATC GCCGAGGTTG GTCTCGGCGT GCCGCCCGCG GCGCGCGATC GGCATCTGCT CGGCAAAGCG CTGATCCTGC CCGGCGCGAT TGACGCGCAG GTACATTCGC TCTCGCAGAA GGACCAGGAG GATTTCATCT GGTCGACGCG TTCAGCGGCG GCCGGCGGCG TAACAACGAT CGTCGACATG CCCTATGACG AGGGCGACCT CGTCTGCTCG GCCGCAGCGG TCAAACGCAA GATCGAGCAT GCCGGCCAGC AGGCACGCGT CGACTTCGCG CTTTACGGCA CCGTCGACCC GGAAGAAGGC CCGGCGCGGA TCGGCGAGAT GGTCGAGGCT GGGGTTGCCG CGTTCAAATT CTCGACCTTC GACACCGACC CCAAGCGTTT TCCACGCATT CCGCCGGCCC TGCTCGACGC CTGTTTTGCG GCGATCGCGC CGACCGGGCT GACCGCAGGC GTGCACAATG AGGATGACGA GGCGGTGCGC AGCTATATGG CTGAGGTGAA GGCAAGCGGC ATCACCGACT GGCGGGCGCA CGGCCTGTCA CGGCCGCCGA TTACCGAACT GCTGGCGATG CATACGATCT TCGAAACGGG TGCCGCGACC CACTGCCCCT CGCATGTGGT GCACTGCTCG CTCGGGCGCG GCTATGACAT CGCCCGCGCC TATCGCCGCG ACGGCTTTGC GGCGACCGTC GAATGCTGCA TCCATTATCT GACGCTCGAC GAGGAAAACG ACGTCAAGCG CCTCGGCGGC AAGGCGAAGA TCAATCCGCC GCTGCGGCCG CGCGCCGAGG TGGAGACCCT CTGGCGGAAG GTGGCGGAGG GTGATGTCTG GCTGGTCTCG ACCGATCACG TCAGCTGGTC GGAGAACCGC AAGACCAATC CCGACATGCT CGCCAACGCC TCAGGCGTTC CAGGCCTCGA GGTGATGGTG CCGCTGTTCG TCAAAGGCGC TATCGAGCGC GGTATTCCAC TGACATGGGC GGCGAAGCTG ATGGCGGAAA ATCCGGCCAG GCATTTCCGG CTCGACCATA TCAAGGGCGC GCTGACGCCG GGCAAAGATG CCGATATCAC CGTGCTCGAA CCGCGGGACA GCGTCTATGA CGCTGCGGCC AGCGGCAACA ATGTCGTCGG CTGGAGCCCT TATAACGGTG TTCGCCTGCC CTGGACCGTT TCCGCCACCT ATCTCAGGGG CGAGAAGATC GCCGAAGGCG GCAAGGTGCT GGCCGAGCCC GGCAGCGGAC GGTTCGTGCG GCCGTTGCCG CGCCAGGTCA TTGCGGGAGC TCCTGCATGA
|
Protein sequence | MDFDLVLQGT VVLPDRILEE GYVAARDGRI AEVGLGVPPA ARDRHLLGKA LILPGAIDAQ VHSLSQKDQE DFIWSTRSAA AGGVTTIVDM PYDEGDLVCS AAAVKRKIEH AGQQARVDFA LYGTVDPEEG PARIGEMVEA GVAAFKFSTF DTDPKRFPRI PPALLDACFA AIAPTGLTAG VHNEDDEAVR SYMAEVKASG ITDWRAHGLS RPPITELLAM HTIFETGAAT HCPSHVVHCS LGRGYDIARA YRRDGFAATV ECCIHYLTLD EENDVKRLGG KAKINPPLRP RAEVETLWRK VAEGDVWLVS TDHVSWSENR KTNPDMLANA SGVPGLEVMV PLFVKGAIER GIPLTWAAKL MAENPARHFR LDHIKGALTP GKDADITVLE PRDSVYDAAA SGNNVVGWSP YNGVRLPWTV SATYLRGEKI AEGGKVLAEP GSGRFVRPLP RQVIAGAPA
|
| |