Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3036 |
Symbol | |
ID | 6981781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3095765 |
End bp | 3097018 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643397746 |
Product | allantoate amidohydrolase |
Protein accession | YP_002282529 |
Protein GI | 209550612 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.376517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCAG CACCAGGCGA AAACATGCGC GTCAATGGCG ACCGTCTCTG GGACAGTCTC ATGGACATGG CCAAGATCGG CCCCGGCATT GCGGGCGGCA ACAATCGCCA GACGCTGACG GATTCGGATG CCGAGGGCCG AAGCCTTTTC AAGACATGGT GCGACGAAGC GGGCCTCACC ATGGGCATCG ACCAGATGGG CACGATGTTC GCTACCCGCC CCGGCACCGA TCCCGATGCC CTGCCCGTCT ATGTCGGTTC GCATCTCGAC ACTCAGCCGA CCGGCGGCAA ATATGACGGC GTGCTCGGCG TGCTGGCTGC CCTCGAAGTC GTGCGCACCA TGAACGATCT CGGCATCAAG ACCAAACATC CCATTGTCGT CACCAATTGG ACGAACGAGG AAGGGGCACG TTTTGCCCCG GCCATGCTGG CCTCAGGCGT CTTTGCCGGC GTGCACAGCC TCGACTTTGC CTATAATCGC AAGGATCCCG AGGGCAATCT CTTCGGCGAC GAACTGAAAC GCATCGGCTG GCTCGGCGAC GAAGAGGTCG GTGCCCGCAA GATGCACGCC TATTTCGAAT ATCACATCGA GCAGGGTCCG ATCCTCGAGG CCGAAGACAA GCAGATCGGC GTCGTCACCC ACTGTCAGGG CCTCTGGTGG CTGGAATTCA CGCTGACCGG CAAGGAAGCC CATACCGGCT CGACGCCGAT GAACATGCGC GTCAATGCCG GGCTTGCCAT GTCGCGCATC CTGGAAATGG TTCAAGGCGT GGCGATGGGC GAGCAGCCGG GCGCCGTCGG CGGTGTCGGG CAGGTGTTCT TCTCGCCGAA TTCGCGCAAC GTGCTGCCCG GCAAGGTCGT CTTCACCGTC GACATCCGCT CGCCCGACAA GGCCAAGCTC GACCGCATGC GGGCAAAGAT CGAGGCGGAA GCGCCAAAGA TCTGCGATGC TTTGGGTGTC GGCTGTTCCG TCGAGGCGAT CGGCCATTTC GCGCCTGTTA CCTTCGACGA AAAGCTCGTC AGCTCGGTCC GCTCCGCCGC CGAGCGCCTC GGCTACAGCC ACATGAACCT CATCTCGGGC GCCGGCCACG ACGCCTGCTG GGCCGCCAAG GTCGCCCCTG CGACGATGGT CATGTGCCCC TGCGTCGGCG GTCTGTCGCA CAATGAAGCG GAAGACATTT CCAAGGAATG GGCGACGGCG GGCGCCGATG TTCTGTTCCA TGCGGTGGTG GAGACGGCGG AGATTGTTCC GTGA
|
Protein sequence | MVAAPGENMR VNGDRLWDSL MDMAKIGPGI AGGNNRQTLT DSDAEGRSLF KTWCDEAGLT MGIDQMGTMF ATRPGTDPDA LPVYVGSHLD TQPTGGKYDG VLGVLAALEV VRTMNDLGIK TKHPIVVTNW TNEEGARFAP AMLASGVFAG VHSLDFAYNR KDPEGNLFGD ELKRIGWLGD EEVGARKMHA YFEYHIEQGP ILEAEDKQIG VVTHCQGLWW LEFTLTGKEA HTGSTPMNMR VNAGLAMSRI LEMVQGVAMG EQPGAVGGVG QVFFSPNSRN VLPGKVVFTV DIRSPDKAKL DRMRAKIEAE APKICDALGV GCSVEAIGHF APVTFDEKLV SSVRSAAERL GYSHMNLISG AGHDACWAAK VAPATMVMCP CVGGLSHNEA EDISKEWATA GADVLFHAVV ETAEIVP
|
| |