Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3291 |
Symbol | |
ID | 8014176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3292371 |
End bp | 3293624 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644825850 |
Product | allantoate amidohydrolase |
Protein accession | YP_002977077 |
Protein GI | 241205981 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.17984 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.145227 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCAG CACCAGGCGA GAACATGCGC GTCAATGGCG ACCGTCTCTG GGACAGTCTC ATGGACATGG CGAAGATCGG CCCCGGCATA GCAGGCGGCA ACAATCGCCA GACGCTGACG GATGCCGATG CGCAAGGGCG CAGCCTCTTC AAGACATGGT GCGACGATGC CGGTTTGACC ATGGGCGTCG ACCGCATGGG CACGATGTTC GCCACCCGCC CCGGCACCGA TCCCGATGCC TTGCCCGTCT ATGTCGGCTC ACATCTCGAC ACCCAGCCGA CCGGCGGCAA ATATGACGGT GTGCTCGGCG TACTTGCGGC TCTCGAAGTC GTGCGCACCA TGAACGATCT CGGCATCAAG ACCAAACATC CGATCGTCGT GACCAATTGG ACGAATGAGG AAGGCGCGCG TTTTGCCCCC GCCATGCTGG CGTCAGGCGT CTTCGCCGGC GTGCACAGCC TGGACTTTGC CTATAATCGC AGGGACCCCG AGGGCAATCT GTTCGGCGAC GAATTGAAGC GTATCGGTTG GGTCGGCGAC GAAGAGGTCG GCGCCCGCAA GATGCACGCC TATTTCGAAT ATCACATCGA GCAGGGACCG ATCCTCGAAG CCGAGGAAAA GCAGATCGGC GTCGTCACCC ATTGCCAGGG CCTCTGGTGG CTGGAATTCA CGCTGATCGG CAAGGAAGCC CATACCGGCT CGACGCCGAT GAACCTGCGT GTCAATGCCG GCCTTGCCAT GGCCCGCATC CTGGAAATGG TCCAGGGCGT GGCGATGGGC GAACAGCCGG GCGCCGTCGG CGGCGTCGGA CAGGTGTTCT TCTCGCCGAA TTCCCGCAAC GTCCTGCCGG GCAAGGTGGT CTTCACCGTC GACATCCGCT CGCCCGACAA GGAGAAGCTC GACCGTATGC GGGCAAAGAT CGAGGCGAAG GCGCCTGAGA TCACCGACGC ACTCGGCGTC GGCTGTTCCA TCGAGGCGAT CGGCCACTTC GAGCCGATCA CCTTCGATCC GGAACTGGTC ACGTCGGTGC GCGATGCCGC CGAGCGACTC GGCTACAGCC ACATGAACAT CATCTCCGGC GCCGGCCACG ACGCCTGCTG GGCCGCCAAG GTCGCACCGG CAACGATGGT CATGTGCCCC TGCGTCGGCG GGCTCTCGCA CAATGAGGCG GAAGAGATCT CCAAGGAATG GGCGATGGCG GGAGCCGATG TGCTATTTCA TGCCGTGGTG GAAACGGCGG AGATCGTGGT GTGA
|
Protein sequence | MVAAPGENMR VNGDRLWDSL MDMAKIGPGI AGGNNRQTLT DADAQGRSLF KTWCDDAGLT MGVDRMGTMF ATRPGTDPDA LPVYVGSHLD TQPTGGKYDG VLGVLAALEV VRTMNDLGIK TKHPIVVTNW TNEEGARFAP AMLASGVFAG VHSLDFAYNR RDPEGNLFGD ELKRIGWVGD EEVGARKMHA YFEYHIEQGP ILEAEEKQIG VVTHCQGLWW LEFTLIGKEA HTGSTPMNLR VNAGLAMARI LEMVQGVAMG EQPGAVGGVG QVFFSPNSRN VLPGKVVFTV DIRSPDKEKL DRMRAKIEAK APEITDALGV GCSIEAIGHF EPITFDPELV TSVRDAAERL GYSHMNIISG AGHDACWAAK VAPATMVMCP CVGGLSHNEA EEISKEWAMA GADVLFHAVV ETAEIVV
|
| |