Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4780 |
Symbol | |
ID | 8007033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 152050 |
End bp | 153297 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644821710 |
Product | allantoate amidohydrolase |
Protein accession | YP_002972970 |
Protein GI | 241113135 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.105367 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGC GATCCATCGA TGCGGCGCGT CTGCTTTGGC GCATCAGGAC GCTCGGCGAA ATCGGCCGGG ATAGCGACGG CCGGCTCGTG CGGCTGGCAG CTTCTGATGC CGAAAAACTC GGCCGCGACC AATTCGTCGT ATGGATCGAG GACGCAGGGC TCGCCGTCGC CGTCGATCGC ATCGGCAACA TCTTCGGCAT CTGGAAACCG GACGGCGTCG CCGACGAAGC GCCCCTGCTG CTCGGCTCGC ATATCGACAC CGTCATCGGC GCCGGTATCT ATGACGGCTG CTACGGCGCG CTATCCGGTC TGGAAGTCAT CGAGACGCTG AAGGCCGAAG GCCTGGCGCC ATCCCGGCCG ATCGTCGTGG CGGCCTTCAC CAATGAGGAA GGTGCGCGCT ACGCGCCCGA TATGATGGGG TCGCTGGTCT ATGCCGGCGG TCTCGATGTC GACGCGGCTC TTGCCACCAT TGGCACCGAC GGGACGATAC TTGGCCAGGA GCTCGAGCGG ATCGGCTATG CCGGCGAACA TGAGCCCGGC TTCCTCAGGC CGCACGCCTA TATCGAGCTG CATATCGAGC AAGGCCCGGT CCTCGAACGC GAAGGCATTC CGGTCGGCGC CGTGGAAGAC CTTCAAGGCA TCTCCTGGCA GAGGGTGACC ATCACCGGCG ATGCCAATCA CGCCGGAACA ACGCCGATCT CCATGCGCCG AGACGCCGGG CATGCCGCCG CGCGAGTCGT CATCTTCCTG CGCGAGCGGG CGAAGGCTTC GAACACGCCG ACGGTCGCGA CAGTCGGCTG CATGCGCTTC GAACCTGATG TCATCAACGT GATCCCGTCG CGGGCAACCT TCACCGTCGA CCTTCGCGAT CCGGACGAGG ATCGCCTCAG AGAAGAGGAG ACCGCGCTCA CCAACTTCCT GGAGATTCTA TCAACCGAGG AGCAGGTCGG CATATCGGTG GAAAGGCTTG CCCGGTTCGA GCCTGTGAAG TTCGACCAAG GGATCGTCGG CCTCATCGAA AAGGCTGCGC GGGACCGGGG TCTCGCCTGC CGGCGGATGA CCTCCGGCGC CGGCCACGAC GCGCAGATGA TTGCCAGGAT CGCCCCGTCG GCGATGATCT TCGTCCCGAG CATCGGCGGG ATCAGCCATA ACCCGAGGGA ATACACGGCC GACGAAGATC TCGTTGCCGG AGCGAACATC CTGCTGGATG TCGTTCGCCA GCTCGCCAAG GAAGGACTGC CGGCATGA
|
Protein sequence | MTARSIDAAR LLWRIRTLGE IGRDSDGRLV RLAASDAEKL GRDQFVVWIE DAGLAVAVDR IGNIFGIWKP DGVADEAPLL LGSHIDTVIG AGIYDGCYGA LSGLEVIETL KAEGLAPSRP IVVAAFTNEE GARYAPDMMG SLVYAGGLDV DAALATIGTD GTILGQELER IGYAGEHEPG FLRPHAYIEL HIEQGPVLER EGIPVGAVED LQGISWQRVT ITGDANHAGT TPISMRRDAG HAAARVVIFL RERAKASNTP TVATVGCMRF EPDVINVIPS RATFTVDLRD PDEDRLREEE TALTNFLEIL STEEQVGISV ERLARFEPVK FDQGIVGLIE KAARDRGLAC RRMTSGAGHD AQMIARIAPS AMIFVPSIGG ISHNPREYTA DEDLVAGANI LLDVVRQLAK EGLPA
|
| |