Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3456 |
Symbol | |
ID | 8014328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3487355 |
End bp | 3488524 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644826020 |
Product | amidohydrolase |
Protein accession | YP_002977241 |
Protein GI | 241206145 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.960361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTC CCGCCCGGAT CAGCGACGAT CTGCCGTTTC TCACTGCCCT GCGCCGCGAC CTGCACGCCT ATCCCGAACT CGGTTTCGAG GAGGAGCGCA CCGCCGGTAT CGTCGCGACG CTTCTCGAAG AGGCTGGTAT TGCGGTGCAT CGCGGGCTTG GCGGCAGCGG CGTGGTCGGT ACGCTGCAGA TCGGCAACGG CACGCGCAGG ATCGGGCTGC GCGCCGACAT GGATGCGCTC GCCATGCCTG AAACGGCGGA GCGATCCTAT AAATCGACCG TGCCCGGAAA GATGCATGCC TGCGGCCATG ACGGCCATAC GGCGATGCTG CTTGGCGCCG CGCGACATCT TGCGGCAACG CGGGATTTTT CCGGCACGGT GCATTTCATC TTCCAGCCGG CCGAAGAAGG GCGAGGCGGA GCAAGGCGCA TGGTCGAGGA GGGGCTGTTC ACGCTTTTTC CCTGCGATGC CGTCTACGGG CTGCATAACA TGCCTGGGCT TGCGGTGGAT GAGATCGCCG TGGTCGAGGG ACCGCAGCTT GCCTCCTCCG ACAGCTGGCG CATGACCTTC CGCGGGGCCG GCACGCATGG CGCCAAGCCG CATCTCGGCC GCGATCCGAT CACAGCCGCC GGCACCTTCC TGTCATCGCT GCAGACGATC GTCGGACGGG TGGTCGATCC GCTGCAGCCG GCCGTCGTCA GCGCCTGTTT CCTGCAGGCG GGCGACCCGA AGGCTCTGAA CGTCATTCCT GACATCGTCG AGATCGGTGG CACGGCGCGG GCCTATTCGC CCGATGTGCG CGACCAGTTG GAGACCGAGA TCGGACGATT GGCGCATGGC ACGGCGGCCA TGTACAGTAT CGCTGTGGAC TATGCATTCG AACGGCGGAT TCCGCCTGTC ATCAACGACG CGGATGCGAC CGCACGGGCG TTGGCAGTGG CCGGCTCGGT CTTCGGCGGG AAGGTGCAGA CAAGCTTTCC GCCGTCGATG GCAGGCGACG ATTTTGCCTT CTTCGCCCAG AATGCGCCGG GTTGCTACGT CTGGCTCGGC AACGGTCCGG CGGTGGATGG GGCGCTGCAT CACAACACGG CCTATGACTT CAACGATGAA GCGCTTGGAT ATGGGGCGGC CTATTGGGTG GCGTTGGTCG AGCGGGAGTT GAAGGTTTGA
|
Protein sequence | MSIPARISDD LPFLTALRRD LHAYPELGFE EERTAGIVAT LLEEAGIAVH RGLGGSGVVG TLQIGNGTRR IGLRADMDAL AMPETAERSY KSTVPGKMHA CGHDGHTAML LGAARHLAAT RDFSGTVHFI FQPAEEGRGG ARRMVEEGLF TLFPCDAVYG LHNMPGLAVD EIAVVEGPQL ASSDSWRMTF RGAGTHGAKP HLGRDPITAA GTFLSSLQTI VGRVVDPLQP AVVSACFLQA GDPKALNVIP DIVEIGGTAR AYSPDVRDQL ETEIGRLAHG TAAMYSIAVD YAFERRIPPV INDADATARA LAVAGSVFGG KVQTSFPPSM AGDDFAFFAQ NAPGCYVWLG NGPAVDGALH HNTAYDFNDE ALGYGAAYWV ALVERELKV
|
| |