Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0628 |
Symbol | |
ID | 6979344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 650276 |
End bp | 651436 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643395340 |
Product | amidohydrolase |
Protein accession | YP_002280151 |
Protein GI | 209548234 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.417509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCG ACAGCCGGAA GCTGGAACAG GACATGACCG CCTGGAGGCG CGACCTTCAC AGCCATCCCG AATTCGGCTT CGACGAGAAG CGGACGGCGG CTTTCGTCGC GCGGAAATTG CGGGAGTTCG GGCTCGACGA GGTTGTCGAA GGTGTCGGCG GCACAGGGAT TGTCGGAACG CTCCGGCGTG GCGGCGGCAA CCGCTCCATT GCGCTGCGCG CCGATATGGA TGCCTTGAGG ATTGCCGAAC AGGGCGATCG GCCCTATCGA TCGCAGACCG CCGGCGTGAT GCATGCCTGC GGCCATGACG GGCATACGGC AATGCTACTT GGCGCCGCCC AGATGCTGTC TGAGGACGGC AATTTCGACG GCACGGTCCG CTTCATCTTT CAGCCGGCCG AGGAATGGGG CAAAGGCGCC TTGGCGATGC TCGACGATGG GCTGATGACG CGCTTTCCCT TCGACGAGAT CTACGGCCTG CACAATATGC CCGGCCTTCC GGTCGGTGTT TTCCAGACCC GCGCCGGCGC GATCATGTCG GCTGAGGATA ATTTCGAGAT CGTGCTCAAG GGCGTCGGCG GCCACGCCGC GCGTCCGCAC TCCGGCAACG AGGTCCTGGT CGCCGCCTGC GCTTTGGTGA CGAACCTCCA GACCATCGTT TCCCGGCGGC TCGATCCGAC CGATATCGGC GTCGTCTCCG TCACCGAGCT TCTGACCGAC GGCACCCGCA ACGCTTTGCC TGGGCTGGCC CGCATCCTCG GCGATGCCCG CAGCTTCCGC CCCGAAGTCA GCGCGGCGAT CGAAAAGCAC ATGCGCCGCA TCGCCGAAGG CACCGCGCTT GCCTACAATG TCTCGGCCGA GGTGAACTAT ACGAGAGAAT TCGTTCCCCT GCTCAACGAC GCCGCCCTGG CCGAGGAAGC CTTTGCCGCA GCCCGCAGCG TCTTCCCGTC CGAGAACGTC AAGGTCCGGC GCGAGCCGAT GACCGGATCG GAAGACTTCG CCCGCTTCCT CGACCATGTC CCCGGCTGCT TCGTCTTCCT CGGCAATGGC GAAGGTTCCG CGCCGCTGCA CAATCCGAAT TACGATTTCA ACGACGCCGG ACTGATCCAC GGTGCCAAAT TCCACGCGAG CATCGTGCGT CGGCGGTTGA ACGCCAGCTG A
|
Protein sequence | MSVDSRKLEQ DMTAWRRDLH SHPEFGFDEK RTAAFVARKL REFGLDEVVE GVGGTGIVGT LRRGGGNRSI ALRADMDALR IAEQGDRPYR SQTAGVMHAC GHDGHTAMLL GAAQMLSEDG NFDGTVRFIF QPAEEWGKGA LAMLDDGLMT RFPFDEIYGL HNMPGLPVGV FQTRAGAIMS AEDNFEIVLK GVGGHAARPH SGNEVLVAAC ALVTNLQTIV SRRLDPTDIG VVSVTELLTD GTRNALPGLA RILGDARSFR PEVSAAIEKH MRRIAEGTAL AYNVSAEVNY TREFVPLLND AALAEEAFAA ARSVFPSENV KVRREPMTGS EDFARFLDHV PGCFVFLGNG EGSAPLHNPN YDFNDAGLIH GAKFHASIVR RRLNAS
|
| |