Gene Information |
Locus tag | Rleg_3896 |
Symbol | |
ID | 8014716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3964060 |
End bp | 3965193 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826466 |
Product | Serine-type D-Ala-D-Ala carboxypeptidase |
Protein accession | YP_002977678 |
Protein GI | 241206582 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1686] D-alanyl-D-alanine carboxypeptidase |
TIGRFAM ID | |
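The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed 1134 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(3964060, 3965193)
print(gene_len)                           # 1134
print(expected_protein_length(gene_len))  # 377
```

Both values match the Gene Length and Protein Length fields of this record.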
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00266538 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGTCGACGC GCCACTTCCG TTTGTTTGCT GCCTTGCGGC CGCTTTCTTT CGTATCAGCG GCGACTGCCG TTTTTCTGGG CTCTTTTTCG CTTGCCCAGG CCAATCCGCA TATTCTGGTC GATGTGCAGA CCGGCCGCGT GCTCGAGCAT GAAGAAGCCT TCCGCAAATG GTATCCGGCC TCGCTGACCA AGCTGATGAC CGTCTATACC GTGTTCGATG CGATCCGCGC CGGGCAGATC AGCCTCGATA CACCCATCGT CATGAGCAAG CGCGCCGCCG CGCAGCCTGC CGCCAAGATG TATTTCAAGC CGGGCCAGAA GCTGACGCTC GATAGCGCGC TGAAGATCCT GATGGTGAAA TCGGCCAACG ACATCGCGGT CGCGGTTGCC GAAGCCATCG GCGGCACGCA GGAGGGCTTC GTGACGCGGA TGAACGGCGA GGCGCTGAAG CTCGGCATGA CGGATTCGCA TTTCGTCAAT CCGAACGGCC TGCCCGGCAA GGGCCAGTAT ACGACGGCGC GCGACCTTGC GGTGCTGACG GTGGCGCTGC GCCGCGATTT TCCGCAATAT GCCGGCTATT TTTCGCTGGA AGGTTTCACC AACGGCCAGC AGAACGTGCC GAGCCTCAAT ATGCTGATCG GCCGTTTCGC GGGCGCCGAC GGCATGAAGA CCGGTTTCAT CTGCGCCTCG GGCTTCAACC AGATCGGCTC GGCGACGCGC AACGGCCGCA CGCTGGTCTC CGTTGTGCTC GGCACCGACA GCCTTGCGGC GCGTGCCGAT GCGACGGCGA ATCTTTTGCA GAAGGGCTTC ACCACCCAGC CTGCCAGCAA CGACACGCTG GGTTCGCTGA GACCTTACGG GGTGGGACAG GACCAGGTAA CCGACATCAG CGCCGATATC TGCAGCGCCA AGGGCGCCAA GGTGCGCAGC GAAACGCGCG ACGAGGTCGG CCGCATGAAG GTGCAGTCAC CCTATATCCA GCCGATGGAC CACGATCCGC AATTCGTCTT TGCCGGGCTC ATTCCGGGCC AGGATCCGCA GCCGGCCGCG CAGCCGGAAA AAATGGCGCG CGGTGATACG GCAGGAGCGA TCGCCAACGT GCCGGTGCCG ATGCCGCGCC CGACATCCTT CTAA
|
Protein sequence | MSTRHFRLFA ALRPLSFVSA ATAVFLGSFS LAQANPHILV DVQTGRVLEH EEAFRKWYPA SLTKLMTVYT VFDAIRAGQI SLDTPIVMSK RAAAQPAAKM YFKPGQKLTL DSALKILMVK SANDIAVAVA EAIGGTQEGF VTRMNGEALK LGMTDSHFVN PNGLPGKGQY TTARDLAVLT VALRRDFPQY AGYFSLEGFT NGQQNVPSLN MLIGRFAGAD GMKTGFICAS GFNQIGSATR NGRTLVSVVL GTDSLAARAD ATANLLQKGF TTQPASNDTL GSLRPYGVGQ DQVTDISADI CSAKGAKVRS ETRDEVGRMK VQSPYIQPMD HDPQFVFAGL IPGQDPQPAA QPEKMARGDT AGAIANVPVP MPRPTSF
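The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial code) implies: GTG is an accepted start codon and is read as methionine at the first position. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the listed gene sequence is already in coding orientation (it starts with GTG and ends with TAA, even though the gene lies on the minus strand), and the set of start codons is limited to three common bacterial starts for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code; it differs
# in which codons may serve as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of common bacterial start codons, for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # the initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate_cds("GTGTCGACGC GCCACTTCCG TTTGTTTGCT"))  # MSTRHFRLFA
print(translate_cds("ATGCCGCGCCCGACATCCTTCTAA"))          # MPRPTSF
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above.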
|
| |