Gene Information |
Locus tag | Rleg_4181 |
Symbol | |
ID | 8014971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4276231 |
End bp | 4277289 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826751 |
Product | peptidase M19 renal dipeptidase |
Protein accession | YP_002977961 |
Protein GI | 241206865 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
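
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 4276231
end_bp = 4277289

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
total_codons = gene_length // 3
protein_length = total_codons - 1

print(gene_length, protein_length)  # 1059 352
```

Both numbers agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields.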
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.351187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTCG TATTTGACGG TCACAACGAC GTTCTCCTTC GACTCTGGAC ACATTCAAAA GACGGCAGCG ACCCGATCGC GGAATTCGCA GACGGCACGA CGGTCGGCCA TATCGATGCG CATCGGGCTA GAGAGGGCGG TCTTTCAGGC GGCCTCTGTG CCATCTATAT TCCCTCGGGC GATCTCGTCT TCGCCGATCC GGATGCCCAC GGCCGCTATA TGACGCCGAT GGCGGCCCCT CTCGATCCGC TGCCTTCGCT TGCCATCGCC AATGAAATGG CGGCGATTGC GCTGCGGCTC GACCAGGCCG GCGCCTGGCG GCTCTGCCGG ACGGTGAAGG ATATCCGCGG CGCCATGGCG GACGACATTT TCGCCGCCGT CATGCATATG GAAGGCTGCG AGGCGATCGG CGCCGATCTC TCGGCGCTCG AGGTGTTCTA CGCGGCAGGG CTGCGGTCGC TCGGGCCGGT CTGGAGCCGG CACAATGTCT TCGGTCACGG CGTGCCCTTC GCCTTCCCGA TGTCGCCGGA CACGGCGCCG GGCCTTACCG ATGCCGGCTT CGCGCTGGTC AGGGAATGCA ATCGCCTCGG CATCCTGATC GACCTTGCCC ATATCACCGA GAAGGGTTTC TGGGACGTGG CGAAGAAGAC GGACCAGCCG CTGGTCGCCA GCCATTCCAA TGCCCACGCC CTGACGCCGG TCGCGCGCAA CCTGACGGAC AGGCAGCTCG ATGCGATCCG CGAAAGCCGC GGGCTCGTCG GCATCAACTA TGCCACCGCC ATGTTGCGTG CCGACGGCCG CTCAGACAGC GACACGCCGC TTGCCGACAT GATCCGCCAT ATCGACTATC TTGTGAATCG CATCGGCATC GACTGCGTGG CGCTCGGATC GGACTTCGAC GGGGCCACCA TTCCGGAGGA AATCGGTGAT GCAGCCGGTA ATCAGAAGCT GATTGCCGCT CTCAGAGAGG TTGGCTATGC TGACGCCGAC CTGGCAAAAC TTGCCCGTGA AAACTGGCTT CGCATTCTGG CCCAGGCTTG GCGGGAGGAC CACGCCTAA
|
Protein sequence | MQFVFDGHND VLLRLWTHSK DGSDPIAEFA DGTTVGHIDA HRAREGGLSG GLCAIYIPSG DLVFADPDAH GRYMTPMAAP LDPLPSLAIA NEMAAIALRL DQAGAWRLCR TVKDIRGAMA DDIFAAVMHM EGCEAIGADL SALEVFYAAG LRSLGPVWSR HNVFGHGVPF AFPMSPDTAP GLTDAGFALV RECNRLGILI DLAHITEKGF WDVAKKTDQP LVASHSNAHA LTPVARNLTD RQLDAIRESR GLVGINYATA MLRADGRSDS DTPLADMIRH IDYLVNRIGI DCVALGSDFD GATIPEEIGD AAGNQKLIAA LREVGYADAD LAKLARENWL RILAQAWRED HA
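
As a rough cross-check of the two sequences above, the following sketch translates the start and end of the gene sequence. It assumes the gene sequence is already given on the coding strand (it begins with ATG even though the gene lies on the minus strand). It uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 differs mainly in the set of permitted start codons, and that does not matter here because the gene starts with ATG. The helper name `translate` is hypothetical.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the same
# amino acid assignments and differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record above
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCAATTCG TATTTGACGG TCACAACGAC"))  # MQFVFDGHND

# Last nine bases of the gene sequence above, ending in the TAA stop codon.
print(translate("CACGCCTAA"))  # HA
```

The two outputs match the first ten residues (MQFVFDGHND) and the last two residues (HA) of the protein sequence listed above.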