Gene Information |
Locus tag | Rleg2_3853 |
Symbol | |
ID | 6982616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3995568 |
End bp | 3996626 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398575 |
Product | peptidase M19 renal dipeptidase |
Protein accession | YP_002283341 |
Protein GI | 209551424 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
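The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it) and that the final codon is a stop codon that is not counted in the protein length. All variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3995568
end_bp = 3996626
listed_gene_length_bp = 1059
listed_protein_length_aa = 352

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length_bp = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codon_count, leftover_bases = divmod(computed_gene_length_bp, 3)

# Assumption: the last codon is the stop codon and adds no residue.
computed_protein_length_aa = codon_count - 1

print(computed_gene_length_bp)      # 1059
print(leftover_bases)               # 0
print(computed_protein_length_aa)   # 352
```

Under these assumptions the computed values agree with the listed gene length (1059 bp) and protein length (352 aa).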
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.417509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTCG TATTTGACGG CCACAACGAC GTTCTCCTTC GACTCTGGAC ACATTCAAAA GACGGCAGCG ATCCGATCTC GGAATTTGCA GACGGCACGA CGGTCGGGCA TATCGATGCG CGCCGGGCCA GGGAGGGCGG CCTTTCGGGC GGTCTTTGCG CCATCTACAT TCCCTCCGGC GACCTGGTCT TTGCCGATCC GGATGCTGAC GGCCGTTATA TCACGCCGAT GGCGGCGCCT CTCGATCCGC TGCCGTCCCT TGCCATCGCC ACTGAAATGG CGGCGATCGC GTTGCGGCTC GACCAGGCCG GCGCCTGGCG GCTCTGCCGG ACGGTGAAGG ATATCCGCGG CGCCATGGCG GACGATATTT TTGCCGCCGT CCTGCATATG GAAGGCTGCG AAGCGATCGG CGCCGATCTT GCGGCGCTCG AAGTCTTCTA CGCAGCCGGG CTGCGGTCGC TCGGGCCGGT CTGGAGCCGG CACAATGTCT TCGGTTACGG CGTGCCCTTC GCCTTTCCGA TGTCGCCGGA CACGGCACCC GGCCTCACCG ATGCCGGTTT TGCGCTGGTG CGGGAATGCA ATCGCCTCGG TATCGTGATC GACCTTGCCC ACATCACCGA GAAGGGCTTC TGGGACGTGG CGAAGACGAC GGACCAGCCG CTGGTCTCCA GCCATTCCAA TGCCCATGCG CTGACGCCGG TCGCGCGCAA TCTGACCGAC AGGCAGCTCG ATGCGATCCG CGAAAGCCGC GGGCTCGTCG GCATCAATTA TGCCACCGCC ATGCTGCGTC CCGACGGCCG CTCGGACAGC GATACACCGC TTGCCGACAT GATCCGCCAT ATCGACTATC TGGTGAACCG CATCGGCATC GATTGCGTCG GCCTCGGATC GGACTTCGAC GGGGCCACAA TTCCTGAGGA AATCGGCGAT GCAAGCGGCA ATCAGAAGCT GATTGCCGCT CTCAGGGAGG TTGGTTATGG TGAGGCCGAT CTCACGAAAC TTGCCCGTGA AAATTGGCTT CGCATCCTCG CACAAGCCTG GCGCGAGGAC GACGCCTAA
|
Protein sequence | MQFVFDGHND VLLRLWTHSK DGSDPISEFA DGTTVGHIDA RRAREGGLSG GLCAIYIPSG DLVFADPDAD GRYITPMAAP LDPLPSLAIA TEMAAIALRL DQAGAWRLCR TVKDIRGAMA DDIFAAVLHM EGCEAIGADL AALEVFYAAG LRSLGPVWSR HNVFGYGVPF AFPMSPDTAP GLTDAGFALV RECNRLGIVI DLAHITEKGF WDVAKTTDQP LVSSHSNAHA LTPVARNLTD RQLDAIRESR GLVGINYATA MLRPDGRSDS DTPLADMIRH IDYLVNRIGI DCVGLGSDFD GATIPEEIGD ASGNQKLIAA LREVGYGEAD LTKLARENWL RILAQAWRED DA
|
| |
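Although the gene lies on the minus strand, the gene sequence shown above already reads in the coding direction (it begins with ATG and ends with TAA). As a rough consistency check, the sketch below translates only the first 30 bases and the last 9 bases of the listed gene sequence and compares them with the ends of the listed protein sequence. It assumes the amino acid assignments of NCBI translation table 11, which match those of the standard code; the helper name `translate` and the variable names are hypothetical.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G at each codon position).
# Assumption: translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final 9-base block of the gene sequence above.
first_30_bases = "ATGCAATTCG TATTTGACGG CCACAACGAC"
last_9_bases = "GACGCCTAA"

n_terminus = translate(first_30_bases)
c_terminus = translate(last_9_bases)

print(n_terminus)  # MQFVFDGHND
print(c_terminus)  # DA*
```

The translated ends match the first ten residues (MQFVFDGHND) and the last two residues (DA) of the listed protein sequence, followed by the stop codon.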