Gene Information |
Locus tag | Rleg_2970 |
Symbol | |
ID | 8015746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2963010 |
End bp | 2963930 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644825540 |
Product | protein of unknown function zinc metallopeptidase putative |
Protein accession | YP_002976768 |
Protein GI | 241205672 |
COG category | [R] General function prediction only |
COG ID | [COG2321] Predicted metalloprotease |
TIGRFAM ID | |
|
|
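The coordinate and length fields above are linked by simple arithmetic. The sketch below checks that relationship, assuming inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2963010, 2963930)
print(length)                     # 921
print(protein_length_aa(length))  # 306
```

Both values agree with the Gene Length and Protein Length fields of this record.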
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0222761 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATGGA AAGGCCGGCG TCAGTCCGAC AATATCGAGG ATCGCCGCAG CGATCCGACC GGCGGCGGTT TCGGCCGCGG CGGCGGCTTC AACTTTCCTT CCGGCGGCGG TGTTCGCCGC GCCGGTGGCG GCCTCAGCAT CGGCACGATC GTCTTCCTCA TCGTCATCTA TTTAATCTTC AAGATGATGG GCATTGATCT GCTGCAGATG CTCGATACCG GCGGCACGAC AAGCGGTCCT GGCTACGAGC AGAGCCAGTC AGGCGGAACA CGCACGCCCG CCAACGACGA GATGACCGCA TTCATGCGCA CCGTCCTTGC CGAGACCGAG GATACCTGGA AGGGCATCTT CCAGGCGCAG GGCCAAAATT ACGAAGAGCC GCGTCTCGTG CTGTTCTCGG GCTCGACGGC ATCGGCCTGC GGCTCCGCAT CATCTGCAAC AGGCCCGTTC TACTGCCCGA GCGACCACAA GGTCTATCTC GATACCGAAT TCTTCCAGGA GCTTTCCGAC CGCTTCGGCG CGTCGGGTGA TTTCGCCGAG GCCTATGTCG TAGCTCATGA GGTCGGCCAT CACGTGCAGA ACCTGCTCGG CATCCTGCCG AAGTTCAACC AGGCCCGCCA GCGCATGAGC GAGGCGGACG CCAACAAGAT GTCGGTGCGC GTCGAGCTGC AGGCCGATTG CTTTGCCGGC ATCTGGGGCA AATATACCCA GCAGAAGGGC CTACTTGAGT CAGGCGATCT GGAGGAAGCG CTGAACGCAG CCCAGCAGAT TGGTGACGAT TCGCTGCAGA AGCGGTCACA GGGTTACGTC GTCCCGGAAA GCTTCAACCA CGGTACCTCG GAGCAGCGGG TCAGATGGTT CAAGCGCGGT TTCGACAGCG GCCAGCTGTC GGCCTGCGAT ACGTTCTCCG GCCCAATTTG A
|
Protein sequence | MEWKGRRQSD NIEDRRSDPT GGGFGRGGGF NFPSGGGVRR AGGGLSIGTI VFLIVIYLIF KMMGIDLLQM LDTGGTTSGP GYEQSQSGGT RTPANDEMTA FMRTVLAETE DTWKGIFQAQ GQNYEEPRLV LFSGSTASAC GSASSATGPF YCPSDHKVYL DTEFFQELSD RFGASGDFAE AYVVAHEVGH HVQNLLGILP KFNQARQRMS EADANKMSVR VELQADCFAG IWGKYTQQKG LLESGDLEEA LNAAQQIGDD SLQKRSQGYV VPESFNHGTS EQRVRWFKRG FDSGQLSACD TFSGPI
|
| |
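The GC content and the protein sequence can in principle be recomputed from the gene sequence. The following is a minimal sketch in plain Python, with hypothetical helper names. It assumes the sequence is pasted as text with spaces between the 10-base groups and is already in coding orientation, which matches the record: the sequence shown starts with ATG and ends with TGA. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is built here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated with T, C, A, G at each
# position, first position varying slowest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 15 bases of the gene sequence above, followed by a TGA stop codon.
print(translate("ATGGAATGGA AAGGC TGA"))  # MEWKG
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields. The result of that comparison is not asserted here.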