Gene Information |
Locus tag | Rleg_3024 |
Symbol | |
ID | 8013939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3018344 |
End bp | 3019495 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644825592 |
Product | peptidase M24 |
Protein accession | YP_002976820 |
Protein GI | 241205724 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
| |
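The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which agrees with the reported gene length) and that the unspliced CDS ends in a single stop codon. The function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon, so it adds no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
gene_len, protein_len = cds_lengths(3018344, 3019495)
print(gene_len, protein_len)  # 1152 383
```

Both results match the Gene Length (1152 bp) and Protein Length (383 aa) rows.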
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.504097 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTGC ATTTCGAAAA GGCCGAATTC GCAAGCCGGC TTGCGCGCCT CACCGAGAAG ATGAAGGAAG AAAAGCTCGA CGCCTTGCTG CTCTTCGCCC AGGAAAGCAT GTACTGGCTG ACCGGCTACG ACACCTTCGG CTATTGCTTC TTCCAGACGC TGGTCGTCAA GAGCGACGGC ACCATGGCGC TGATCACCCG CTCGGCTGAT CTTCGCCAGG CCAGGCATAC CTCGATCCTC GAGGACATCC ATATCTGGGT CGACCGGGTC AATGCCGACC CGACGCTCGA CCTGAAGAAC CTGCTGGTTG AGCTGGACCT GCTCGGCGCC CGCATCGGCA TCGAATATGA TACCCACGGC ATGACCGGCC GCGTCGCCCG GCTGCTCGAC GCGCAATTGA CCACCTTCGG CCAGATCGTC GACGCCTCCT ACCTCGTCAG CCGGCTGCGC CTGATCAAGA GCCCGACGGA GGTCGCCTAT GTCGAGCGCG CCGCCGCTCT CGCCGACGAT GCGCTCGATG CCGCGATCCG GTTGACAAAG CCCGGCGCCG ACGAGGCGGA TATCCTCGCT GCCATGCAGG GTGCGATTTT TTCCGGCGGC GGCGACTATC CCGCCAACGA GTTTATCATC GGCTCCGGCG CCGACGCGCT GCTCTGCCGC TACAAGGCCG GCCGCCGCAA GCTCGACGCC AACGACCAGT TGACGCTCGA ATGGGCTGGC GCCTATGCGC ATTACCATGC CGCCATGATG CGCACGATCG TCATCGGCGA GCCGACGCAT CGCCACCGCG AGCTTTACAA CGCCTGCCGC GAAACCATCG AGGCGATCGA AACCGTGCTG AAGCCCGGCC AGACCTTTGG CGATGTCTTC GACATGCATG CCAGGATCAT CGACGAGCGC GGCCTGGCTC GCCACCGGCT GAATGCCTGC GGTTATTCGC TCGGCGCCCG CTTCTCGCCC TCCTGGATGG AGCATCAGAT GTTCCATGTC GGCAATCCGC AGCCGATCGA GCCGAACATG TCGCTCTTCG TGCACATGAT CATCGCCGAC TCAGATACGG GCACGGCGAT GACGCTCGGC CAGACCTATC TGACGACAGC GGATGCGCCG CGCGCGCTAT CCTGCCATCC GCTCGATTTC ATCGGGCTCT GA
|
Protein sequence | MALHFEKAEF ASRLARLTEK MKEEKLDALL LFAQESMYWL TGYDTFGYCF FQTLVVKSDG TMALITRSAD LRQARHTSIL EDIHIWVDRV NADPTLDLKN LLVELDLLGA RIGIEYDTHG MTGRVARLLD AQLTTFGQIV DASYLVSRLR LIKSPTEVAY VERAAALADD ALDAAIRLTK PGADEADILA AMQGAIFSGG GDYPANEFII GSGADALLCR YKAGRRKLDA NDQLTLEWAG AYAHYHAAMM RTIVIGEPTH RHRELYNACR ETIEAIETVL KPGQTFGDVF DMHARIIDER GLARHRLNAC GYSLGARFSP SWMEHQMFHV GNPQPIEPNM SLFVHMIIAD SDTGTAMTLG QTYLTTADAP RALSCHPLDF IGL
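The sequences above are printed in space-separated blocks of 10 characters, so the spaces must be removed before any analysis. The sketch below shows one way to do that, then compute a GC fraction and translate a coding sequence. It uses only the Python standard library. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence in this record.
first_blocks = "ATGGCACTGC ATTTCGAAAA GGCCGAATTC"
last_blocks = "ATCGGGCTCT GA"

print(translate(first_blocks))  # MALHFEKAEF
print(translate(last_blocks))   # IGL
```

The two translated fragments match the first block (MALHFEKAEF) and the final block (IGL) of the protein sequence row. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content row (63%).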