Gene Information |
Locus tag | Rleg_0917 |
Symbol | |
ID | 8012064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 907606 |
End bp | 908526 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823501 |
Product | putative aminopeptidase protein |
Protein accession | YP_002974752 |
Protein GI | 241203656 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0248171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.161085 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCGC TTGAACAAAA TTCTTTTTTC CATCAACGCC GTGCCGTACT GGCGGGTCTT GCCGGTGCAC TTGTCCTGCC GCGCATGGCG GCAGCTTTCG ATGTCCCGGA CGAGCCGCGC CTTGCCAAGC ACGACTACGC CAAGGTCCGC CACCATTTCC GCACCAAGCT GCTGCAGAAG GGCCCGGCGC CCGACAAATA CGAACTGCTG AATGCGCCTG CCGATGCCGA CAAGATCTTC TACCGCTCCG GTTACGGCGA ACTGGAACTG GCGGCCTGGG TCTCGAAATA CAAGCGCGAG CGCGCTGCCA AGCCTGCCGT GCTCTTCCTG CACGGCGGCA ATGCCATGGG CATCGGCCAC TGGCAGCTGA TGAAACCCTA TATGGATGCC GGTTATGTCG TGATGATGCC GTCGTTGCGC GGCGAAAACG GCCAGATGGG CAATTTCTCC GGCTTCTACG ACGAGGTCGA CGACGTGCTC GCCGCCACCG AGCGCCTGGC GCATCTGCCG GGTGTCGATC CCGAACGGCT GTTCATCGCC GGCCACAGCA TCGGCGGCAC GCTGACCATG CTGACGGCGA TGACCACTCA CAAATTCCGC GCCGCCGCAC CGATTTCAGG CAACCCCGAT GCCTTCCGCT TCTTCAACCG CTATCCGCAG GATATTCGCT TCGACGACAG CAATAAGCAT GAATTCGAGG TGCGTTCGGC CCTGTGTTAC GCTCATAGCT TCAAATGTCC GATCCGCGTC GTCCACGGCA CGGAGGAGCC GCATTTCAAC GACCGCGCCG ATCTGCTGGC CCGCCGCGCC CGCGCCGCCG GCGTCCATAT CGAGACCGAA ACCATCGCCG GCAATCACAC CTCGGCGCTG CCGGCCGAGA TCGAACAGAG CATCCGCTTC TTCCACGGGG TGGCGGCCTG A
|
Protein sequence | MSSLEQNSFF HQRRAVLAGL AGALVLPRMA AAFDVPDEPR LAKHDYAKVR HHFRTKLLQK GPAPDKYELL NAPADADKIF YRSGYGELEL AAWVSKYKRE RAAKPAVLFL HGGNAMGIGH WQLMKPYMDA GYVVMMPSLR GENGQMGNFS GFYDEVDDVL AATERLAHLP GVDPERLFIA GHSIGGTLTM LTAMTTHKFR AAAPISGNPD AFRFFNRYPQ DIRFDDSNKH EFEVRSALCY AHSFKCPIRV VHGTEEPHFN DRADLLARRA RAAGVHIETE TIAGNHTSAL PAEIEQSIRF FHGVAA
|
| |
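The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source database: the helper names (`gene_length`, `protein_length`, `gc_fraction`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp rows of this record.
    length = gene_length(907606, 908526)
    print(length, protein_length(length))
```

With the coordinates above, the sketch gives 921 bp and 306 aa, which agrees with the Gene Length and Protein Length rows. Passing the Gene sequence string to `gc_fraction` gives a value that can be compared with the GC content row.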