Gene Information |
Locus tag | Rleg_4994 |
Symbol | |
ID | 8007585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 377330 |
End bp | 378520 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644821909 |
Product | peptidase M24 |
Protein accession | YP_002973169 |
Protein GI | 241113334 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
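The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length_bp = gene_length(377330, 378520)
print(length_bp)                  # 1191, matching the Gene Length field
print(protein_length(length_bp))  # 396, matching the Protein Length field
```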
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.510729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACA ATGCAGGGCA AACAAGGCAG GCCGCCGTGA CGCCGTTCGA CACGGCCAAG CTCGACCGTC TCATGGAGGA AGCCGGTATC GACGTCATTG TCGCCACCTC CAAACACAAC ACGCAATATC TCATGGGCGG CTACAAGTTC ATCTTCTTCG CGGCCATGGA TGCGATCGGG CATAGCCGGT ATCTGCCGAT GGTGATTTAT GAGAAAGGCG CGCCCGATCA CTCCGCCTAT GTGGGCAACC GCATGGAGGG CGCAGAACAC CAGAATAATC CGTTCTGGAC GCCCGCCGTG CATACGGCGA GCTGGGGCAC TCAGGATGCC GCCGGGCTCG CCGTCGAGCA CCTGAAGAAA ATCGGCAAGA CCGGTGGGCG CATTGGCATC GAGCCAGCCT TCCTGCCATC CGACGCCCGC GACCTGCTTG CCGATCGGCT GGATGGCGCC CGATTTGTCG ATGCCACGCA TGTGTTGGAG CGGTTAAGGG CGGTCAAGAC GCCAGATGAA CTGGCAAAGC TGAGACGCGC CTCCGAACTG ATCACCGACT CGATGCTGGC GACGGTCGCG GCGGCCCGCG CCGGTTCGAC GAAGATGGAG ATTATCGAGC AGCTGCGCCG GGAGGAAACG AACCGCGGCC TGCACTTCGA ATATTGCCTG CTGACGCTCG GCTCCAGCCA CAACCGCGCG GGCTCGCCGC AAGCCTGGGT CGAGGGTGAG ATCCTGTCGA TCGATTCCGG CGGCAACTAT CACGGATATA TCGGTGACCT GTGCCGCATG GGTGTGCTCG GCGAGCCGGA TGCGGAGCTG GAGGATCTTC TGGCCGAAGT CGAATGCATC CAGCAAGCGG CCTTCGCCAA TGTCAGGGCA GGGGCTGCGG GCAGGGAGAT GATCGTGGCA GCGGAGGCCG AGCTCAAGGC CTCGCCGTCA GCGGCCTTCA CCGACTTTTT CTGCCATGGC ATGGGCCTCA TCGCCCACGA AGCGCCATTC CTCATGACGA ACCACCCCGT CACCTATGAT GGCATCGATG CCGACAAGCC GCTGGAAGCC GGGAGCGTGA TCTCGGTCGA GACGACGATG CTTCACCCAA AGCGCGGATT CATCAAGTTG GAGGATACGC TCGCCATCAC CGATGGCGGC TATGAAATGT TCGGCGAGCG TGGCCGCGGC TGGAACCGCG GCGGAGCGTG A
|
Protein sequence | MNDNAGQTRQ AAVTPFDTAK LDRLMEEAGI DVIVATSKHN TQYLMGGYKF IFFAAMDAIG HSRYLPMVIY EKGAPDHSAY VGNRMEGAEH QNNPFWTPAV HTASWGTQDA AGLAVEHLKK IGKTGGRIGI EPAFLPSDAR DLLADRLDGA RFVDATHVLE RLRAVKTPDE LAKLRRASEL ITDSMLATVA AARAGSTKME IIEQLRREET NRGLHFEYCL LTLGSSHNRA GSPQAWVEGE ILSIDSGGNY HGYIGDLCRM GVLGEPDAEL EDLLAEVECI QQAAFANVRA GAAGREMIVA AEAELKASPS AAFTDFFCHG MGLIAHEAPF LMTNHPVTYD GIDADKPLEA GSVISVETTM LHPKRGFIKL EDTLAITDGG YEMFGERGRG WNRGGA
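The sequences above are printed in blocks of ten characters, so the spaces need to be removed before any computation. The following is a minimal sketch, assuming the Gene sequence field is pasted in as a plain string; the helper names (`clean`, `gc_percent`, `looks_like_complete_cds`) are hypothetical. The stop codons are those of translation table 11, while the start-codon set is a simplifying assumption that covers only the most common bacterial starts.

```python
def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_complete_cds(
    dna: str,
    starts: tuple = ("ATG", "GTG", "TTG"),  # assumption: common starts only
    stops: tuple = ("TAA", "TAG", "TGA"),   # stop codons in translation table 11
) -> bool:
    """Rough check: length divisible by 3, a start codon first, a stop codon last."""
    dna = clean(dna)
    return (
        len(dna) >= 6
        and len(dna) % 3 == 0
        and dna[:3] in starts
        and dna[-3:] in stops
    )


# Toy input: the first two blocks of the gene sequence listed above.
print(clean("ATGAACGACA ATGCAGGGCA"))  # ATGAACGACAATGCAGGGCA
```

Running `len(clean(...))` and `gc_percent(...)` on the full Gene sequence field gives values that can be compared with the Gene Length and GC content fields of this record.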