Gene Information |
Locus tag | Msil_1074 |
Symbol | |
ID | 7091903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 1163048 |
End bp | 1163908 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643464414 |
Product | peptidase C15 pyroglutamyl peptidase I |
Protein accession | YP_002361405 |
Protein GI | 217977258 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2039] Pyrrolidone-carboxylate peptidase (N-terminal pyroglutamyl peptidase) |
TIGRFAM ID | |
|
|
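The coordinate and length fields in this record agree with each other. The following minimal Python sketch shows the arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS length includes the stop codon. Both are assumptions about the record's conventions, though they reproduce the listed values. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1163048, 1163908)
print(length, protein_length_aa(length))  # prints: 861 286
```

The strand (`-`) does not affect this calculation, since the length depends only on the span between the two coordinates.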
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.254638 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATTC TTGTCACGGG CTTTGGCGGC TTTCCCGGCG CCCCGCGCAA TCCGACCGAA CGGATCATCG CCAATCTGGC CCGCCATCGG CCGCGTCTGG CGCGGGCCGG CCTCGAGCTT GATCTCAGCG TGCTCCCGGT CGTCTATGCC GAGATAGAGC CGCGCCTCGA GGCTTTGACG CGGGAGGCGG CGCCCGACGC AATTCTGCAT TTTGGCCTCG CCAGCCGGCG AAGCAAGCTC TGCGTCGAGA CGCGAGCCTT TAACCGCATC AGCCTCTTGC GGCCGGACGC CGCGGGAGCC TTTGCGCAAA GACGCCTTCT TCTCGCCGGC GGGAGTCAGA CCGGCGTCGA CGGCGTTCAG GTCCCAAGCG ACGAAGGCCG GCGCGTCCCA AGCGGGGAGG GGCAAGCGCT GGCCGGTGGG AGCCAACCGC TCACAGGCAG GGATCAAATG GACCCGATCC GCGCCGGCAG GCCGCAATCT CCGAGGGGCG CCGCCCAAAG CTTAAAATCG AGCGCGCCCG CCGGCCTGAT TGCCGCCAGA CTTCGCCGCG GCGGGTTTCA CGCCGCCGTT TCGATCGACG CCGGCGATTA TGTCTGCAAT CAAACCCTGT TTTTATCGTT GAGCTGCCAT CCGAACGCGC TGGTCGGCTT TATCCATGTG CCGCCGCTCG CCTCGCTCCG GCCGCAGTCC TCGCTCGCCG CGCCGCGTCG GCTGCGTCGG GTCGACGAGG CAGACAGGCC AGGCAACACT CTGATCCGTG GCGGGGGGCG CCTCACTCTT GACGAGGCGG TGCGCGCCGC CGTCCTCGCC ATTCTTGCGC TGATCCCGAA ACTTCAATCG CGACGATTAT CCCGCCGCTG A
|
Protein sequence | MRILVTGFGG FPGAPRNPTE RIIANLARHR PRLARAGLEL DLSVLPVVYA EIEPRLEALT REAAPDAILH FGLASRRSKL CVETRAFNRI SLLRPDAAGA FAQRRLLLAG GSQTGVDGVQ VPSDEGRRVP SGEGQALAGG SQPLTGRDQM DPIRAGRPQS PRGAAQSLKS SAPAGLIAAR LRRGGFHAAV SIDAGDYVCN QTLFLSLSCH PNALVGFIHV PPLASLRPQS SLAAPRRLRR VDEADRPGNT LIRGGGRLTL DEAVRAAVLA ILALIPKLQS RRLSRR
|
| |
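The gene sequence above is listed in coding orientation: it starts with ATG and ends with the stop codon TGA, even though the gene lies on the minus strand. As a hedged illustration, the sketch below translates the first 30 bases and reproduces the first ten residues of the protein sequence. It builds the codon table from the NCBI base ordering (T, C, A, G). The amino acid assignments of translation table 11 are the same as those of the standard code; the tables differ only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCGTATTC TTGTCACGGG CTTTGGCGGC"))  # prints: MRILVTGFGG
```

Passing the full gene sequence to the same function should return the complete protein sequence, provided the sequence text is copied without alteration.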