Gene Information |
Locus tag | Msil_1886 |
Symbol | |
ID | 7094165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2049016 |
End bp | 2050926 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643465213 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_002362193 |
Protein GI | 217978046 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
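The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon. The record states neither convention, so both are assumptions. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2049016
END_BP = 2050926
GENE_LENGTH_BP = 1911
PROTEIN_LENGTH_AA = 636


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1911
    print(expected_protein_length(GENE_LENGTH_BP))  # 636
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.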
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGCATG GCTTGATTCC ACGATCCCAT TTGTTCGGCA ATCCTTATAA ATTTTCCGGC AAGATCAGTC CCGACGGACT CTTCCTCGCC TGGCTGGCGC CCCTCGACGG AGTTCTCAAT GTCTGGATCG CGCCGATTGA TGCGATCGAT CGCGCCGAAC CCGTCACCAA AGACACGAAT CGCGGCATAC GGACTTTTGA ATGGGCTAAC GACGGCCATC ACCTTGTCTA TATGCAAGAT AAGGAGGGCG ATGAGAATTT CCACATCTAC GCCGTCGACA CGGAAACGCG CGCCATTCGC GACCTCACGC CATTCGACGG CGTAACGGCG TGGATCGATC GCGTCAGCCG AACGATCCGC GACCGCATCC TCGTCAGGAT CAATCGCCGC GACCCGAAAT TTCACGATCT CTACACTGTC GAACTTGCGA GCGGCGATAT TGCCTTGATC CAAGAGAACT TCGGCGTCGC TGCATTCGTG ACGGACCATC ATTACAACGT CCATCTCGCA ATCAGGGACC TGCCAAGTGG CGAAAGAGAG GTTCTGCGCC GCGTCGACGG CGTCTGGACG CCGTGGATCA CTTTTGCGAC GGAAGACGCG CGGGTGTCTC ACCCTCTTCA TTTGGACACG CACGCGAGAA TGCTTTTTCT TCGCGACAGT CGTGGCCGCG ACAAGGCGGG TCTGACGAGA GTTGATCTCG CCACGGGCGA AACGGCGTTG CTTGCCGAAA GCGACAAGGC CGACATCTTC GGGGTTCTGT GCGATCTGGA AACGAGGGAG CCGATCGCAT ATAGCGTCGT TCACGAGCGC CTCCAATATT TCGCGCTTGA GGCAAAACTT CAGGCCGATC TCGATTTTCT GGCGGCGCAG GATATCGGCG ACTGGTTTCT TTTAAGCCGG ACGCTGGATG ATCGTCTTTG GGTCATTGGC GCTTATTCGG ATACGCAGCC CTTCATCGAA TATCTTTTTG ACCGCGGAAC GAGATCGCTT CGCGAACTCC ATCGTGTCTA CCCGGAACTC GACGATGCGC CACTGCTGCC GATGCGGCCG CTCATCATCA AATCGCGCGA CGGACTCGAT CTCGTCACCT ATCTCACGCT TCCGGGAGAC GTCTCCGCCG CCGCGCCAGG AGCTGCCGTC CTTCTCGTCC ATGGCGGCCC ATGGGCGCGC GACAGTTTCG GCTACCACAG CCTCCATCAA TGGCTCGCCA ATCGCGGTTA TGCCGTGTTG AGCGTTAATT TTCGCGGTTC AGCCGGGTTC GGCAAGGCAT TCATCAACGC CGGCGACGGT GAATGGGGCC GGCGCATGGA CGACGACCTT CTCGACGCCG TCGCCTGGGC GATCGAACGA CGGATCGCCG ATCCCCAACG GATCGCCATT ATGGGGGGAA GCTACGGCGG TTATGCGACG CTCGTCGGCC TCACCCGTAA CCCCGATACC TATGCCTGTG GGGTCGATAT CGTCGGACCG TCAAATCTCG AAACGCTCGT CCGAACCATT CCTCCATATT GGGAATCTTT TCGCGCGCCG CTGACGAAAG CGGTGGGCGA TCCCGAAACG GAAGAAGGCT TGCGGCTTCT GCGCGAGCGT TCTCCGCTCT TCAATGCAGA CAAGATCGCC AAACCGCTTT TGATCGCACA TGGCGCGAAT GACCCCAGAG TGAAGCAGGC GGAAGCAGAC CAGATGGTCG AAGCGCTGAA AGAAAGAAAC ATCCCGGTCC CCTATCTGCT TTTTCCAGAC GAAGGCCATG GTTGCGTGCG GCCCGAGAAC AATATTGCGC TCTTTGCGAT TGTAGAGAAC 
TTCCTTGCGC GCCACCTGGG TGGACTCGCT GAACCCATCC ATGCAGATGA GTTGAAGAAA AGCTCTCTCG AAATCAGGGA GGGCGCGGAG CAGCTTTCCC TACCGCAGTG A
Protein sequence | MSHGLIPRSH LFGNPYKFSG KISPDGLFLA WLAPLDGVLN VWIAPIDAID RAEPVTKDTN RGIRTFEWAN DGHHLVYMQD KEGDENFHIY AVDTETRAIR DLTPFDGVTA WIDRVSRTIR DRILVRINRR DPKFHDLYTV ELASGDIALI QENFGVAAFV TDHHYNVHLA IRDLPSGERE VLRRVDGVWT PWITFATEDA RVSHPLHLDT HARMLFLRDS RGRDKAGLTR VDLATGETAL LAESDKADIF GVLCDLETRE PIAYSVVHER LQYFALEAKL QADLDFLAAQ DIGDWFLLSR TLDDRLWVIG AYSDTQPFIE YLFDRGTRSL RELHRVYPEL DDAPLLPMRP LIIKSRDGLD LVTYLTLPGD VSAAAPGAAV LLVHGGPWAR DSFGYHSLHQ WLANRGYAVL SVNFRGSAGF GKAFINAGDG EWGRRMDDDL LDAVAWAIER RIADPQRIAI MGGSYGGYAT LVGLTRNPDT YACGVDIVGP SNLETLVRTI PPYWESFRAP LTKAVGDPET EEGLRLLRER SPLFNADKIA KPLLIAHGAN DPRVKQAEAD QMVEALKERN IPVPYLLFPD EGHGCVRPEN NIALFAIVEN FLARHLGGLA EPIHADELKK SSLEIREGAE QLSLPQ
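The sequences above are displayed in blocks of 10 characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch for doing that, computing GC content, and translating the coding sequence. The helper names are hypothetical. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in their permitted start codons), and it assumes the input contains only the unambiguous bases A, C, G and T. Only the first three and the last two blocks of the gene sequence are used as examples here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    head = "ATGTCGCATG GCTTGATTCC ACGATCCCAT"  # first three blocks of the gene
    tail = "CTACCGCAGTG A"                     # last two blocks of the gene
    print(translate(head))   # MSHGLIPRSH
    print(translate(tail))   # LPQ
    print(gc_percent(head))  # 50.0
```

The translated head and tail match the start (`MSHGLIPRSH`) and end (`LPQ`) of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.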