Gene Msil_1886 details

Gene Information

Locus tag: Msil_1886
Symbol:
ID: 7094165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2049016
End bp: 2050926
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 58%
IMG OID: 643465213
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_002362193
Protein GI: 217978046
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
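
The start, end, and length fields in this record agree with one another if the coordinates are read as 1-based inclusive positions and the protein length excludes the stop codon. The following is a minimal Python sketch of that consistency check. The function names are hypothetical, and the coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2049016, 2050926)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1911 636"
```

Both results match the Gene Length (1911 bp) and Protein Length (636 aa) fields.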


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCATG GCTTGATTCC ACGATCCCAT TTGTTCGGCA ATCCTTATAA ATTTTCCGGC 
AAGATCAGTC CCGACGGACT CTTCCTCGCC TGGCTGGCGC CCCTCGACGG AGTTCTCAAT
GTCTGGATCG CGCCGATTGA TGCGATCGAT CGCGCCGAAC CCGTCACCAA AGACACGAAT
CGCGGCATAC GGACTTTTGA ATGGGCTAAC GACGGCCATC ACCTTGTCTA TATGCAAGAT
AAGGAGGGCG ATGAGAATTT CCACATCTAC GCCGTCGACA CGGAAACGCG CGCCATTCGC
GACCTCACGC CATTCGACGG CGTAACGGCG TGGATCGATC GCGTCAGCCG AACGATCCGC
GACCGCATCC TCGTCAGGAT CAATCGCCGC GACCCGAAAT TTCACGATCT CTACACTGTC
GAACTTGCGA GCGGCGATAT TGCCTTGATC CAAGAGAACT TCGGCGTCGC TGCATTCGTG
ACGGACCATC ATTACAACGT CCATCTCGCA ATCAGGGACC TGCCAAGTGG CGAAAGAGAG
GTTCTGCGCC GCGTCGACGG CGTCTGGACG CCGTGGATCA CTTTTGCGAC GGAAGACGCG
CGGGTGTCTC ACCCTCTTCA TTTGGACACG CACGCGAGAA TGCTTTTTCT TCGCGACAGT
CGTGGCCGCG ACAAGGCGGG TCTGACGAGA GTTGATCTCG CCACGGGCGA AACGGCGTTG
CTTGCCGAAA GCGACAAGGC CGACATCTTC GGGGTTCTGT GCGATCTGGA AACGAGGGAG
CCGATCGCAT ATAGCGTCGT TCACGAGCGC CTCCAATATT TCGCGCTTGA GGCAAAACTT
CAGGCCGATC TCGATTTTCT GGCGGCGCAG GATATCGGCG ACTGGTTTCT TTTAAGCCGG
ACGCTGGATG ATCGTCTTTG GGTCATTGGC GCTTATTCGG ATACGCAGCC CTTCATCGAA
TATCTTTTTG ACCGCGGAAC GAGATCGCTT CGCGAACTCC ATCGTGTCTA CCCGGAACTC
GACGATGCGC CACTGCTGCC GATGCGGCCG CTCATCATCA AATCGCGCGA CGGACTCGAT
CTCGTCACCT ATCTCACGCT TCCGGGAGAC GTCTCCGCCG CCGCGCCAGG AGCTGCCGTC
CTTCTCGTCC ATGGCGGCCC ATGGGCGCGC GACAGTTTCG GCTACCACAG CCTCCATCAA
TGGCTCGCCA ATCGCGGTTA TGCCGTGTTG AGCGTTAATT TTCGCGGTTC AGCCGGGTTC
GGCAAGGCAT TCATCAACGC CGGCGACGGT GAATGGGGCC GGCGCATGGA CGACGACCTT
CTCGACGCCG TCGCCTGGGC GATCGAACGA CGGATCGCCG ATCCCCAACG GATCGCCATT
ATGGGGGGAA GCTACGGCGG TTATGCGACG CTCGTCGGCC TCACCCGTAA CCCCGATACC
TATGCCTGTG GGGTCGATAT CGTCGGACCG TCAAATCTCG AAACGCTCGT CCGAACCATT
CCTCCATATT GGGAATCTTT TCGCGCGCCG CTGACGAAAG CGGTGGGCGA TCCCGAAACG
GAAGAAGGCT TGCGGCTTCT GCGCGAGCGT TCTCCGCTCT TCAATGCAGA CAAGATCGCC
AAACCGCTTT TGATCGCACA TGGCGCGAAT GACCCCAGAG TGAAGCAGGC GGAAGCAGAC
CAGATGGTCG AAGCGCTGAA AGAAAGAAAC ATCCCGGTCC CCTATCTGCT TTTTCCAGAC
GAAGGCCATG GTTGCGTGCG GCCCGAGAAC AATATTGCGC TCTTTGCGAT TGTAGAGAAC
TTCCTTGCGC GCCACCTGGG TGGACTCGCT GAACCCATCC ATGCAGATGA GTTGAAGAAA
AGCTCTCTCG AAATCAGGGA GGGCGCGGAG CAGCTTTCCC TACCGCAGTG A
 
Protein sequence
MSHGLIPRSH LFGNPYKFSG KISPDGLFLA WLAPLDGVLN VWIAPIDAID RAEPVTKDTN 
RGIRTFEWAN DGHHLVYMQD KEGDENFHIY AVDTETRAIR DLTPFDGVTA WIDRVSRTIR
DRILVRINRR DPKFHDLYTV ELASGDIALI QENFGVAAFV TDHHYNVHLA IRDLPSGERE
VLRRVDGVWT PWITFATEDA RVSHPLHLDT HARMLFLRDS RGRDKAGLTR VDLATGETAL
LAESDKADIF GVLCDLETRE PIAYSVVHER LQYFALEAKL QADLDFLAAQ DIGDWFLLSR
TLDDRLWVIG AYSDTQPFIE YLFDRGTRSL RELHRVYPEL DDAPLLPMRP LIIKSRDGLD
LVTYLTLPGD VSAAAPGAAV LLVHGGPWAR DSFGYHSLHQ WLANRGYAVL SVNFRGSAGF
GKAFINAGDG EWGRRMDDDL LDAVAWAIER RIADPQRIAI MGGSYGGYAT LVGLTRNPDT
YACGVDIVGP SNLETLVRTI PPYWESFRAP LTKAVGDPET EEGLRLLRER SPLFNADKIA
KPLLIAHGAN DPRVKQAEAD QMVEALKERN IPVPYLLFPD EGHGCVRPEN NIALFAIVEN
FLARHLGGLA EPIHADELKK SSLEIREGAE QLSLPQ
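
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any computation. The following is a minimal Python sketch of that cleanup, plus a GC percentage calculation that could be compared with the GC content field. The function names are hypothetical, and the snippet uses only the first two blocks of the gene sequence as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace; upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-character blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGTCGCATG GCTTGATTCC")
print(len(fragment), gc_percent(fragment))
```

Applying `clean_sequence` to the full Gene sequence block should give a string whose length equals the Gene Length field, which is a quick way to confirm that nothing was lost when copying.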