Gene Msil_3687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3687 
Symbol 
ID7093041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4049605 
End bp4051674 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content64% 
IMG OID643466974 
ProductPeptidyl-dipeptidase Dcp 
Protein accessionYP_002363933 
Protein GI217979786 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.115326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGA CGCACCCGGG CGAGGCGAAT CCGCTGCTCG AACCCTGGAC CGGGCCGTTC 
GAGGCGCCGC CCTTCGGACT GATCCGCTCC GATGAATTTC GTCCGGCCTT CAATCGCGCG
CTCGCCGAGG CGCGGGCGGA GACCGACGCC GTCGCGGCCA ATCCCGAGCC GCCGACCTTC
GCCAATACCA TCGAGGCGAT CGAGCGCAGC GGCAAGAATC TCGACAAGGT CGCAAGCGTC
TTCTTCAATC TCGTCGGCAC GGATTCCGAT GAGACGCTGG AAGCGGTCGA GCGCGACATG
GCGCCGATCC TGTCGCGTCA TCGCAGCGCG TTCTTTTTGA ATGAGGCGCT TTTCGCGCGG
GTCGCCGCGC TCCATGCGCA GCGCGATTCT CTGGGGCTCG ACGCCGAGCA GGCGCGCGTG
CTCGAACGCT ATCATCTGAA TTTCACGCGC AATGGCGCCG GCCTGCCCTC TGAGGCCAAG
GCGCGCCTTG CCGATATCGG CGAGCGCCTC GCCAGCCTTG GCGCGCAATT CGGCCAGAAT
GTTCTGGCCG ACGAAAAAGC CTATCTGCTG ATTCTCGACA TCGAGGATCT CGGCGGCCTG
CCCGATTTCC TGGTTGCGAG CGCGGCCCGC ATCTCCGCCG AGCGCGGCCA TCCCGGCCGA
TACGGGATCA CCCTGTCGCG CTCCAGCATC GAGCCGTTCC TGCAGTTCTC GAACCGGCGC
GATCTGCGGG AAAGGGCGTT CCGCGCTTGG TCTGCGCGCG GCGAAAGCGA CGGCCATACG
GATAATCGCC CGATCGCCGC CGAAATGGTC AAGCTCAGGG CCGAGCGCGC GGCGCTTCTT
GGCTATGAAA GCTTCGCCCA TTTTCGCCTG GCCGACACCA TGGCCAAGAC GCCCGAGGCG
GCGCTCGATC TTCTGCAATC GGTCTGGACG CCAGCGGTGC AGCGCGCCGC GGAGGAGGAG
CAGGCGCTGC AAAAGCTCGC CGCCGCCGAG GGCGAGAATT TCCGCATCGC GCCCTGGGAC
TGGCGCTATT ACGCGGAAAA GCAGCGCAAG GCCGAATTCG ACCTCGATGA GGGCGAGATC
AAGCCTTATC TGCAGCTCGG CAAATTGATC GAGGCCGCCT TCTACGCCGC GGGCCGGCTG
TTTGGGCTCA GTTTTACCGA ACGCTTCGAC ATTCCGCTCT ACAACAAGGG CGCGCGCGCC
TTTGAAGTCG CGCGGGACGG CAAGCCGGTC GCGCTGTTCA TCGGCGATTA TCTGGCGCGG
CCGTCCAAAC GCAGCGGCGC TTGGATGAGC GATTTTCGCG GCCAGCACAA ACTCGACGGC
GCGCAATTGC CCATCATCGT CAATGTCATG AATTTTGCGC AGGGCGGCGA GGGCGAGCCA
AGCCTGCTCA GCTTCGATGA CGCGCGCACG CTGTTCCATG AATTCGGCCA CGGTCTGCAT
GGCATGCTGT CCGACGTGAC CTATCCGACG CTTTCGGGCA CCAATGTCGC GCGCGACTTC
GTTGAATTTC CCTCGCAGCT CTATGAGCAT TGGCTGGAGC AGCCCGAGAT TTTGCGTCGC
TTCGCGCTGC ATTATGAGAC GGGTGAGCCG ATGCCGGAGG CGCTCATCGA AAAGCTCGTC
GCCGCGCGAA AATTCAATCA GGGCTTCGCG ACGCTCGAAT ATGCCGCCTC CGCGCTGGTC
GACCTCAGCC TGCATCTGAA CGCGACGCCG GAGGATCTCG ACGTCGTCGC GCTCGAACAG
AAGGAGCTGG CGCGCATCGG CATGCCGGAG GCCATCGCCA TGCGTCACCG CACGCCGCAT
TTCCAGCACA TCTTTTCGGG CGAATCCTAT TCGGCCGGCT ACTACAGCTA TCTCTGGTCG
GAAATTCTCG ACGCCGACGG TTTTGAGGCC TTCCACGAGA CCGGCGACAT CTTCCATACG
GAGACGGCGC GGCGGCTGCA TGATTTCGTC TATGCCGCCG GCGGCAGCCG CGACTATGAG
GACGCCTACG CAGGATTTCG CGGGCGGGCG CCCTCGCCAC AGGCGCTGCT GCGAAAGCGC
GGCCTGGATA GTGCGGCGGC GGCGAGTTAG
 
Protein sequence
MTKTHPGEAN PLLEPWTGPF EAPPFGLIRS DEFRPAFNRA LAEARAETDA VAANPEPPTF 
ANTIEAIERS GKNLDKVASV FFNLVGTDSD ETLEAVERDM APILSRHRSA FFLNEALFAR
VAALHAQRDS LGLDAEQARV LERYHLNFTR NGAGLPSEAK ARLADIGERL ASLGAQFGQN
VLADEKAYLL ILDIEDLGGL PDFLVASAAR ISAERGHPGR YGITLSRSSI EPFLQFSNRR
DLRERAFRAW SARGESDGHT DNRPIAAEMV KLRAERAALL GYESFAHFRL ADTMAKTPEA
ALDLLQSVWT PAVQRAAEEE QALQKLAAAE GENFRIAPWD WRYYAEKQRK AEFDLDEGEI
KPYLQLGKLI EAAFYAAGRL FGLSFTERFD IPLYNKGARA FEVARDGKPV ALFIGDYLAR
PSKRSGAWMS DFRGQHKLDG AQLPIIVNVM NFAQGGEGEP SLLSFDDART LFHEFGHGLH
GMLSDVTYPT LSGTNVARDF VEFPSQLYEH WLEQPEILRR FALHYETGEP MPEALIEKLV
AARKFNQGFA TLEYAASALV DLSLHLNATP EDLDVVALEQ KELARIGMPE AIAMRHRTPH
FQHIFSGESY SAGYYSYLWS EILDADGFEA FHETGDIFHT ETARRLHDFV YAAGGSRDYE
DAYAGFRGRA PSPQALLRKR GLDSAAAAS