Gene Msil_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1336 
Symbol 
ID7091674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1439605 
End bp1441206 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content67% 
IMG OID643464674 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_002361663 
Protein GI217977516 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCCGCA CCGTTTCGGC AGCGCTTCGG CGAGCCGCCG CGCCTGCCGC ACTGGCTATC 
GCCCTTGTGA CAGCCTCGCT CACCGCGGCA CGCCTTTTCA CAGCGCCGTT CCTGGCGAGC
GCCGCGGCTG AAGAAGCCGC GCCTGCGCCG AGCGCGGAGG CGGAATCCGC CCCGGCCGCT
GCGTCAAAAC CCGCAACCGC GGCGCCGACG ACGCCGCCTC ACCCCCTCCC GCCGCCGGCG
ACGACGCGGC AAAGCGTCGA CCTGCCGGGC CGCGCCCTGC GCTTCAAGGC GGTCGCCGGT
GCGATCCGCC TCAGCGATGC GCAGAGCGGC GAGGCGCAGG CCGACGTCGC CACTGTCGCC
TACAGCCTCG AAGGTCAGGA CGCGGCCAAA CGTCCCTTGG TCTTCGTCGT CAACGGCGGA
CCCGGCGCGG CCTCCGCCTG GCTCAACCTC GGCGCCCTCG GTCCATGGCG GCTCGATCTC
GGCAAGGATC CGGTCTCGCC CTCCGCCCCG GCCGCGCTCG TCGGCAATGC CGAGACCTGG
CTCGATTTCG CCGATCTGGT GTTCATCGAT CCGCCGCTCA CTGGCTATAG CCGCATCCTC
GCCAAGGGCG ACGGCGCGCG GCGTCAGTTA CTGTCGGTCG ACGGCGACAT CGAGGCGCTC
GCCGTTGTCA TCCGCAAATG GCTGACGTCC AATCAAAGGC TCGAAAGCCC AAAATTCATC
GTCGGCGAAA GCTACGGCGG TTTTCGCGCG CCAAAACTCA CCCGGCGCCT TCAGGAAAAC
GAAGGCGTCG GGATCAAGGG GATTATTCTG ATCTCTCCGG TTATCGATTT CAGCTGGTTC
GAAGCGGCCA ACAGCCCTTT GCCGGTGATG ACCCAATTGC CCTCGCTTAC TGCCGCGGCG
CGGGGCCTGA GGCCCGCGGA CGCCAAGGCG CTTGACGAGG TCGAGAGTTT TGCCTCCGGG
CCTTATCTGA CCGACCTCTT GCGCTCGGAA CGCGACCCCG CCGCCCTGAC CCGCATCGTC
GATGGGGTCG CGCGCCTGAC CGGCCTCGAA CCCGCCTTTG TCCGCCGCCT GGGCGGCCGG
GTGGACCCGT CGAGCTTTGC GCGCGAGGCG GGCCGCGCGG AAGCGAAAAT CTTCAGCCGC
TACGACATGA CGATAGCCGG CTTCGATCCC TCGCCGCACG CCGCCGACAC GAATTTTTCC
GATCCGGTGC TCGATGCGAC GAAAACGCCT TTCGCCAGCG CTATGGCCAA CCTCACGGCG
ACAAAGCTCG GCTGGCCGGT CGATGCGCGC TACGAGATCC TGAACGAATC CGTGAGCCGC
CAGTGGAATT GGGACGGCGG ACGCGGCAAG AACCAGTCCT TGAGCGACCT CAGCCAGGCG
CTCGCGATCG ATGCGGAGTT TCGCGTGCTG ATTGTGCATG GGCTGACCGA TCTGGTGACG
CCCTATTTCG CCTCAAAGCT GCTGATCGGC CAGATCCCGC CCTTCGGCGA TCCCGGCCGC
GTCGCGCTCA AAATCTATGA AGGCGGCCAC ATGCCCTGGC TCCGCGAAAG CGGCCGGGCC
GCCCTGCGCG ACGACGCGCG CAAATTGATC GAGGGGAAAT AG
 
Protein sequence
MIRTVSAALR RAAAPAALAI ALVTASLTAA RLFTAPFLAS AAAEEAAPAP SAEAESAPAA 
ASKPATAAPT TPPHPLPPPA TTRQSVDLPG RALRFKAVAG AIRLSDAQSG EAQADVATVA
YSLEGQDAAK RPLVFVVNGG PGAASAWLNL GALGPWRLDL GKDPVSPSAP AALVGNAETW
LDFADLVFID PPLTGYSRIL AKGDGARRQL LSVDGDIEAL AVVIRKWLTS NQRLESPKFI
VGESYGGFRA PKLTRRLQEN EGVGIKGIIL ISPVIDFSWF EAANSPLPVM TQLPSLTAAA
RGLRPADAKA LDEVESFASG PYLTDLLRSE RDPAALTRIV DGVARLTGLE PAFVRRLGGR
VDPSSFAREA GRAEAKIFSR YDMTIAGFDP SPHAADTNFS DPVLDATKTP FASAMANLTA
TKLGWPVDAR YEILNESVSR QWNWDGGRGK NQSLSDLSQA LAIDAEFRVL IVHGLTDLVT
PYFASKLLIG QIPPFGDPGR VALKIYEGGH MPWLRESGRA ALRDDARKLI EGK