Gene Msil_3227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3227 
Symbol 
ID7090642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3540566 
End bp3541906 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content64% 
IMG OID643466535 
Productprotease-like protein 
Protein accessionYP_002363496 
Protein GI217979349 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.000167214 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGAGGG CGACAATAGC ATCCGTTGCG GCGTTCGCCG TCGCGCTTTC TTATGGGGCG 
CAGGCCTTGG CGCAAGAGGT CGAGGCGCCA GCCACAGGGC GCTTCATTCA TGCGCCGAAA
GCGTCAGTGA CGACCCCTTC CTCGAGCGTC GCCAAACCCG CCGATGCAGG CAAGGCGGCG
CATACGAATA CAAAATTCAT AGGCCCGAAT GGTCTTGCTC CGCCGAACGC GGCCGGGCCG
CGATCCGGGG CTGCGCCGGC CGGAAGCCCG CCTTACGCGG ACTATGGGTA TGAAACGCCT
GCTTCGCTCG CCTGTCTCTA CGGGCTGGTC GCCGCTTCCC CCGGATGCAA TCCGAACGTC
GCCTCCGCCG TGCCGACCGC CAAGGGCTCC AAGGCGATAG CTTTGGTGGA CGCCTATGAT
TACCCGACCG CCCTCAGCGA TCTGCAAACC TTCAGCGTTC AGTTTGGCCT TCCCTTGCCC
AATCTGATCG TGAAATACGC TACGGCGGGA GGCGCGTGCA ACGGACCAAA GCCGGCGAAT
GATCCGGGAT GGGAGGGCGA GGAAGCGCTC GACGTTCAAA TGGCGCACGC CATGGCGCCG
CAGGCGACCC TTTATCTCGT CGAGGCGCAA GACAACTCCA ACGCCAATTT GGCGGGGGCG
ATCGTTTGCG CCAACAGCCT GCTTCAGGCG AGCGGGGGCG GTGAAGTCTC GATGAGCTGG
GGCGGAAGCG AAGTCTCAAC CTACGAAAGC GCCTTCAGCG CAAACAACGT CGTCTATTTC
GCGTCGAGCG GCGACGCCCC GGGGCCAAGC TGGCCATCGA CCTCGCCGAA TGTCGTGTCG
GTTGGCGGAA CGAGCATCGC CCGCGACCCG CAAACCTATA AATTCCTGCA TTATGCAAGC
TGGAGCGAGG CGGGCGGCGG CGCGAGCTTG ATTTTTCCGC GCCCGGCCTA CCAAAGCGGC
CCCGGAATAG CGGGAACCGC CCGGCTGACG CCTGACATCT CGGCTGTCGC CAACCCCGCC
ACCGGCGTGT GGGTCTATGA CAGCAATCCC TCTTTCGGCG CAGGCTGGTA TGTGTTCGGC
GGCACCAGCG TCGCTGCGCC GCTGGTGGCG GCGATCACCA ACGCGCATAA TAATTTCCGC
GCCAACACGG CGACCGAGCT GACCGCGATC TACAAGGCGA AAAAGGCGAG CGCCAAAGCC
TTCGCGACGG CCACCATCGG CTATTGCGGC CCTTATGCGG CGTCTCAACC TACCGCGAAG
TGGAACATCT GCCTCGGGGT CGGGACGATC AAGGGAACGG GAACCGCAAA TGTCCTGCCG
GTGGTGGATG CGGAGCAATA G
 
Protein sequence
MKRATIASVA AFAVALSYGA QALAQEVEAP ATGRFIHAPK ASVTTPSSSV AKPADAGKAA 
HTNTKFIGPN GLAPPNAAGP RSGAAPAGSP PYADYGYETP ASLACLYGLV AASPGCNPNV
ASAVPTAKGS KAIALVDAYD YPTALSDLQT FSVQFGLPLP NLIVKYATAG GACNGPKPAN
DPGWEGEEAL DVQMAHAMAP QATLYLVEAQ DNSNANLAGA IVCANSLLQA SGGGEVSMSW
GGSEVSTYES AFSANNVVYF ASSGDAPGPS WPSTSPNVVS VGGTSIARDP QTYKFLHYAS
WSEAGGGASL IFPRPAYQSG PGIAGTARLT PDISAVANPA TGVWVYDSNP SFGAGWYVFG
GTSVAAPLVA AITNAHNNFR ANTATELTAI YKAKKASAKA FATATIGYCG PYAASQPTAK
WNICLGVGTI KGTGTANVLP VVDAEQ