Gene Msil_1809 details


Gene Information

Locus tag: Msil_1809
Symbol:
ID: 7090926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1969442
End bp: 1970743
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 64%
IMG OID: 643465136
Product: Dyp-type peroxidase family
Protein accession: YP_002362116
Protein GI: 217977969
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01412] Tat-translocated enzyme
            [TIGR01413] Dyp-type peroxidase family
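
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration only; they are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1969442
END_BP = 1970743
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes an unspliced CDS whose last codon is a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Report whether the computed values agree with the listed ones.
print("gene length matches:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1970743 - 1969442 + 1 gives 1302 bp, and 1302 / 3 - 1 gives 433 aa, which agrees with the listed gene and protein lengths.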


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.64569
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAATC GAAAGACTCC GCTCTCGCTT TCCCCGGATC GCCGCAGCCT GCTGCTCGCG 
GGCGGCGCCC TTGCCGGCGC CCTTTCGGCC GGCGCGCCGC AACGGGCGCT TGCGCAAAGC
GACAGCACCA ACGTCACCAA CGCCCCGATC AGCGAAAAAC AGCAGCAGCG CAACCCGTTC
TACGGCGCGC ATCAGGCGGG AATCGTCACG CCGCGCCAAG AATTCGGCAT GATCGCCACC
TTCGACGTGA TCGCCTCGAG CCCGCCCGAT CTCGTGCGCT TGTTCAGGAC GCTGACGGCC
CGCTTCGCCC TTTTGACGCA AGGATGGACG CCGCCGGAGC TCGACCCCCG CCTGCCGCCG
CCGGACTCGG GGTTGCTCGG TCCGGTCGTC GAGCCTGACA ATCTGACGGC GACGCTGTCG
GTCGGCTCGT CGATGTTCGA CGAACGTTTT GGCCTTGCGA AAGTGAAGCC GGCCGTTCTG
ACGCGCATGA CCAGCTTCAA GAACGATGCG CTGGACCCCG CGCTCTGTCA TGGCGATCTG
TCGATACAGT TTTCGTCGAA CTCGGCCGAC GCCAACATCC ACGCCCTGCG CGATATCCTG
AAGAGCCTGC CGGATCTTCT GGTGCTGCGC TGGAAGCAGG AGGGCTATGT TCCGGCTCTG
CCGGCAAAGC CCGGCCAGCC GCCGGAAAGC GCGCGCAATT TCCTCGGCTT CCGCGACGGA
TCGGCCAATC CGCACGCAGG CGATCCGGCC GCAATGAATG AGATCGTCTG GGTCCAGCCG
GGCTCGAAAG AGCCGGCCTG GGCCGCCGGC GGAACCTATC AGGCCGTGCG CATCATCCGC
AATTTCGTCG AACGCTGGGA CCGCACGCCG CTCGGCGAGC AAGAGCGGAT CATCGGCCGA
AGAAAGCCCT CCGGCGCGCC GTTCGACGGC AAAACCGAAG CGGACGTTCC GGATTTCGCC
GCCGATCCCA ACGGCAAGAT CACGCCGATT GACGCCCATA TCAGGCTCGC CAATCCGCGC
ACGCCGGAAA GCCGCGCCAA TCTCATTTTG CGCCGCCCGT TCAACTATTC CAACGGCGTG
TCGAAATCCG GCCAGCTCGA AATGGGCCTG CTCTTCATCG CCTATCAGGC GGATCTCGAA
AAGGGCTTTA TCACGGTCCA GCACCGGCTC GATGGCGAAC CGCTCGAAGA ATACATCAAC
CCGATCGGCG GCGGCTTTTT CTACACGCTG CCAGGCGCAA GAGACGAACA GGATTTCCTC
GGCCGCTCCA TGCTGGAGGC GGCAGGCATC GCTCTGTCGT AG
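
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how that cleaning and a GC-content calculation might look. The helpers `clean_sequence` and `gc_percent` are hypothetical names chosen for this example, and only the first line of the sequence is used to keep it short.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only (an excerpt, not the full gene).
first_line = "ATGTCGAATC GAAAGACTCC GCTCTCGCTT TCCCCGGATC GCCGCAGCCT GCTGCTCGCG"
seq = clean_sequence(first_line)

print("bases in excerpt:", len(seq))
print("starts with ATG:", seq.startswith("ATG"))
print("GC percent of excerpt:", round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `first_line` would allow a comparison with the listed GC content of 64%; the excerpt alone is not expected to match that figure exactly.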
 
Protein sequence
MSNRKTPLSL SPDRRSLLLA GGALAGALSA GAPQRALAQS DSTNVTNAPI SEKQQQRNPF 
YGAHQAGIVT PRQEFGMIAT FDVIASSPPD LVRLFRTLTA RFALLTQGWT PPELDPRLPP
PDSGLLGPVV EPDNLTATLS VGSSMFDERF GLAKVKPAVL TRMTSFKNDA LDPALCHGDL
SIQFSSNSAD ANIHALRDIL KSLPDLLVLR WKQEGYVPAL PAKPGQPPES ARNFLGFRDG
SANPHAGDPA AMNEIVWVQP GSKEPAWAAG GTYQAVRIIR NFVERWDRTP LGEQERIIGR
RKPSGAPFDG KTEADVPDFA ADPNGKITPI DAHIRLANPR TPESRANLIL RRPFNYSNGV
SKSGQLEMGL LFIAYQADLE KGFITVQHRL DGEPLEEYIN PIGGGFFYTL PGARDEQDFL
GRSMLEAAGI ALS