Gene Msil_3310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3310 
Symbol 
ID7090806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3633043 
End bp3634125 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content63% 
IMG OID643466618 
ProductDyp-type peroxidase family 
Protein accessionYP_002363579 
Protein GI217979432 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0157694 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCAAGG AATATAAGCT TGGCGGCGGG AGCGCCGCGC CGTCGGGCGA GACCGCTCCA 
CCCCCGATTC CGCAGCCTGT CGTCGCCCCG CTTACTCGAG CCGCCATCTT CATAGTGGTT
GCGATCAACC CCGGTCCGGA CAGCCGGGCC GCGCTGCGGT CATTTTGCGG CGATCTTTCC
GGCCTCATCC GGGCGGTCGG CTTTCGCGAT ATCGAAGGCG GCCTGTCCTG CGTCATCGGA
TTTGGCTCCG AGGCCTGGGA CGCGCTTTTT GGCGCGCCGC GGCCCGCGGA GCTGCATCCT
TTCCGCGAGA TCCGCTCCGG CAGCCGCGCC GCCATAGCGA CCCCGGGCGA CATCCTGTTT
CACATCCGCG CCAGCCGCAT GGATCTTTGT TTCGAGCTGG CCGCGCAGAT CATGGAGCGC
ATCGGCGCAT TCGTTTCTCC GGTCGATGAG GCGCAAGGTT TTCGCTACTT CGACGATCGC
GATCTGCTCG GCTTCGTCGA CGGCACCGAA AATCCAGTCG ACGCCGCCGC CGTGGAGGCC
GCCCTCATCG GCGGGGAAGA CGCAGATTTC ACCGGCGGCA GTTATGTCAT CGTGCAAAAA
TATCTGCATG ACATGAAGGC GTGGAACGCG ATCCCAACGG AGATGCAGGA GCTCATCGTC
GGCCGCAAGA AATTGTCCGA CGTTGAACTC GACGAAAGCG TCAAGCCAAG CTGGGCGCAC
GCCGCTTTAA CGATCATCGA AGAGGGCGGC AAGGAAATCA AGATCCTGCG CGCCAACATG
CCGTTCGGGA GCCCGACGCA GGGCGAATTC GGCACCTATT TCATCGGCTA CAGCCGCTCG
CCGCGCACCA TCGAGCAGAT GCTGGAAAAT ATGTTCATCG GCCGCCCGCC CGGCAATTAC
GACAAGCTGC TCGACGTCAG CCGACCTGTC ACGGGCAATC TGTTTTTCGT GCCCACCGCG
ACCTTTTTGG ACAATGTCGC GGACGAGTCA GCGGCGGCGC CCGAGCCGGC GCCGGCCGCC
CCGCGGACAA AAAATGAATC CCTCGCCATC GGGTCCCTCA AAGGAGAAAA AAGGCATGAA
TAA
 
Protein sequence
MFKEYKLGGG SAAPSGETAP PPIPQPVVAP LTRAAIFIVV AINPGPDSRA ALRSFCGDLS 
GLIRAVGFRD IEGGLSCVIG FGSEAWDALF GAPRPAELHP FREIRSGSRA AIATPGDILF
HIRASRMDLC FELAAQIMER IGAFVSPVDE AQGFRYFDDR DLLGFVDGTE NPVDAAAVEA
ALIGGEDADF TGGSYVIVQK YLHDMKAWNA IPTEMQELIV GRKKLSDVEL DESVKPSWAH
AALTIIEEGG KEIKILRANM PFGSPTQGEF GTYFIGYSRS PRTIEQMLEN MFIGRPPGNY
DKLLDVSRPV TGNLFFVPTA TFLDNVADES AAAPEPAPAA PRTKNESLAI GSLKGEKRHE