Gene Msil_0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0004 
Symbol 
ID7092332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3086 
End bp4144 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content67% 
IMG OID643463339 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_002360351 
Protein GI217976204 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTAC TTGGCATTGA AACCACCTGT GACGAGACCG CCGCCGCGGT CGTGCAGCTG 
AACCCCGGCG GCGCCGGCGA GATACTCTCC AATGAAGTCA TGAGCCAGAT CGCCGAACAT
GCCGCCTATG GCGGCGTGGT TCCCGAAATC GCCGCGCGCG CCCATATCGA AGTGCTCGAC
CGGCTAGTCG CCCGCGCCCT GGAAGACGCC AAGATCAAGC TCGCCGAGCT CGACGGGATC
GCCGCCGCGG CCGGACCGGG GCTCGTCGGC GGCGTCATCG TTGGCCTCAC CACGGCCAAG
GCGCTGGCTC TGGCGAGCCA CAAGCCCTTT ATCGCCGTCA ATCATCTCGA GGCGCATGCG
TTGACCGCCC GGCTGACCGA CGGCGTCGAC TTCCCCTACC TGCTCCTGCT GGTCTCGGGC
GGCCATACCC AGCTCGTCGC CGTCAAGGGC GTCGGCGACT ATCTGAGGCT CGGCTCGACC
GTCGACGACG CCGTCGGCGA GGCGTTCGAC AAAGTCGCCA AGATGCTTGG CCTCGCCTAT
CCGGGCGGCC CCGAAGTGGA GCGCATGGCG GCCAAGGGCG ATCCAACAAG GTTTGATTTT
CCTCGGCCGA TGCAAGGACG CGCCAAGCCG GATTTTTCTC TCTCGGGCCT CAAGACCGCC
GTCCGGGTGG CGGCGCAGCG CATTCATTCG CCGAGCCAGA CGGATGTCGC CGATCTTTGC
GCCTCGTTTC AGGCGGCGAT CGTCGACACG ATGATCGACC GCTCGCGCGC AGGCTTGCGG
CTGTTTCGCG AGCGCGTTGG CGACTGCAAC GCAATGGTTG TCGCTGGCGG AGTCGGCGCC
AATGGCGCGA TCCGTCGCGC CTTGAGCCGA TTTTGCGCCG AAAGCGGGCT GCGGCTCATT
TTGCCGCCGC CGCAGCTTTG CACCGACAAT GGCGCGATGA TCGCCTGGGC TGGGATCGAG
CGGCTGTCGC TCGGCCTCGT CGACGATATG ACTTTCGCCG CGCGGCCGCG CTGGCCGCTC
GACTCCAACG CCGAAGCCGC ACATCACGGC AAGGCTTAA
 
Protein sequence
MRVLGIETTC DETAAAVVQL NPGGAGEILS NEVMSQIAEH AAYGGVVPEI AARAHIEVLD 
RLVARALEDA KIKLAELDGI AAAAGPGLVG GVIVGLTTAK ALALASHKPF IAVNHLEAHA
LTARLTDGVD FPYLLLLVSG GHTQLVAVKG VGDYLRLGST VDDAVGEAFD KVAKMLGLAY
PGGPEVERMA AKGDPTRFDF PRPMQGRAKP DFSLSGLKTA VRVAAQRIHS PSQTDVADLC
ASFQAAIVDT MIDRSRAGLR LFRERVGDCN AMVVAGGVGA NGAIRRALSR FCAESGLRLI
LPPPQLCTDN GAMIAWAGIE RLSLGLVDDM TFAARPRWPL DSNAEAAHHG KA