Gene Information |
Locus tag | Msil_0004 |
Symbol | |
ID | 7092332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3086 |
End bp | 4144 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643463339 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002360351 |
Protein GI | 217976204 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
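
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length, assuming the last codon is a stop codon (not translated)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3086, 4144)
print(length)                     # 1059
print(protein_length_aa(length))  # 352
```

Both values match the Gene Length and Protein Length fields reported for Msil_0004.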
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTAC TTGGCATTGA AACCACCTGT GACGAGACCG CCGCCGCGGT CGTGCAGCTG AACCCCGGCG GCGCCGGCGA GATACTCTCC AATGAAGTCA TGAGCCAGAT CGCCGAACAT GCCGCCTATG GCGGCGTGGT TCCCGAAATC GCCGCGCGCG CCCATATCGA AGTGCTCGAC CGGCTAGTCG CCCGCGCCCT GGAAGACGCC AAGATCAAGC TCGCCGAGCT CGACGGGATC GCCGCCGCGG CCGGACCGGG GCTCGTCGGC GGCGTCATCG TTGGCCTCAC CACGGCCAAG GCGCTGGCTC TGGCGAGCCA CAAGCCCTTT ATCGCCGTCA ATCATCTCGA GGCGCATGCG TTGACCGCCC GGCTGACCGA CGGCGTCGAC TTCCCCTACC TGCTCCTGCT GGTCTCGGGC GGCCATACCC AGCTCGTCGC CGTCAAGGGC GTCGGCGACT ATCTGAGGCT CGGCTCGACC GTCGACGACG CCGTCGGCGA GGCGTTCGAC AAAGTCGCCA AGATGCTTGG CCTCGCCTAT CCGGGCGGCC CCGAAGTGGA GCGCATGGCG GCCAAGGGCG ATCCAACAAG GTTTGATTTT CCTCGGCCGA TGCAAGGACG CGCCAAGCCG GATTTTTCTC TCTCGGGCCT CAAGACCGCC GTCCGGGTGG CGGCGCAGCG CATTCATTCG CCGAGCCAGA CGGATGTCGC CGATCTTTGC GCCTCGTTTC AGGCGGCGAT CGTCGACACG ATGATCGACC GCTCGCGCGC AGGCTTGCGG CTGTTTCGCG AGCGCGTTGG CGACTGCAAC GCAATGGTTG TCGCTGGCGG AGTCGGCGCC AATGGCGCGA TCCGTCGCGC CTTGAGCCGA TTTTGCGCCG AAAGCGGGCT GCGGCTCATT TTGCCGCCGC CGCAGCTTTG CACCGACAAT GGCGCGATGA TCGCCTGGGC TGGGATCGAG CGGCTGTCGC TCGGCCTCGT CGACGATATG ACTTTCGCCG CGCGGCCGCG CTGGCCGCTC GACTCCAACG CCGAAGCCGC ACATCACGGC AAGGCTTAA
|
Protein sequence | MRVLGIETTC DETAAAVVQL NPGGAGEILS NEVMSQIAEH AAYGGVVPEI AARAHIEVLD RLVARALEDA KIKLAELDGI AAAAGPGLVG GVIVGLTTAK ALALASHKPF IAVNHLEAHA LTARLTDGVD FPYLLLLVSG GHTQLVAVKG VGDYLRLGST VDDAVGEAFD KVAKMLGLAY PGGPEVERMA AKGDPTRFDF PRPMQGRAKP DFSLSGLKTA VRVAAQRIHS PSQTDVADLC ASFQAAIVDT MIDRSRAGLR LFRERVGDCN AMVVAGGVGA NGAIRRALSR FCAESGLRLI LPPPQLCTDN GAMIAWAGIE RLSLGLVDDM TFAARPRWPL DSNAEAAHHG KA
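
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is not taken from the source database. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons, and it ignores table 11's alternative start codons (not needed here, since the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGCGCGTAC TTGGCATTGA AACCACCTGT"))  # MRVLGIETTC
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.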