Gene Information |
Locus tag | Mmc1_2719 |
Symbol | |
ID | 4482571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | + |
Start bp | 3441729 |
End bp | 3444626 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639723464 |
Product | hypothetical protein |
Protein accession | YP_866618 |
Protein GI | 117926001 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
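The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Information fields above.
length = gene_length_bp(3441729, 3444626)
print(length)                     # 2898
print(protein_length_aa(length))  # 965
```

Under these assumptions the computed values agree with the Gene Length (2898 bp) and Protein Length (965 aa) fields.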
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000742651 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCAGA ACATCAAAGC CCCCAAGAAA CTGATCGAAG TTGCCCTGCC ATTGGACGAC ATCAACGCGG CAGCAGCGCG GGAGAAGTCC ATCCGGCATG GGCATCCTTC CACATTGCAT CTGTGGTGGG CGCGGCGGCC GTTGGCGGCG GCAAGGGCGG TTCTGTTCGC GCAGATGGTC AACGATCCGG GTGGAGAACG AGGCTATTAC GCCGGCAAGA CCAAGGCCCA GGCGGATGCC GAACGGGAGG AGCTGTTCAA GATCATTCGT GAACTGGTGC TGTGGGAAAA CACCAACAAT GAGGAGGTGC TGAATAAGGC GCGGGCCGCT ATTCGGAAAT CATGGCGGGA GACCTGTGAG CTGAACAAGG GCAAGTCAGG ATTCGATCCG GACAAGCTTC CCGCTTTCCA CGATCCATTT GCCGGTGGTG GGGCCATACC CCTGGAGGCG CAGAGGTTGG GAATGGAGTC TCACGCCTCC GACCTCAACC CGGTGGCTGT GCTCATCAAT AAGGCGATGA TCGAGATTCC GCCCAAATTT GCCGGGCGCA AACCAGTGGG GCCGATTCCA GAAGGCGAGA AGCAGGGCCG CATGGAGAGC GACTGGCCTG GAGCCACTGG ACTGGCCGAG GATGTGCGTC GTTATGGCCA CTGGATGCGG GAGGAGGCCT TCAAACGCAT TGGTCACCTC TACCCGCAGG TGGAGATCAC CGCCGAGATG GCCAAGGAAC GGCCAGACCT GAAGGGGATA GTTGGACAGA AACTCACCGT CATTGCCTGG TTGTGGGCGC GGACGGTAAG AAGTCCGAAC CCTGCCTTTT CTCATATTGC AGTACCGCTG GTTTCAAGCT TTTTGCTCTC AACCAAGAAA GGCAAGGAAG CCTATATTGA ACCTGTTGTA GATGCTAATA GCTATTATTT TTCAGTAAAA AAAGGGACTC CGTCGAAAGA TTCTGCTAGG GGGACTTCGG CTGGTAAACG AGGAGGTTTC CGTTGCATTT TTTCTGACGC GCCAATTGAT TACAATTATA TCCGTGATGA AGGTTCTGCA GGAAGGATTG GCACCAGGCT AATGGCTATT GTTGCTGAAG GTGTCCGTGG TCGGATCTAT CTGTCCGCTA CACCTGAACT CGAGATAATT GCGAATAGTG CAAAGCCTGA GTGGAGTCCA GATGTCAAGC TACATGGTAA ATGCCGAGTC AACGTTTCTA ATTATGGTTT GGATGTGTAT AGCGATCTCT TTACCCCCCG TCAGTTAGTT GCTCTGACGA CGTTTTCCAA TCTAGTGCAA GAGGCACGCG TGAAGGCCGT CAATGATGCA AAAATTACAG GAATGGCCGA TGATGGTATG GGAATAGATG AAGGAGGTTT CGGAGCCGGG GCTTATGGAG ATGCTGTGGC AGTATATTTG GGATTTATTG TCGATAAAGT TTCTGAAAGT TTATCAACAA TTTGCACTTG GAGTTCATCT CCTAAAAATG AGCTTATTGT AAGTACGTTC CGAAGGCAAG CAATACCAAT GACGTGGGAT TTTGGGGAGG CTAACCCCTT CGCAAATTCA AGCGGGTCGC TTGAGAAAAT TGTCCCTGCG GTTTCAAAAG TTATCAAGAC ATCATTATGT GGAAGTGTAG ATGGGAATGC TATTCAATTT GACGCCCGAA CGGTTAATCT AAGCGATAGG GTCGTGTCAA CAGACCCGCC ATACTATGAC AATATTGGCT ATGCAGATTT ATCTGATTTT TTCTATGTGT GGTCCCGCAG GGCACTTAAA TCAATATTTC CATCTTTATA CTCTACATTG 
GCAGTCCCAA AAGCAGAGGA GTTAGTGGCA ACGCCTTACC GTCATGGATC TAAAGAGGAA GCAGAAGCTT TTTTCATGAA TGGTATGATT TGCGCAATCA ATAATTTCGC AAATCAGGCT CATCCAAGTT TTCCAGTCAC AATCTACTAC GCTTTCAAAC AGTCAGAAAC AAAGGAAACA GGTACAACTT CTACTGGTTG GGAAACATTT TTGGAAGCAG TGATTCAGGC TGGGTTCGGT ATTACTGGCA CTTGGCCGAT GCGGACAGAA CGAGGTGCAC GTTCAATTGG GATTGGAGCA AACGCTTTAG CTTCCTCTAT TATTTTAGTT TGCCGCAAAA GAGATAATAG TGCCGAATCT ATCTCCCGCC GTCAGTTTCA ACGGGAGTTG CGCGAAATCT TGCCGGAAGC CTTAGAAACC ATGATCGGAG GCAAGGAAGG GGCTTCTCCT GTGGCTCCGG TCGATCTGGC TCAGGCATCC ATCGGCCCTG GTATGGCTGT GTACTCCAAA TACGCGGCTG TACTCAACCA GGACGGCAAT CCCATGTCCG TGCATGACGC CCTGATCCTT ATCAACCGAG AGATAACTGA CTTCCTGACA CCAGACTCAG GCAGCTTCGA CAACGACACT CTGTTCTGCT CTACCTGGTT TGACCAGTAT GGCTGGAAGG CTGGCCCCTT CGGCGAGGCG GACACACTCT CCCGTGCAAA AGGCACCAGT GTCGATGGGG TTCAGGAAGC CGGTGTTGTC CAATCTGGTG GCGGGAAAGT GCGCCTGTTC AAATGGGATG AGTACCCGGA CGATTGGGAT CCCAAGAAGG ACAACCGCAC CCCGGTATGG GAGGCGCTCC ATCATTTGAT CCGCGCCTTG AACAAGGATG GTGAATCCGT CTCCGGCGGT CTTCTAGCCC GCATGCCCGA ACGCGCCGAG GCCATCCGCC AACTGGCCTA CCACCTCTAC ACCCTGTGCG AGCGGAAGAA ATGGGCTGAT GACGCCCGGG CCTACAATGA GTTGATCACC TCTTGGCACG GCATCGCCGC CGCTTCCCAC GAGGTTGGCC ATCTCGGAAC CCAATGCACC CTGGACCTGG GAGATTGA
|
Protein sequence | MSQNIKAPKK LIEVALPLDD INAAAAREKS IRHGHPSTLH LWWARRPLAA ARAVLFAQMV NDPGGERGYY AGKTKAQADA EREELFKIIR ELVLWENTNN EEVLNKARAA IRKSWRETCE LNKGKSGFDP DKLPAFHDPF AGGGAIPLEA QRLGMESHAS DLNPVAVLIN KAMIEIPPKF AGRKPVGPIP EGEKQGRMES DWPGATGLAE DVRRYGHWMR EEAFKRIGHL YPQVEITAEM AKERPDLKGI VGQKLTVIAW LWARTVRSPN PAFSHIAVPL VSSFLLSTKK GKEAYIEPVV DANSYYFSVK KGTPSKDSAR GTSAGKRGGF RCIFSDAPID YNYIRDEGSA GRIGTRLMAI VAEGVRGRIY LSATPELEII ANSAKPEWSP DVKLHGKCRV NVSNYGLDVY SDLFTPRQLV ALTTFSNLVQ EARVKAVNDA KITGMADDGM GIDEGGFGAG AYGDAVAVYL GFIVDKVSES LSTICTWSSS PKNELIVSTF RRQAIPMTWD FGEANPFANS SGSLEKIVPA VSKVIKTSLC GSVDGNAIQF DARTVNLSDR VVSTDPPYYD NIGYADLSDF FYVWSRRALK SIFPSLYSTL AVPKAEELVA TPYRHGSKEE AEAFFMNGMI CAINNFANQA HPSFPVTIYY AFKQSETKET GTTSTGWETF LEAVIQAGFG ITGTWPMRTE RGARSIGIGA NALASSIILV CRKRDNSAES ISRRQFQREL REILPEALET MIGGKEGASP VAPVDLAQAS IGPGMAVYSK YAAVLNQDGN PMSVHDALIL INREITDFLT PDSGSFDNDT LFCSTWFDQY GWKAGPFGEA DTLSRAKGTS VDGVQEAGVV QSGGGKVRLF KWDEYPDDWD PKKDNRTPVW EALHHLIRAL NKDGESVSGG LLARMPERAE AIRQLAYHLY TLCERKKWAD DARAYNELIT SWHGIAAASH EVGHLGTQCT LDLGD
|
| |
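The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any length or composition check. The following is a minimal sketch of that cleanup together with a GC percentage calculation, assuming GC content means the share of G and C bases among all bases. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not on the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used only as a small example.
first_blocks = "ATGAGCCAGA ACATCAAAGC"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 45.0
```

Applying the same two functions to the complete gene sequence would allow a comparison with the Gene Length and GC content fields.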