Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA3008 |
Symbol | |
ID | 3103540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 3186991 |
End bp | 3190029 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637172134 |
Product | hypothetical protein |
Protein accession | YP_115396 |
Protein GI | 53802911 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCG TCAAAGCCCC GAAAAAACTC ATCGAAGTGG CGCTCCCCTT GGACGCCATC AACGAAGCCA GCGCGCGCGA GAAATCCATC CGCCACGGCC ACCCCTCCAC GCTACACCTG TGGTGGGCGA GGCGGCCGCT GGCGGCGGCG CGGGCGGTGA TCTTCGCCCA GATGGTCAAC GACCCCGGCT ACCAGCAGGG CGGCGGCTTT CGCTACGGCG TGAACAAGGA AAAAGCCCAG CTCGAACGCG AGCGCCTCTT CAAGATCATC GAGGAGCTCG TCCAGTGGGA GAACACCAAC AACGAAGCCG TGCTGTCCCG CGCCCGCGCC GAAATCTGGA AAAGCTGGCG CGAGACCTGC GAGCTCAACA AAAATCATCC CTGCGCCGCC GAGCTCTTCA ACCCCGACAA GCTCCCCGCC TTTCACGACC CCTTTGCCGG CGGCGGCGCG ATCCCGCTCG AAGCCCAGCG CCTGGGTCTG GAGAGCTACG CCTCCGACCT CAACCCCGTG GCGGTGACGA TCAACAAGGC CATGATCGAA ATCCCGCCGC GCTTCGCGGG CCGCGCGCCG GTCGGGCCTG TGCCCCCCTC TCCCGATGGG AGAGGGGTTG GGGGTGAGGG CCTGTTCGCG CAGGACTGGG CCGGCGCGAA AGGACTCGCC GAAGACGTGC GCCGCTACGG CGCGTGGATG CGCTCAGAAG CGGAAAAACG CATCGGCCAC CTCTACCCGC AGGTGGAAGT CACCCGCGAA CTCGCCCAGG GCCGACAGGA CCTGCAGCCG CTGGTGGGGC AGAAGCTCAC CGTCATCGCG TGGCTGTGGG CGCGCACGGT GAAGAGCCCC AATCCGGCCT TTTCCCACGT GGAGGTGCCG CTGGCTTCCA CCTTCGTGCT CTCCAGCAAG GCGGGCAAGG AAGCCTATGT GCAGCCCATG ATCTCCCCTC TCCCCCTGGG AGAGGGGCTG GGGGTGAGGG CCGGGAGTGA GGGCTATTAC CGCTTCACCG TGCAGGTGGC GGGCACGCCG GGGTTCGACA AGGCGGACTA TGCGCGGGCG AAGAGCGGTA CCAAGCTGGC ACGCGGCGCG AACTTCGAGT GCCTGCTGTC GAACACGCCC ATCGAACCGA ACCACATTTA CACTGAGGCG AACGCCGGGC GCATGGGCGC GCGGCTGATG GCCATCGTCG CCGAGGGCGC GCGCGGGCGC GTCTACCTGC CGCCACTGCC CGAGCACGAG GCGATCGCCC GGCAGGCGCA GCCGGAGTGG AAGCCGGAAG TCGCCATGCC TGATAACCCG CGCTGGTTCT CGCCACCGCT TTACGGTTTG AAGAATTACG GCGACCTCTT CACCCCCCGC CAGTTGGTGG CGCTGACCAC CTTTTCCGAC CTCGTGATTG ATGCCATCGA GCGCTGCCGC CGCGACGCCG CAGCCGCCGG CCTGCCCGAC GACGGCGTGC CGCTCGATGC CGGCGGCACC GGCGCCACCG CCTACGCCCA GGCGGTGGGG GTGTATTTGG CAATAGCAAT TAGCCGATTT TCGGACCGCA ATAATTCTAT TTGCACTTGG GATAGTGGGC CGACCGGCAC GAAAGCATCT ACTGGTGGCT CGGCGAGGAC AGCATCTTTG CGCAATTTGT TCGCACGTCA GGCCATTCCT ATGGCTTGGG ATTTCGGGGA AGCTAATCCC TTTAGCGATT CCGGTGGTGG CTTTTCTAGT GCCTTTGAAT GGATTGAGCC CGCGGTACGT TCACTTCGTG GCGGGTGTGC TGGATATGGT GACGGTGCTG ACGCTCAAAC CCAAACGCTC TCCCGCGACA AGGTCGTCTC CACCGACCCG CCGTATTACG ACAACATCGG CTATGCCGAT CTCTCGGACT TTTTCTACGT CTGGCTGCGC CGCAGCTTGA AGCCCATCTT CCCCGGCCTC TACGCCACGC TCGCCGTCCC CAAGGCCGAG GAACTCGTCG CCACCCCCTA CCGCCACGGC AGCAAGGAGG CGGCGGAAGC CTTCTTTCTC GACGGCATGC GTCGGGCGCT CAAGAACCTC GCCGAGCAGG CGCACCCGGC CTTTCCAGTG ACCATCTACT ACGCCTTCAA GCAGAGCGAG ACCACCGACG CCGCGGGGAC GTCTAGCACC GGCTGGGAGA CCTTCTTGCA GGCGGTGCTC GATGCCGGCT TTGCGCTCAC CGGCACCTGG CCGATGCGCA CTGAACTCGG CAACCGCATG ATCGGCGCGG GCACCAACGC GCTCGCCTCC AGCATCGTGC TGGTCTGCCG CCAGCGCGCG ACGGACGCCC CCACCGCCAG CCGCCGGGAG TTCCTGCGCG AGCTCAACGC CACGCTGCCG GAGGCCATCG CCGACATGAT CGGCGCCGAC CCCTCACCCC AACCCCTCTC CCCGAGGGAG AGGGGCTACG GTCGGGTGGC GCCGGTGGAC CTCTCGCAGG CCATCATCGG CCCGGGCATG GCGATCTTCT CGCAATACGC CGCGGTGCTG GAGGCCGACG GCACGCCGAT GACGGTGAAG ACGGCGCTTG CGCTCATCAA CCGCTTCCTC GCCGAAGACG ACTTCGACCA CGACACCCAG TTCTGCCTGC ACTGGTTCGA GCAGCAGGGC TGGGCCAGCG GCAAGTATGG CGAAGCCGAC GTGCTGGCGC GCGCCAAGGG CACGGCGGTG GATGCGCTGG TGGCCGCGGG CGTGGCGGAA TCCGCCAAGG GCAGCGTGCG CCTTTTGAAG TGGCCCGAGT ACCCCGCCGA CTGGTCGCCC GAGAGCGACA CCCGCACGCC CATCTGGGAA GCGCTGCACC AGCTCATCCG CGCGCTCAAC CAAGCGGGTG AAACCGAAGC CGGGCGGCTG CTGGCTCGCA TGCCCGCGCG CGCCGAGCCC ATCCGCGCGC TCGCCTACCG GCTCTACACC CTGTGCGAAC GCAAGGGCTG GGCGGAGGAT GCCCGCGCCT ACAACGAGCT CGTCACCGCC TGGAGCGGCA TCGAGCAGGC GGCCAACGAG GCCGGCGTGG TCGGCGCGCA GATGCAACTG GAACTCTGA
|
Protein sequence | MTTVKAPKKL IEVALPLDAI NEASAREKSI RHGHPSTLHL WWARRPLAAA RAVIFAQMVN DPGYQQGGGF RYGVNKEKAQ LERERLFKII EELVQWENTN NEAVLSRARA EIWKSWRETC ELNKNHPCAA ELFNPDKLPA FHDPFAGGGA IPLEAQRLGL ESYASDLNPV AVTINKAMIE IPPRFAGRAP VGPVPPSPDG RGVGGEGLFA QDWAGAKGLA EDVRRYGAWM RSEAEKRIGH LYPQVEVTRE LAQGRQDLQP LVGQKLTVIA WLWARTVKSP NPAFSHVEVP LASTFVLSSK AGKEAYVQPM ISPLPLGEGL GVRAGSEGYY RFTVQVAGTP GFDKADYARA KSGTKLARGA NFECLLSNTP IEPNHIYTEA NAGRMGARLM AIVAEGARGR VYLPPLPEHE AIARQAQPEW KPEVAMPDNP RWFSPPLYGL KNYGDLFTPR QLVALTTFSD LVIDAIERCR RDAAAAGLPD DGVPLDAGGT GATAYAQAVG VYLAIAISRF SDRNNSICTW DSGPTGTKAS TGGSARTASL RNLFARQAIP MAWDFGEANP FSDSGGGFSS AFEWIEPAVR SLRGGCAGYG DGADAQTQTL SRDKVVSTDP PYYDNIGYAD LSDFFYVWLR RSLKPIFPGL YATLAVPKAE ELVATPYRHG SKEAAEAFFL DGMRRALKNL AEQAHPAFPV TIYYAFKQSE TTDAAGTSST GWETFLQAVL DAGFALTGTW PMRTELGNRM IGAGTNALAS SIVLVCRQRA TDAPTASRRE FLRELNATLP EAIADMIGAD PSPQPLSPRE RGYGRVAPVD LSQAIIGPGM AIFSQYAAVL EADGTPMTVK TALALINRFL AEDDFDHDTQ FCLHWFEQQG WASGKYGEAD VLARAKGTAV DALVAAGVAE SAKGSVRLLK WPEYPADWSP ESDTRTPIWE ALHQLIRALN QAGETEAGRL LARMPARAEP IRALAYRLYT LCERKGWAED ARAYNELVTA WSGIEQAANE AGVVGAQMQL EL
|
| |