Gene MCA3008 details


Gene Information

Locus tag: MCA3008
Symbol:
ID: 3103540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 3186991
End bp: 3190029
Gene Length: 3039 bp
Protein Length: 1012 aa
Translation table: 11
GC content: 67%
IMG OID: 637172134
Product: hypothetical protein
Protein accession: YP_115396
Protein GI: 53802911
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
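
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3186991
end_bp = 3190029

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "3039 1012"
```

Under these assumptions the result agrees with the listed Gene Length (3039 bp) and Protein Length (1012 aa).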


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCG TCAAAGCCCC GAAAAAACTC ATCGAAGTGG CGCTCCCCTT GGACGCCATC 
AACGAAGCCA GCGCGCGCGA GAAATCCATC CGCCACGGCC ACCCCTCCAC GCTACACCTG
TGGTGGGCGA GGCGGCCGCT GGCGGCGGCG CGGGCGGTGA TCTTCGCCCA GATGGTCAAC
GACCCCGGCT ACCAGCAGGG CGGCGGCTTT CGCTACGGCG TGAACAAGGA AAAAGCCCAG
CTCGAACGCG AGCGCCTCTT CAAGATCATC GAGGAGCTCG TCCAGTGGGA GAACACCAAC
AACGAAGCCG TGCTGTCCCG CGCCCGCGCC GAAATCTGGA AAAGCTGGCG CGAGACCTGC
GAGCTCAACA AAAATCATCC CTGCGCCGCC GAGCTCTTCA ACCCCGACAA GCTCCCCGCC
TTTCACGACC CCTTTGCCGG CGGCGGCGCG ATCCCGCTCG AAGCCCAGCG CCTGGGTCTG
GAGAGCTACG CCTCCGACCT CAACCCCGTG GCGGTGACGA TCAACAAGGC CATGATCGAA
ATCCCGCCGC GCTTCGCGGG CCGCGCGCCG GTCGGGCCTG TGCCCCCCTC TCCCGATGGG
AGAGGGGTTG GGGGTGAGGG CCTGTTCGCG CAGGACTGGG CCGGCGCGAA AGGACTCGCC
GAAGACGTGC GCCGCTACGG CGCGTGGATG CGCTCAGAAG CGGAAAAACG CATCGGCCAC
CTCTACCCGC AGGTGGAAGT CACCCGCGAA CTCGCCCAGG GCCGACAGGA CCTGCAGCCG
CTGGTGGGGC AGAAGCTCAC CGTCATCGCG TGGCTGTGGG CGCGCACGGT GAAGAGCCCC
AATCCGGCCT TTTCCCACGT GGAGGTGCCG CTGGCTTCCA CCTTCGTGCT CTCCAGCAAG
GCGGGCAAGG AAGCCTATGT GCAGCCCATG ATCTCCCCTC TCCCCCTGGG AGAGGGGCTG
GGGGTGAGGG CCGGGAGTGA GGGCTATTAC CGCTTCACCG TGCAGGTGGC GGGCACGCCG
GGGTTCGACA AGGCGGACTA TGCGCGGGCG AAGAGCGGTA CCAAGCTGGC ACGCGGCGCG
AACTTCGAGT GCCTGCTGTC GAACACGCCC ATCGAACCGA ACCACATTTA CACTGAGGCG
AACGCCGGGC GCATGGGCGC GCGGCTGATG GCCATCGTCG CCGAGGGCGC GCGCGGGCGC
GTCTACCTGC CGCCACTGCC CGAGCACGAG GCGATCGCCC GGCAGGCGCA GCCGGAGTGG
AAGCCGGAAG TCGCCATGCC TGATAACCCG CGCTGGTTCT CGCCACCGCT TTACGGTTTG
AAGAATTACG GCGACCTCTT CACCCCCCGC CAGTTGGTGG CGCTGACCAC CTTTTCCGAC
CTCGTGATTG ATGCCATCGA GCGCTGCCGC CGCGACGCCG CAGCCGCCGG CCTGCCCGAC
GACGGCGTGC CGCTCGATGC CGGCGGCACC GGCGCCACCG CCTACGCCCA GGCGGTGGGG
GTGTATTTGG CAATAGCAAT TAGCCGATTT TCGGACCGCA ATAATTCTAT TTGCACTTGG
GATAGTGGGC CGACCGGCAC GAAAGCATCT ACTGGTGGCT CGGCGAGGAC AGCATCTTTG
CGCAATTTGT TCGCACGTCA GGCCATTCCT ATGGCTTGGG ATTTCGGGGA AGCTAATCCC
TTTAGCGATT CCGGTGGTGG CTTTTCTAGT GCCTTTGAAT GGATTGAGCC CGCGGTACGT
TCACTTCGTG GCGGGTGTGC TGGATATGGT GACGGTGCTG ACGCTCAAAC CCAAACGCTC
TCCCGCGACA AGGTCGTCTC CACCGACCCG CCGTATTACG ACAACATCGG CTATGCCGAT
CTCTCGGACT TTTTCTACGT CTGGCTGCGC CGCAGCTTGA AGCCCATCTT CCCCGGCCTC
TACGCCACGC TCGCCGTCCC CAAGGCCGAG GAACTCGTCG CCACCCCCTA CCGCCACGGC
AGCAAGGAGG CGGCGGAAGC CTTCTTTCTC GACGGCATGC GTCGGGCGCT CAAGAACCTC
GCCGAGCAGG CGCACCCGGC CTTTCCAGTG ACCATCTACT ACGCCTTCAA GCAGAGCGAG
ACCACCGACG CCGCGGGGAC GTCTAGCACC GGCTGGGAGA CCTTCTTGCA GGCGGTGCTC
GATGCCGGCT TTGCGCTCAC CGGCACCTGG CCGATGCGCA CTGAACTCGG CAACCGCATG
ATCGGCGCGG GCACCAACGC GCTCGCCTCC AGCATCGTGC TGGTCTGCCG CCAGCGCGCG
ACGGACGCCC CCACCGCCAG CCGCCGGGAG TTCCTGCGCG AGCTCAACGC CACGCTGCCG
GAGGCCATCG CCGACATGAT CGGCGCCGAC CCCTCACCCC AACCCCTCTC CCCGAGGGAG
AGGGGCTACG GTCGGGTGGC GCCGGTGGAC CTCTCGCAGG CCATCATCGG CCCGGGCATG
GCGATCTTCT CGCAATACGC CGCGGTGCTG GAGGCCGACG GCACGCCGAT GACGGTGAAG
ACGGCGCTTG CGCTCATCAA CCGCTTCCTC GCCGAAGACG ACTTCGACCA CGACACCCAG
TTCTGCCTGC ACTGGTTCGA GCAGCAGGGC TGGGCCAGCG GCAAGTATGG CGAAGCCGAC
GTGCTGGCGC GCGCCAAGGG CACGGCGGTG GATGCGCTGG TGGCCGCGGG CGTGGCGGAA
TCCGCCAAGG GCAGCGTGCG CCTTTTGAAG TGGCCCGAGT ACCCCGCCGA CTGGTCGCCC
GAGAGCGACA CCCGCACGCC CATCTGGGAA GCGCTGCACC AGCTCATCCG CGCGCTCAAC
CAAGCGGGTG AAACCGAAGC CGGGCGGCTG CTGGCTCGCA TGCCCGCGCG CGCCGAGCCC
ATCCGCGCGC TCGCCTACCG GCTCTACACC CTGTGCGAAC GCAAGGGCTG GGCGGAGGAT
GCCCGCGCCT ACAACGAGCT CGTCACCGCC TGGAGCGGCA TCGAGCAGGC GGCCAACGAG
GCCGGCGTGG TCGGCGCGCA GATGCAACTG GAACTCTGA
 
Protein sequence
MTTVKAPKKL IEVALPLDAI NEASAREKSI RHGHPSTLHL WWARRPLAAA RAVIFAQMVN 
DPGYQQGGGF RYGVNKEKAQ LERERLFKII EELVQWENTN NEAVLSRARA EIWKSWRETC
ELNKNHPCAA ELFNPDKLPA FHDPFAGGGA IPLEAQRLGL ESYASDLNPV AVTINKAMIE
IPPRFAGRAP VGPVPPSPDG RGVGGEGLFA QDWAGAKGLA EDVRRYGAWM RSEAEKRIGH
LYPQVEVTRE LAQGRQDLQP LVGQKLTVIA WLWARTVKSP NPAFSHVEVP LASTFVLSSK
AGKEAYVQPM ISPLPLGEGL GVRAGSEGYY RFTVQVAGTP GFDKADYARA KSGTKLARGA
NFECLLSNTP IEPNHIYTEA NAGRMGARLM AIVAEGARGR VYLPPLPEHE AIARQAQPEW
KPEVAMPDNP RWFSPPLYGL KNYGDLFTPR QLVALTTFSD LVIDAIERCR RDAAAAGLPD
DGVPLDAGGT GATAYAQAVG VYLAIAISRF SDRNNSICTW DSGPTGTKAS TGGSARTASL
RNLFARQAIP MAWDFGEANP FSDSGGGFSS AFEWIEPAVR SLRGGCAGYG DGADAQTQTL
SRDKVVSTDP PYYDNIGYAD LSDFFYVWLR RSLKPIFPGL YATLAVPKAE ELVATPYRHG
SKEAAEAFFL DGMRRALKNL AEQAHPAFPV TIYYAFKQSE TTDAAGTSST GWETFLQAVL
DAGFALTGTW PMRTELGNRM IGAGTNALAS SIVLVCRQRA TDAPTASRRE FLRELNATLP
EAIADMIGAD PSPQPLSPRE RGYGRVAPVD LSQAIIGPGM AIFSQYAAVL EADGTPMTVK
TALALINRFL AEDDFDHDTQ FCLHWFEQQG WASGKYGEAD VLARAKGTAV DALVAAGVAE
SAKGSVRLLK WPEYPADWSP ESDTRTPIWE ALHQLIRALN QAGETEAGRL LARMPARAEP
IRALAYRLYT LCERKGWAED ARAYNELVTA WSGIEQAANE AGVVGAQMQL EL
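
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any processing. The following is a minimal sketch, not the method used to produce this record: it strips the whitespace, computes GC content, and translates DNA using the standard codon assignments, which translation table 11 shares for sense and stop codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_blocks = "ATGACCACCG TCAAAGCCCC GAAAAAACTC"
print(translate(first_blocks))  # prints "MTTVKAPKKL"
```

The translated prefix matches the first block of the listed protein sequence. Pasting the full gene sequence into `gc_content` and `translate` would let the listed GC content and protein sequence be compared in the same way.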