Gene MCA2684 details


Gene Information

Locus tag: MCA2684
Symbol:
ID: 3103056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2866650
End bp: 2869787
Gene Length: 3138 bp
Protein Length: 1045 aa
Translation table: 11
GC content: 63%
IMG OID: 637171818
Product: hypothetical protein
Protein accession: YP_115088
Protein GI: 53803231
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
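
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(2866650, 2869787)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

With the values from this record, the two results agree with the listed Gene Length (3138 bp) and Protein Length (1045 aa).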


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAATCGG TCGATTTCAG CGATCCGGGC CGTCCCAAGA CCTGCCTGGA GGTGGACTTC 
CCGATCCTGC CCGTCAACCA GGTGGCAGTG ATTGAGGGCA ATGCAGGCAA GCCGATCTAC
CAGATGTCCA AGTGGTGGGC GCGGCGGCGT TCCAGCGTGT TCCGCTCGAT GCTCATCGCG
GCAGCGGCAA AGGCGCCGGA CGACCCGGCT CACGCGGCCC GGCTGGTGTG GGACAACTAC
TACGCCAACC ACCAGAAGAA GGGTTCGTTC AAGCACCTGA AGGTGGCCGA CATCTTCATG
GGCGGCGGGA CCACGCTGGT GGAAGGATCG CGCCTGGGCA TGCAGATGGT CGGCAACGAC
CTAAACCCGG TCGCGTGGTT CGTGGTCAAG CAGGAACTCG CGAACGTCGA CCTGGGGGAA
GTAAAGAAAC TCCTCGCGGA CGTCGAGGCC GAGGTCAAGC CGCAGATCAT GCCTTACTAC
TACTGTGACG GCCCGGAAGG TGAGAAAGGC ACATGGACAC ACCTGCCGAC TAAGAAGGTG
ATGCCCGCGG ACTTTGACCC GCTCGTCATC CCGCGTGATG AGCGCAAGGA CTACCGCTAC
GAAGGCCCGG AGATCATCTA CACGTTCTGG GCCAAGCATG GTCCCTGTCA GGTGACCGGC
TGCGGCCACC GGACACCAAT CATGTCGAGC CCCGTGATGG CAGTGAAGAC TCTCAGCGTC
AAACACTGGG AGCACACCTG CGGCAAGTGT GGCGGCGAGT TCCACGTCGA GGAAGAAGCG
GCACGGATGG CGCCCGATGT GCCGCTGTAC GTCGCCCCAT CCGAATACCC ATTCTCCGTC
CTCGACCGCA AGAAGGGCGT GATCTGCCCG CATTGTGGCC ACACAGCCCT AGTCAATCTC
GGCAAGGGCA AGAACAAGAA GGTGGAGCTG AGTCTGTTGG TGCATCCGCA GTGGCTGGCC
GGCGAGGCCA AGCAGGATGT CAGCGGCCAG CCCTACGGCG GCTCGGCGCA AGATGACGCG
GCTTCTACCG CGCGCTGGAA CCATGCGCGC GCCGCCAAGA TCCGCCTGCT GGAGGTGCGC
GGAGCCCTGC CGGATCAGGT GACCTGCCCC GAGACGGGGG TGACCTTCCG CACCGACAAG
GGCACGGTGC CCAAGAAATC CAACTACGCC TGCGGCGCCT GCGGCACGGC GCAGGACGTG
CTCACCACGG TGAAGGCCAG TGGTAAGACT GGGCCGATGG CTGCCTATGC GGTGCAGGGC
TATGCACCCA AACGTGACGA AGCGCGCAAA CCCTACAACG GCCGCTTCTT CGCACCTCTC
GATGAGCGAC TGGCCCGCCA GTACGACGCC GCCTCCGAGG AATGGGAGGC GCGGAAGGAT
ACCGACCTCA AGGACTACTG GCCACGTTCC GCCGTGCCCT ACGGCTTCAT GACCGGCATC
GCCAACGGCG ACATCCGAGA AGGCCACGGC TTCACGCATT GGTGGACGAT GTTCAACCCG
CGCCAACTGC TGGTGCATGC GCAGCTGCTC AAAGCTATCG TCGAAGTCGG TAACTACGAC
TGGACTGTGC GTGAGTATGT GCTGGGGGCG TTTCAGCAGT ATTTGCGGAA CCAGTGTCTG
TTCAGCTTCT GGAATCCGCA GCGCGATACC CCAGAGCCGA TGTTCTCGAA CAACAACTAC
CACCCTAAGT CGACCGTGGT TGAGAACTGC GTGTTTCCGG CGTTGGGTCG AGGGAATTGG
GCTTCAAGTG TCGAAGGCAT TCTCGAAGGT CGCGATTGGG CCATCGATCC CTGGGAAGCG
GTGAGCGCCG AGGCCCTCCG GCGCAAGGAC AACGCGCTGG CTGGTGAAAT AAGCGGCAAG
AGCGAAAAGG TCTTCCCTGG TGATCCGGTG GGCGATGTCA CCGTATTTCA GGGTTCCTCG
ACGGACCTCG CACGGATTGA AGCCGGCAGC CTTGATCTGG TCATCACCGA CCCACCCTTC
GGTGGCCTGC TGCACTACTC GGAACTGGCC GACTTCTTCT ACGTTTGGCT GCGTCTGGTG
CTCAAGGGCA AGTACCCGGA GTATTTCAGC GCGGACTACA CGCCCAAGTC ACTGGAGGCG
GTGGCCAACA AGGCCCGCGA GCCGGAAGAC CCGGACGGCT TTTATCAGCG ACTGCTCACC
CAGTGCTGGC GCGAGGCGCA CCGCATTCTC AAGCCCGGTG GCATCCTCGC CTTCACCTTC
CACCACAGCG AGGACGAACC CTGGGTGGCA GTGCTGGAGT CCCTGTTCGA TGCGGGCTTC
TACCTTGAGG CCACCTACCC AATCCGCTCG GACGAGACCA AGGGCGAAGG AGAGTTCGGC
TCCAAGACCA TCGAGTACGA CATCATTCAC GTCTGTCGCA AGCGCACCGA AGAGCCGAAG
CCCGTGAGCT GGGGCCGCAT GCGTCGCGAG GTCATGGCCG ATGTGCGCCA GCTTCAGGCA
ATGCTGGAGA ACCACGCCAA AGAGGGCTTG CCCGCTGCCG ACATCCAGGT GATACGGCGA
GGCAAGGCAC TGGAGTACTT CTCGCGCCAC TACGGCAAGG TCTACGTGGA CGAAGGCCGC
ACCATCACGG TGAAGGACGC GCTTGTTGGT ATCAACCAGC TGATCGACGA AGACGCCGAC
AAGGGTAAGG AGCCGCCGCC CGTCAACGCC GAGCCGATCA CACGCCAGTT TCTGCGCACC
TTCGGCGCGG CCGGCGAGCT CAAGCGCGAC CAGCTCCAGA AGTTTCTGCG CGGCACCATC
ACCACACCGG ACGACTTCGT CCAGCGCGGC TGGTGCGCGG AGAAGAACAA GGTCTTCACG
CGCACCAATC CGCTGGATTT CGCCCGAGAG TGGTCGGGCA AGCACCGGCG GCGCCTGACG
TCCGACTTGG ACCAGGCGCT GGTACTGATC GGCGCCTGCT TCGACGGCAG CGGCATCAAT
GCGTCGGACA CGCTCAAGAA CGAGAACTTC AAGCCGCACG TGGCGCTGAA GCCTCTGCTG
GAGTGGCTGC ATCGCAACGG CCCGGAACAG GCCACGCGCA ACGCAGCCTC CCGCGCCGTG
TCGATCTACA ACACTTGGCA CGCCAGCCAA GCGGTGAAGC CCACCCAGGG ATCGCTCTTT
GAGGAATACG AGCTATGA
 
Protein sequence
MESVDFSDPG RPKTCLEVDF PILPVNQVAV IEGNAGKPIY QMSKWWARRR SSVFRSMLIA 
AAAKAPDDPA HAARLVWDNY YANHQKKGSF KHLKVADIFM GGGTTLVEGS RLGMQMVGND
LNPVAWFVVK QELANVDLGE VKKLLADVEA EVKPQIMPYY YCDGPEGEKG TWTHLPTKKV
MPADFDPLVI PRDERKDYRY EGPEIIYTFW AKHGPCQVTG CGHRTPIMSS PVMAVKTLSV
KHWEHTCGKC GGEFHVEEEA ARMAPDVPLY VAPSEYPFSV LDRKKGVICP HCGHTALVNL
GKGKNKKVEL SLLVHPQWLA GEAKQDVSGQ PYGGSAQDDA ASTARWNHAR AAKIRLLEVR
GALPDQVTCP ETGVTFRTDK GTVPKKSNYA CGACGTAQDV LTTVKASGKT GPMAAYAVQG
YAPKRDEARK PYNGRFFAPL DERLARQYDA ASEEWEARKD TDLKDYWPRS AVPYGFMTGI
ANGDIREGHG FTHWWTMFNP RQLLVHAQLL KAIVEVGNYD WTVREYVLGA FQQYLRNQCL
FSFWNPQRDT PEPMFSNNNY HPKSTVVENC VFPALGRGNW ASSVEGILEG RDWAIDPWEA
VSAEALRRKD NALAGEISGK SEKVFPGDPV GDVTVFQGSS TDLARIEAGS LDLVITDPPF
GGLLHYSELA DFFYVWLRLV LKGKYPEYFS ADYTPKSLEA VANKAREPED PDGFYQRLLT
QCWREAHRIL KPGGILAFTF HHSEDEPWVA VLESLFDAGF YLEATYPIRS DETKGEGEFG
SKTIEYDIIH VCRKRTEEPK PVSWGRMRRE VMADVRQLQA MLENHAKEGL PAADIQVIRR
GKALEYFSRH YGKVYVDEGR TITVKDALVG INQLIDEDAD KGKEPPPVNA EPITRQFLRT
FGAAGELKRD QLQKFLRGTI TTPDDFVQRG WCAEKNKVFT RTNPLDFARE WSGKHRRRLT
SDLDQALVLI GACFDGSGIN ASDTLKNENF KPHVALKPLL EWLHRNGPEQ ATRNAASRAV
SIYNTWHASQ AVKPTQGSLF EEYEL
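
The gene sequence begins with TTG while the protein begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) would give when TTG is used as an initiation codon. The following is a minimal sketch of that translation and of a GC-fraction calculation. It assumes the codon assignments of the standard code (which table 11 shares) and the start-codon set as listed for table 11 by NCBI. The names `translate`, `gc_fraction` and `clean` are hypothetical helpers, not functions from any library.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: initiation codons permitted by table 11; each is read as Met
# when it is the first codon of the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
print(translate("TTGGAATCGG TCGATTTCAG CGATCCGGGC"))  # MESVDFSDPG
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed here, and `gc_fraction` gives the value that the record reports rounded as GC content.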