Gene MCA1049 details

Gene Information

Locus tag: MCA1049
Symbol:
ID: 3102274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1099135
End bp: 1100334
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 62%
IMG OID: 637170233
Product: hypothetical protein
Protein accession: YP_113524
Protein GI: 53804792
COG category: [R] General function prediction only
COG ID: [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:
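
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon; both assumptions agree with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1099135, 1100334)
print(length)                  # 1200
print(protein_length(length))  # 399
```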


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.157301
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAAT TGTCGGTATT GCGGCTGAAG AAGAACGAGG AGCGCCGTTT GCGGGCGGGG 
CACCTGTGGG TGTTCAGCAA TGAGGTGGAT GGCGAGAAGA CGCCGCTCAA GCGGTTTGCC
CCTGGCGAAT ACGTGATAGT GGAGGACTCC CGCCAACAGC CGTTGGGTCT GGCTTACGTC
AACCCGGAAT CGCTGATCTG CGCGCGTCTC CTGAGCCGCG ACCATCGCGT GGCGATCGAT
CATGCCTTTC TGCTGAAGCG CCTGAGCAGG GCCCTGCATC TGCGCGAAAT GCTTTTCGCC
AAGCCTTATT ACCGTCTGGT GTATGGTGAA AGCGACGGCC TGCCCGGTCT GGTGATCGAC
CGCCTGGGCG ACGTGCTGGT GCTGCAGGCC AGTACGCTCG GCGCGGAGCG GCTGCAGGAG
CAGGTGATGG ACGTGCTCGA CAAGCTGCTC TCACCCCGCA CCCTGGTGGT GAAGAACACT
TCGAGCCTGC GCCAGTTCGA GAAGCTCGAG AATTATGTCC GGGTGTTGGG TGCGCCGCTC
GAGGGTCCGA TACCGATCGA GGAGAACGGT GCGAGATTCC TGGTCGATCC AGTGGAGGGG
CAGAAGACCG GTTGGTTCTT CGACCATCGT CTGAACCGGG CGGTGGCGGC GCGGTTGTCG
AAGTGCCAGC GGGTGCTCGA TCTGTTCTCG TATACCGGGG GCTGGGGGGT GCAGGCGGCT
CTGGGCGGGG CGGAATCCGT GGATTGCGTG GACAGTTCCG AATCGGCGCT CGCGCTTGCC
GCTGAAAACG CCCGGCTCAA TGGCGTGGCG GACCGTATGG GCTTCATCCG CCAGGATGTG
TTCGAATTCC TGAAGCAGCT TCGGCACAAA CGCCAGCGCT ACGATCTGAT CGTGGCGGAT
CCGCCCGCCC TCATCAAGCG CAAGAAAGAC GTCAAGGCGG GAGTGGAAGC CTACCACCGG
CTCAACCAGG CGGCCATGCA AGTGCTGAAT CCCGGTGGGG TCCTGGTGTC GGCCTCCTGC
TCATTCAATC TGCCGCGCTC CACCCTGCAC GACATCCTGC GTACCAGCAG CCGGCATCTG
GATCGGCATT TGGTGATCCT GGGCCAGGGT TGCCAGGGGC CGGATCATCC GGTTCACCCC
GCGATTCCGG AAACCGAGTA CCTCAAGACC TTCTTCTGTC ACCTGTCGAT GCCGCTCTAG
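
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; the helper names `join_blocks` and `gc_fraction` are hypothetical. The example uses only the first printed line, so its result is not expected to match the whole-gene GC content field.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGTCTGAAT TGTCGGTATT GCGGCTGAAG AAGAACGAGG AGCGCCGTTT GCGGGCGGGG"
seq = join_blocks(first_line)
print(len(seq))                     # 60
print(round(gc_fraction(seq), 2))   # GC fraction of this line only
```

Passing all twenty sequence lines to `join_blocks` as one string would give the value for the full gene.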
 
Protein sequence
MSELSVLRLK KNEERRLRAG HLWVFSNEVD GEKTPLKRFA PGEYVIVEDS RQQPLGLAYV 
NPESLICARL LSRDHRVAID HAFLLKRLSR ALHLREMLFA KPYYRLVYGE SDGLPGLVID
RLGDVLVLQA STLGAERLQE QVMDVLDKLL SPRTLVVKNT SSLRQFEKLE NYVRVLGAPL
EGPIPIEENG ARFLVDPVEG QKTGWFFDHR LNRAVAARLS KCQRVLDLFS YTGGWGVQAA
LGGAESVDCV DSSESALALA AENARLNGVA DRMGFIRQDV FEFLKQLRHK RQRYDLIVAD
PPALIKRKKD VKAGVEAYHR LNQAAMQVLN PGGVLVSASC SFNLPRSTLH DILRTSSRHL
DRHLVILGQG CQGPDHPVHP AIPETEYLKT FFCHLSMPL
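
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below builds the codon table from the NCBI amino-acid string for table 11 (codons in TCAG order) and stops at the first stop codon. It is a simplified illustration: it does not handle the alternative start codons that table 11 allows, which does not matter for this gene because it begins with ATG. The name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, one per codon in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGTCTGAAT TGTCGGTATT GCGGCTGAAG"))  # MSELSVLRLK
```

The printed peptide matches the first block of the protein sequence above.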