Gene MCA1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1840 
Symbol 
ID3102351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1968909 
End bp1970135 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content68% 
IMG OID637170999 
Producthypothetical protein 
Protein accessionYP_114277 
Protein GI53803886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGATG CCGTGCGCCA ATTGCGCAGT GTCGCGGAGC TTGCCTGTCG AGACCTGCTC 
CACGAATGGC GGGTCTCGCT CTGCCTGGCG CTGGCGATCG CCGCCGTGCT GGCGCCCTTG
CTGGTGCTGT TCGGCCTGAA GTCCGGCATC GTCGATACCC TGACGGCGCA GATGAAATCG
GATCCGCGCA ACCTCGAGAT CGTCTGGCGG CTGAACGGTT CACTGGACCG CGATTGGCTG
GAACGGCTGC GCGCGAATCC GCGAGTCGGC TTCGCCGTGC CCAGCACCCG CACCCTCGCC
GCCACCCTGG ACCTTGCCGC CGCCGACGGG AAGGGTCTGG AGGATGTCGA CCTGATGCCT
ACGGATGCGG GCGATCCGCT GTTCGGCGTC GAGGATGCAG TTCCGCAAGG GCTGGGCGAG
CTGGCTTTGA CCCACGAGGC GGCAGACAAA CTCGGCGTTT CGGCCGGCAT GAGCGTGGAA
GGCGTGGTCT ACCGCAATCT CCACCAGCAG CGGCAGGTGT TGAGGCTGTC GCTGCGGGTG
AGCGCGGTGC TGGCCGAATC CGTTCATTCC GGCTGGGGTG CGTTCGTGTC CTTACCCCTC
TTGGAAGCCC TGGAGCATTA CCGCGACGGT TACGCCGTGC CCGAGCTGGG CGTCGCCGAC
GGGGCGTCTC CCACCTCCGG TGCGTCCCGC TACGCCCGGC TCCGGCTCTA TGCGCGCAGT
CTGGAAGCCG TGCCGGGGCT GGCCGAATCG CTGCGCGCCC AAGGCTACGA CGTTTCCACC
CGCAGCAAGG ACATCGAACT GGTCAAGAAC ATCGACCACG CCCTGAGCTT CCTGTTCCGC
CTGATCGCCG GAGGCGGGAT CGCCGGCTGC GTGCTGTCGC TGGGTGCCAG CCTCTGGGCC
AGCGTCGAGC GCAAGCGGCG CGACCTGGCG CTGCTGCGCC TGGTCGGCAT CCGCAATACC
GTGCTTGCCG GCTTCCCCGC CATCCAGGCC GCAGGCGTCG CCGCGGCTGG AATCGCGCTG
GCGTTCGCCG CCTATTTCGC GGCAGCCGAG GCGATAAACC GGACCTTCCG GGCGGATCTG
AGCCGGGAAG AGTTCGTATG CCGTCTGCTG CCGAATGACG GAGTGACGGC CGCTTTTCTG
ACCGAAAGCC TGGCCGTTCT GGCTGCACTG ATCGCCGTGA CGGCCGTGCT GCGGATCGAA
CCGGGGGAGA GCCTGCGGGA GAACTAA
 
Protein sequence
MRDAVRQLRS VAELACRDLL HEWRVSLCLA LAIAAVLAPL LVLFGLKSGI VDTLTAQMKS 
DPRNLEIVWR LNGSLDRDWL ERLRANPRVG FAVPSTRTLA ATLDLAAADG KGLEDVDLMP
TDAGDPLFGV EDAVPQGLGE LALTHEAADK LGVSAGMSVE GVVYRNLHQQ RQVLRLSLRV
SAVLAESVHS GWGAFVSLPL LEALEHYRDG YAVPELGVAD GASPTSGASR YARLRLYARS
LEAVPGLAES LRAQGYDVST RSKDIELVKN IDHALSFLFR LIAGGGIAGC VLSLGASLWA
SVERKRRDLA LLRLVGIRNT VLAGFPAIQA AGVAAAGIAL AFAAYFAAAE AINRTFRADL
SREEFVCRLL PNDGVTAAFL TESLAVLAAL IAVTAVLRIE PGESLREN