Gene M446_3089 details


Gene Information

Locus tag: M446_3089
Symbol:
ID: 6130721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3418615
End bp: 3419634
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 67%
IMG OID: 641643280
Product: formate dehydrogenase, gamma subunit
Protein accession: YP_001769933
Protein GI: 170741278
COG category: [C] Energy production and conversion
COG ID: [COG2864] Cytochrome b subunit of formate dehydrogenase
TIGRFAM ID: [TIGR01583] formate dehydrogenase, gamma subunit
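
The start, end, and length values in this record agree with each other if the coordinates are 1-based and inclusive and if the CDS ends in a stop codon that is not counted in the protein length. Both points are assumptions about how the record is encoded, not something the page states. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3418615, 3419634)
print(length)                     # 1020
print(protein_length_aa(length))  # 339
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.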


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.971342
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.037846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCGAG GTCTGACGCA TCTCGGGATC CTGGTGATGG CCGCCCTGAT CTGGGCGGTC 
GTCGGTCTGG CCGGCCCGGC CCGCGCGGTC GACGCCCCGG ACGGCGCCCC GAACCCCACC
ACCTCCTCGG TCAACGAGGA CCTGCTGTTC CGGCAGGCCC CCAAGATCGG CGGGCGCATC
TCGATTCCCG ACCAGAAGGC CGCGAACCTG ATCCAGCCGC AAGGGCGCGA GTGGCGGAGG
TTCCACGAAT CCTGGATGCC CTGGGTCGGG GGATTGGTCA TCCTCGGCAT CGTGGCGGCG
CTCGCCCTGT TCTACTTCAC CCGCGGGCGC ATCCGACTCG ACCACACCGA GGAATCCGGC
CGCAAGCTGC TGCGCTTCAA CGTGTTCGAG CGCTTCTCGC ACTGGATGAC GGCGTTCTGC
TTCATCCTGC TGGGGCTGTC GGGGTTGAAC TACATCTTCG GCAAGCGCCT GCTGATGCCG
CTGATCGGGC CCGACGCCTT CGCGGCCCTG TCGCAATGGG CGAAGTACGC GCACGTCTTC
CTGGCCTGGC CGTTCATGCT CGGCGTGCTG TTCATGGCCG TGCTGTGGGT GCGCGACAAC
ATCCCCAACC GCATCGACAT CGAGTGGCTG AAGAAGGGCG GCGGCTTCCT GAGCGACGCG
CATCCCCACG CCGAGCGCTT CAATGCCGGC CAGAAGCTGG TCTTCTGGAT GGTGGTCGGC
TTCGGCACCG CCATGGGCGC CACCGGGCTG ATGATGCTGT TTCCCTTCGC GCTCACCGAC
ATCAACGGGA TGCAGGTGAT GCAGGTGGTG CACTCGCTGA TCGGCGTCGT CTTCGTCGCC
GGCATCCTGG CCCACATCTA CATCGGCTCG CTCGGGATGG AGGGCGCCTA CGACGCCATG
GGCAGCGGCG AGGTCGATAT CGCCTGGGCG CGGGTGCACC ACGACCTCTG GGTCAAGGAG
CAGCTCGCCA AGAACGCGGA CGGGCCTCAG CTCGGCCGCG GGCAGGTGCC GGCGGAATAG
 
Protein sequence
MRRGLTHLGI LVMAALIWAV VGLAGPARAV DAPDGAPNPT TSSVNEDLLF RQAPKIGGRI 
SIPDQKAANL IQPQGREWRR FHESWMPWVG GLVILGIVAA LALFYFTRGR IRLDHTEESG
RKLLRFNVFE RFSHWMTAFC FILLGLSGLN YIFGKRLLMP LIGPDAFAAL SQWAKYAHVF
LAWPFMLGVL FMAVLWVRDN IPNRIDIEWL KKGGGFLSDA HPHAERFNAG QKLVFWMVVG
FGTAMGATGL MMLFPFALTD INGMQVMQVV HSLIGVVFVA GILAHIYIGS LGMEGAYDAM
GSGEVDIAWA RVHHDLWVKE QLAKNADGPQ LGRGQVPAE
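
The sequences in this section are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that in Python and to compute a GC percentage; the function names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_groups(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGGCGAG GTCTGACGCA TCTCGGGATC CTGGTGATGG CCGCCCTGAT CTGGGCGGTC "
seq = join_groups(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the joined length can be compared with the Gene Length field, and the rounded GC percentage with the GC content field. The record presumably rounds its GC value, so an exact match to more decimal places should not be expected.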