Gene MCA1138 details

Gene Information

Locus tag: MCA1138
Symbol:
ID: 3103124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1193735
End bp: 1195318
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 69%
IMG OID: 637170323
Product: hydrogenase subunit
Protein accession: YP_113608
Protein GI: 53804773
COG category: [C] Energy production and conversion
COG ID: [COG3261] Ni,Fe-hydrogenase III large subunit
TIGRFAM ID:
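
The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1193735
END_BP = 1195318
GENE_LENGTH_BP = 1584
PROTEIN_LENGTH_AA = 527


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record: 1195318 - 1193735 + 1 = 1584 and 1584 / 3 - 1 = 527.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```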


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0311036
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCATCG ATCGGCTGAC GGCAAGCTTG CAGAGCCTGG GAACCCCGGA GTTCGGCAAC 
GAGAAGGCCC AGGCAGGCGC CGTAGGGCCC GCCCTGCTGT GTCCGGTCAT CCCCGAGGAC
TGGGGGGATG CCGCCGAAGC CGCCGCGGCC CAAGGCTGCC GCTGGGCGGG GGGCTGGGGC
GAGGCCCGCG GCGAAAGCTG GGTCGTCAAT GCCCTCCTCG AAAAATCCGG CGACTACCTG
CTGCTCAGAA CGGTGCTCCC GCCGGGCCGG ACCGGCCTGC CCTCGCACAC GCCGCACTTC
CCCGGCGCGG CCCGTCCCGA ACGTCATACG CGGGACCTGC TGGGGCTGAA ATTCATCGGC
CATCCCGACG ACCGGCGCTG GGTGCGCCAT CAGGCCTGGG GTGAACGCGA GTTCCCGCTG
CGGCCGGATT TTCCCCTGGC CGGCCAGCCA CCGGCCGTCA CGGCGCCGGA CCGGAGCTAT
GGGTTCGCCT CCGCCCATGG CCCCGAGGTG AACGAAATCC CGGTCGGCCC CGTCCATGCC
GGCATCATCG AACCGGGGCA TTTCAGATTC CTGGCGCTGG GAGAAACGGT GCTGAACCTG
GAAGAACGCC TCGGGTACGT GCACAAGGGT ATCGAGAAAC TGGCCGTGGG CCGTGATCCC
GAAGGACTGG CGCGACTGGC GGGGCGGGTA TCCGGCGACA CGACGGTGGG TCACGCCTGG
GCCGCGTGCC AGGCCATGGA ACACGCCGCC GGCGTTGCGC CGCCGGAGCG GGCCCTGTGG
CTGCGGGCGA TCTTCATCGA ACGGGAGCGG GTCGCCAATC ATCTCAACGA CATCGGCGCG
ATCTGCAACG ACACGGCCTT CGCCTTCGGT CACTCCCAGT TCAGCCGCCT GCGCGAACTC
TGGCTCCGTG ACAACCTGCG CTGGTTCGGC CATCGCCTGC TGATGGACCG GATCGTGCCG
GGCGGTGTCG CCGTGGACCT GCCGGCGGAA GCCACCAGCG CGATGCCGGC CGCCATGGCT
GCCTTGCGAA GCGAGCTGGA AGAACTGAAG CCGATATTGG ATGAAAGCAC GATCTTTCAG
GACCGCGTGG TCGGAGCGGG CGTGCTGTCG GAGCGCATCG TCCGGGAACT GGGCTGTCTG
GGCTACGTGG CCCGGGCCTG CGGCATCGGC CGCGACGTGA GGGAGCGGGC GCCGCACGCA
CCGTACGACC GGCTGGGGGT CAAGGCCGTC ACCCGCGGCG ACGGCGATGT CGCTGCCCGC
CTGTATGTCC GTTACGAAGA GCTGCTCGCT TCGCTGGACA TCCTCGACCA ATGCCTGCGG
CGGATCGAAC CGGGACCGCT GCGGGCCGAC TGGCGCACCC CTCCGGCAAA TGCCGAGGGA
CTGGGCCTGG TGGAGGGGTG GCGCGGCGAA ATCGCCACCT ATGTGCGCTT CGACGGCGCC
GGCAGCATCG TCCGCTTCTT CCCCCGCGAC CCCAGCGTCT TCAACTGGCC TGCCCTGGAA
AAACTCATCC TTGGCAACAT CGTGCCGGAC TTCCCTCTGT GCAACAAGTC GGTCAACGGC
TCGTATTCCG GCCACGACCT CTAG
 
Protein sequence
MVIDRLTASL QSLGTPEFGN EKAQAGAVGP ALLCPVIPED WGDAAEAAAA QGCRWAGGWG 
EARGESWVVN ALLEKSGDYL LLRTVLPPGR TGLPSHTPHF PGAARPERHT RDLLGLKFIG
HPDDRRWVRH QAWGEREFPL RPDFPLAGQP PAVTAPDRSY GFASAHGPEV NEIPVGPVHA
GIIEPGHFRF LALGETVLNL EERLGYVHKG IEKLAVGRDP EGLARLAGRV SGDTTVGHAW
AACQAMEHAA GVAPPERALW LRAIFIERER VANHLNDIGA ICNDTAFAFG HSQFSRLREL
WLRDNLRWFG HRLLMDRIVP GGVAVDLPAE ATSAMPAAMA ALRSELEELK PILDESTIFQ
DRVVGAGVLS ERIVRELGCL GYVARACGIG RDVRERAPHA PYDRLGVKAV TRGDGDVAAR
LYVRYEELLA SLDILDQCLR RIEPGPLRAD WRTPPANAEG LGLVEGWRGE IATYVRFDGA
GSIVRFFPRD PSVFNWPALE KLILGNIVPD FPLCNKSVNG SYSGHDL
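
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that with the Python standard library only. It assumes the sequence is given in the coding orientation, as listed above, and uses the standard amino acid assignments, which translation table 11 shares with table 1 (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are illustrative. `gc_percent` shows how a GC content figure like the one in the record could be recomputed; no result for the full sequence is claimed here.

```python
from itertools import product

# Standard amino acid assignments, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTCATCG ATCGGCTGAC GGCAAGCTTG CAGAGCCTGG GAACCCCGGA GTTCGGCAAC"
print(translate(first_line))  # MVIDRLTASLQSLGTPEFGN
```

The printed residues match the first twenty residues of the protein sequence. Translating the last line of the gene sequence (TCGTATTCCG GCCACGACCT CTAG) gives SYSGHDL followed by the stop codon, which matches the end of the protein sequence.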