Gene MCA0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0166 
Symbol 
ID3102900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp177012 
End bp178100 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content64% 
IMG OID637169389 
Productnickel-iron hydrogenase, small subunit 
Protein accessionYP_112703 
Protein GI53802530 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.410491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACGA CCGCACGCCC GGACACGTTC TACGATGTGA TGCGCCGCCA AGGGGTGACC 
CGGCGCAGTT TTCTCAAGTT CTGCGGCTTG ACCGCATCGG CCCTCGCACT GGGGCCGGAA
TTCATCGGCA CGATCGCCCA CGCCATGGAG ACCAAGCCGC GCACGCCGGT GCTGTGGCTG
CATGGCCTGG AATGCACCTG CTGTTCCGAA TCCTTCATCC GTTCGGCCCA CCCGCTGGCC
AAGGACGTGG TGCTGTCCAT GCTCTCGCTG GACTACGACG ACACCCTCAT GGCGGCCGCC
GGCTTCCAGG CGGAAGCCAT GCTGGAAGAC ACCATGCAGA AGTACAAAGG CCGCTACATC
CTGGCCGTGG AGGGCAACCC GCCGCTGAAC GAGGACGGCA TGTTCTGCAT CGTCGGCGGC
AAACCCTTCA TCGAACGGCT GCGCTATGCC GCCAAGGACG CCGCCGCCGT CATCGCCTGG
GGATCCTGCG CCTCCAATGG CTGCGTGCAG GCGGCCCGCC CCAACCCGAC CCAGGCCACG
CCGATCCACA AGGTCATCAC CGACAAGCCC ATCATCAAGG TGCCCGGCTG TCCGCCCATC
GCCGAGGTCA TGACCGGCGT CGTGACCTAC ATGCTGGCCT TCGACAAGAT TCCCGAACTC
GATGCCCAGG GTCGGCCCAA GATGTTCTAC GGCCAGCGCA TCCACGACAA ATGCTACCGC
CGCCCCCACT TCGACGCCGG CCAGTTCGTC GAGCAATGGG ACGACGAGGC CGCGCGCAAG
GGCTACTGTC TGTACAAGGT CGGCTGCAAG GGGCCGACCA CCTACAACGC GTGCTCGACG
GTACGCTGGA ACAACGGCGT CTCCTTCCCG ATCCAGTCCG GCCACGGCTG CATCGGCTGT
TCCGAGGAGA ATTTCTGGGA CAAGGGCTCG TTCTACGACC GCGTCACCGA ACTCAACGTG
TTCGGCGTCG AGGCCAATGC CGACAAGGTC GGGCTGGTCG CCGCCGGCGC CGTGGGCGCA
GGCATCGCGG CGCATGCCGC CATCTCGATC GCCAAGAAGA AAGACCACGA GAAAGAAACT
CAAGAATAA
 
Protein sequence
MATTARPDTF YDVMRRQGVT RRSFLKFCGL TASALALGPE FIGTIAHAME TKPRTPVLWL 
HGLECTCCSE SFIRSAHPLA KDVVLSMLSL DYDDTLMAAA GFQAEAMLED TMQKYKGRYI
LAVEGNPPLN EDGMFCIVGG KPFIERLRYA AKDAAAVIAW GSCASNGCVQ AARPNPTQAT
PIHKVITDKP IIKVPGCPPI AEVMTGVVTY MLAFDKIPEL DAQGRPKMFY GQRIHDKCYR
RPHFDAGQFV EQWDDEAARK GYCLYKVGCK GPTTYNACST VRWNNGVSFP IQSGHGCIGC
SEENFWDKGS FYDRVTELNV FGVEANADKV GLVAAGAVGA GIAAHAAISI AKKKDHEKET
QE