Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0166 |
Symbol | |
ID | 3102900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 177012 |
End bp | 178100 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637169389 |
Product | nickel-iron hydrogenase, small subunit |
Protein accession | YP_112703 |
Protein GI | 53802530 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.410491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACGA CCGCACGCCC GGACACGTTC TACGATGTGA TGCGCCGCCA AGGGGTGACC CGGCGCAGTT TTCTCAAGTT CTGCGGCTTG ACCGCATCGG CCCTCGCACT GGGGCCGGAA TTCATCGGCA CGATCGCCCA CGCCATGGAG ACCAAGCCGC GCACGCCGGT GCTGTGGCTG CATGGCCTGG AATGCACCTG CTGTTCCGAA TCCTTCATCC GTTCGGCCCA CCCGCTGGCC AAGGACGTGG TGCTGTCCAT GCTCTCGCTG GACTACGACG ACACCCTCAT GGCGGCCGCC GGCTTCCAGG CGGAAGCCAT GCTGGAAGAC ACCATGCAGA AGTACAAAGG CCGCTACATC CTGGCCGTGG AGGGCAACCC GCCGCTGAAC GAGGACGGCA TGTTCTGCAT CGTCGGCGGC AAACCCTTCA TCGAACGGCT GCGCTATGCC GCCAAGGACG CCGCCGCCGT CATCGCCTGG GGATCCTGCG CCTCCAATGG CTGCGTGCAG GCGGCCCGCC CCAACCCGAC CCAGGCCACG CCGATCCACA AGGTCATCAC CGACAAGCCC ATCATCAAGG TGCCCGGCTG TCCGCCCATC GCCGAGGTCA TGACCGGCGT CGTGACCTAC ATGCTGGCCT TCGACAAGAT TCCCGAACTC GATGCCCAGG GTCGGCCCAA GATGTTCTAC GGCCAGCGCA TCCACGACAA ATGCTACCGC CGCCCCCACT TCGACGCCGG CCAGTTCGTC GAGCAATGGG ACGACGAGGC CGCGCGCAAG GGCTACTGTC TGTACAAGGT CGGCTGCAAG GGGCCGACCA CCTACAACGC GTGCTCGACG GTACGCTGGA ACAACGGCGT CTCCTTCCCG ATCCAGTCCG GCCACGGCTG CATCGGCTGT TCCGAGGAGA ATTTCTGGGA CAAGGGCTCG TTCTACGACC GCGTCACCGA ACTCAACGTG TTCGGCGTCG AGGCCAATGC CGACAAGGTC GGGCTGGTCG CCGCCGGCGC CGTGGGCGCA GGCATCGCGG CGCATGCCGC CATCTCGATC GCCAAGAAGA AAGACCACGA GAAAGAAACT CAAGAATAA
|
Protein sequence | MATTARPDTF YDVMRRQGVT RRSFLKFCGL TASALALGPE FIGTIAHAME TKPRTPVLWL HGLECTCCSE SFIRSAHPLA KDVVLSMLSL DYDDTLMAAA GFQAEAMLED TMQKYKGRYI LAVEGNPPLN EDGMFCIVGG KPFIERLRYA AKDAAAVIAW GSCASNGCVQ AARPNPTQAT PIHKVITDKP IIKVPGCPPI AEVMTGVVTY MLAFDKIPEL DAQGRPKMFY GQRIHDKCYR RPHFDAGQFV EQWDDEAARK GYCLYKVGCK GPTTYNACST VRWNNGVSFP IQSGHGCIGC SEENFWDKGS FYDRVTELNV FGVEANADKV GLVAAGAVGA GIAAHAAISI AKKKDHEKET QE
|
| |