Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1123 |
Symbol | |
ID | 8136445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1314887 |
End bp | 1315993 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644868734 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_003020942 |
Protein GI | 253699753 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 149 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAG AGATGTTCTG CGAGGGGGTA ACCCGGAGGA GTTTCATGAA GGCCTGCGTC ACCGCGACGG CCATGATGGG TCTTCCCTTC GCCATGCATA CGCAGGTTGC CGAGGCGATC GAGAAAAACG GCAACCCCGT TGTGATCTGG CTCCATTTCC AGGAGTGCAC CGGCTGCTCC GAATCGCTTC TGCGCTCGAG CCATCCGACT ATTTCCACGC TGATCCTGGA CCTGATCTCG CTGGACTATC ACGAGACGCT GATGGCAGGT GCCGGGCACC AGGCCGAGAA GTCGCTGCAT GACTCGATGA AGGCGAATCA GGGGAAGTAC ATCCTGGTGG TCGAGGGCGC TATACCGACC AAGCAAAACG GGATCTTCTG CAAGGTTTCC GGCAAGACCG CCATGGAGTC GCTGCAGGAA GCGGCCAAGG GGGCTGCCGC CATCATCAGC ATCGGGACCT GTGCCTCCTA CGGCGGCATC CAGTCCGTGT CCCCGAACCC TACCGGCGCC GTCGGCGTCA GGGACATCGT GAAGGACAAG CCGATCATCA ACATCCCCGG CTGCCCTCCC AACCCCTACA ACTTCCTTTC CACCGTCCTC TATTACCTGA CCTTCAAGAA GCTTCCCGAG CTCGACGCGC TGGGGCGCCC GAAGTTCGCC TACGGCAGGA AAATCCACGA ACATTGCGAA AGGCGTCCCC ACTTCGACGC CGGCCGATTC GCCATGGCCT ACGGAGACCC GACCCACGCC CAGGGGTACT GCCTGTACAA GCTGGGGTGC AAGGGTCCCG CGACCAGCGC CAACTGTTCC GTGCAGCGCT TCAACGACGT GGGGGCGTGG CCGGTCTCCA TCGGGCATCC CTGCATCGGC TGCACCGAGC CGGACATCCT CTTCAAGACT GCCATCGCCG AGAAGGTGCA GATCCACGAG CCCACCCCGT TCGACAGCTA CGCGCCGGTC GACCTGAAGG AAAAGGGGAA GGGTCCCGAC CCGCTCACCA CCGGCTTCGT GGGGCTCGCC GCCGGCGCGG CACTCGGAGC AGGCGCCATG CTGGCCAAGA AGCTTCCGGG CAACACGCAG GAGGGCGGCG ATGAAAAATC CGAGTAG
|
Protein sequence | MKEEMFCEGV TRRSFMKACV TATAMMGLPF AMHTQVAEAI EKNGNPVVIW LHFQECTGCS ESLLRSSHPT ISTLILDLIS LDYHETLMAG AGHQAEKSLH DSMKANQGKY ILVVEGAIPT KQNGIFCKVS GKTAMESLQE AAKGAAAIIS IGTCASYGGI QSVSPNPTGA VGVRDIVKDK PIINIPGCPP NPYNFLSTVL YYLTFKKLPE LDALGRPKFA YGRKIHEHCE RRPHFDAGRF AMAYGDPTHA QGYCLYKLGC KGPATSANCS VQRFNDVGAW PVSIGHPCIG CTEPDILFKT AIAEKVQIHE PTPFDSYAPV DLKEKGKGPD PLTTGFVGLA AGAALGAGAM LAKKLPGNTQ EGGDEKSE
|
| |