Gene GM21_1123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1123 
Symbol 
ID8136445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1314887 
End bp1315993 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID644868734 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_003020942 
Protein GI253699753 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones149 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAAG AGATGTTCTG CGAGGGGGTA ACCCGGAGGA GTTTCATGAA GGCCTGCGTC 
ACCGCGACGG CCATGATGGG TCTTCCCTTC GCCATGCATA CGCAGGTTGC CGAGGCGATC
GAGAAAAACG GCAACCCCGT TGTGATCTGG CTCCATTTCC AGGAGTGCAC CGGCTGCTCC
GAATCGCTTC TGCGCTCGAG CCATCCGACT ATTTCCACGC TGATCCTGGA CCTGATCTCG
CTGGACTATC ACGAGACGCT GATGGCAGGT GCCGGGCACC AGGCCGAGAA GTCGCTGCAT
GACTCGATGA AGGCGAATCA GGGGAAGTAC ATCCTGGTGG TCGAGGGCGC TATACCGACC
AAGCAAAACG GGATCTTCTG CAAGGTTTCC GGCAAGACCG CCATGGAGTC GCTGCAGGAA
GCGGCCAAGG GGGCTGCCGC CATCATCAGC ATCGGGACCT GTGCCTCCTA CGGCGGCATC
CAGTCCGTGT CCCCGAACCC TACCGGCGCC GTCGGCGTCA GGGACATCGT GAAGGACAAG
CCGATCATCA ACATCCCCGG CTGCCCTCCC AACCCCTACA ACTTCCTTTC CACCGTCCTC
TATTACCTGA CCTTCAAGAA GCTTCCCGAG CTCGACGCGC TGGGGCGCCC GAAGTTCGCC
TACGGCAGGA AAATCCACGA ACATTGCGAA AGGCGTCCCC ACTTCGACGC CGGCCGATTC
GCCATGGCCT ACGGAGACCC GACCCACGCC CAGGGGTACT GCCTGTACAA GCTGGGGTGC
AAGGGTCCCG CGACCAGCGC CAACTGTTCC GTGCAGCGCT TCAACGACGT GGGGGCGTGG
CCGGTCTCCA TCGGGCATCC CTGCATCGGC TGCACCGAGC CGGACATCCT CTTCAAGACT
GCCATCGCCG AGAAGGTGCA GATCCACGAG CCCACCCCGT TCGACAGCTA CGCGCCGGTC
GACCTGAAGG AAAAGGGGAA GGGTCCCGAC CCGCTCACCA CCGGCTTCGT GGGGCTCGCC
GCCGGCGCGG CACTCGGAGC AGGCGCCATG CTGGCCAAGA AGCTTCCGGG CAACACGCAG
GAGGGCGGCG ATGAAAAATC CGAGTAG
 
Protein sequence
MKEEMFCEGV TRRSFMKACV TATAMMGLPF AMHTQVAEAI EKNGNPVVIW LHFQECTGCS 
ESLLRSSHPT ISTLILDLIS LDYHETLMAG AGHQAEKSLH DSMKANQGKY ILVVEGAIPT
KQNGIFCKVS GKTAMESLQE AAKGAAAIIS IGTCASYGGI QSVSPNPTGA VGVRDIVKDK
PIINIPGCPP NPYNFLSTVL YYLTFKKLPE LDALGRPKFA YGRKIHEHCE RRPHFDAGRF
AMAYGDPTHA QGYCLYKLGC KGPATSANCS VQRFNDVGAW PVSIGHPCIG CTEPDILFKT
AIAEKVQIHE PTPFDSYAPV DLKEKGKGPD PLTTGFVGLA AGAALGAGAM LAKKLPGNTQ
EGGDEKSE