Gene GM21_2914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2914 
Symbol 
ID8138257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3392487 
End bp3393671 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content69% 
IMG OID644870512 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_003022701 
Protein GI253701512 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value0.188453 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGATC TTCAGCAGTG GGCAGATGGC GATACCTTCG GCGATTTGCT GAAACGGCGG 
GGGGTAACCA GGCGGGAATT CCTCTGTTTT TGCGGCAAGA TGGCTGCGCT CATAGGCGCC
GGGGGGGCCT TCGCGGGAAG CCGGACGGCT TTCGCGCAGG AACTCGCGGG GCGGCTGGAG
GGGGCGCGGC GCCCGAGCGT GGTTTACCTG CAGTTGCAGG AGTGTACCGG CTGCCTGGAA
AGCCTGCTCC GCTCCGCGAG CACCCCGGTT GAAGAACTGG TCCTGGAGCA GATCTCGCTC
GACTACAACG AGCTCCTCAT GGCTCCCTCG GGAGAGGCGG CCGAACAGGC GCTCGCCGCG
GCACAAGGGA AGCCGCACCT CCTCCTGGTG AACGGCTCGG TGCCGCTCAA GGACGGCGGG
GTCTATTGCA CCATCGGCGG CCGCTCCGCG CGCGACGTCC TGGAGCGCGC CGCGGCCAAT
GCCACCGCGG TTGTCGCCAT AGGGGCCTGC GCCGAGTACG GCTGCGTCCA GGCCGCAGCA
CCCAACCCCA CCGGCGCCGT CGGGGTGGCC GACGTGATCA GGGACCGACC GGTGGTGAAC
GTGAGCGGCT GCCCCCCCAT CGCCGAGACC ATCAGCGCCA CCCTCACCTA CTACCTGGCC
TACGGCCGCA CCCCGCCTCT GGACGGGCTG GGGCGCCCGC TCTTCGCCTA CGGCCAGCGC
ATCCACGACA AGTGTCCCCG CCGCGCGAGT TTCGACGCCG GACAGTTCGC TGAGCGCTTC
GACGACCAAA ACGCCCGCCT GGGGCACTGC CTCTACCGGC TGGGGTGCAA GGGGCCGGCC
ACCTTCGCCC CCTGCGCCAC CATCGAATGG AACGACGGGT TGAGCTTTCC GATCAAGGCG
GGGCACCCCT GCCTTGGCTG CACCGAGCGC CATTTCTACG ACCGCATGAC CCCGTTCTAC
CGGCGCCTCC CCGGCATCGT GGTCCCGGGG CTCGGGGTGG AAGCGACCGC CAACACCATA
GGCGTTGCGG CCGTAGCCGC CTCGGTCGCC GCGGTCGCGG TCCACTCCGC GGCGACCGTG
ATAGCGAAGC ACCGGGCGCG CCGGGCCGAG CCGGAAAGCC TGCCGCTGGC GGTATTGGGA
GACAGGAAGG AAGCTGACGA GAAGGAAGAG AAAAAGGATT CCTGA
 
Protein sequence
MKDLQQWADG DTFGDLLKRR GVTRREFLCF CGKMAALIGA GGAFAGSRTA FAQELAGRLE 
GARRPSVVYL QLQECTGCLE SLLRSASTPV EELVLEQISL DYNELLMAPS GEAAEQALAA
AQGKPHLLLV NGSVPLKDGG VYCTIGGRSA RDVLERAAAN ATAVVAIGAC AEYGCVQAAA
PNPTGAVGVA DVIRDRPVVN VSGCPPIAET ISATLTYYLA YGRTPPLDGL GRPLFAYGQR
IHDKCPRRAS FDAGQFAERF DDQNARLGHC LYRLGCKGPA TFAPCATIEW NDGLSFPIKA
GHPCLGCTER HFYDRMTPFY RRLPGIVVPG LGVEATANTI GVAAVAASVA AVAVHSAATV
IAKHRARRAE PESLPLAVLG DRKEADEKEE KKDS