Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2914 |
Symbol | |
ID | 8138257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3392487 |
End bp | 3393671 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644870512 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_003022701 |
Protein GI | 253701512 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 0.188453 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGATC TTCAGCAGTG GGCAGATGGC GATACCTTCG GCGATTTGCT GAAACGGCGG GGGGTAACCA GGCGGGAATT CCTCTGTTTT TGCGGCAAGA TGGCTGCGCT CATAGGCGCC GGGGGGGCCT TCGCGGGAAG CCGGACGGCT TTCGCGCAGG AACTCGCGGG GCGGCTGGAG GGGGCGCGGC GCCCGAGCGT GGTTTACCTG CAGTTGCAGG AGTGTACCGG CTGCCTGGAA AGCCTGCTCC GCTCCGCGAG CACCCCGGTT GAAGAACTGG TCCTGGAGCA GATCTCGCTC GACTACAACG AGCTCCTCAT GGCTCCCTCG GGAGAGGCGG CCGAACAGGC GCTCGCCGCG GCACAAGGGA AGCCGCACCT CCTCCTGGTG AACGGCTCGG TGCCGCTCAA GGACGGCGGG GTCTATTGCA CCATCGGCGG CCGCTCCGCG CGCGACGTCC TGGAGCGCGC CGCGGCCAAT GCCACCGCGG TTGTCGCCAT AGGGGCCTGC GCCGAGTACG GCTGCGTCCA GGCCGCAGCA CCCAACCCCA CCGGCGCCGT CGGGGTGGCC GACGTGATCA GGGACCGACC GGTGGTGAAC GTGAGCGGCT GCCCCCCCAT CGCCGAGACC ATCAGCGCCA CCCTCACCTA CTACCTGGCC TACGGCCGCA CCCCGCCTCT GGACGGGCTG GGGCGCCCGC TCTTCGCCTA CGGCCAGCGC ATCCACGACA AGTGTCCCCG CCGCGCGAGT TTCGACGCCG GACAGTTCGC TGAGCGCTTC GACGACCAAA ACGCCCGCCT GGGGCACTGC CTCTACCGGC TGGGGTGCAA GGGGCCGGCC ACCTTCGCCC CCTGCGCCAC CATCGAATGG AACGACGGGT TGAGCTTTCC GATCAAGGCG GGGCACCCCT GCCTTGGCTG CACCGAGCGC CATTTCTACG ACCGCATGAC CCCGTTCTAC CGGCGCCTCC CCGGCATCGT GGTCCCGGGG CTCGGGGTGG AAGCGACCGC CAACACCATA GGCGTTGCGG CCGTAGCCGC CTCGGTCGCC GCGGTCGCGG TCCACTCCGC GGCGACCGTG ATAGCGAAGC ACCGGGCGCG CCGGGCCGAG CCGGAAAGCC TGCCGCTGGC GGTATTGGGA GACAGGAAGG AAGCTGACGA GAAGGAAGAG AAAAAGGATT CCTGA
|
Protein sequence | MKDLQQWADG DTFGDLLKRR GVTRREFLCF CGKMAALIGA GGAFAGSRTA FAQELAGRLE GARRPSVVYL QLQECTGCLE SLLRSASTPV EELVLEQISL DYNELLMAPS GEAAEQALAA AQGKPHLLLV NGSVPLKDGG VYCTIGGRSA RDVLERAAAN ATAVVAIGAC AEYGCVQAAA PNPTGAVGVA DVIRDRPVVN VSGCPPIAET ISATLTYYLA YGRTPPLDGL GRPLFAYGQR IHDKCPRRAS FDAGQFAERF DDQNARLGHC LYRLGCKGPA TFAPCATIEW NDGLSFPIKA GHPCLGCTER HFYDRMTPFY RRLPGIVVPG LGVEATANTI GVAAVAASVA AVAVHSAATV IAKHRARRAE PESLPLAVLG DRKEADEKEE KKDS
|
| |