Gene Information |
Locus tag | GM21_0121 |
Symbol | |
ID | 8135424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 153355 |
End bp | 154215 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644867741 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_003019965 |
Protein GI | 253698776 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
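
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon (both assumptions match the numbers in this record, but the record itself does not state them). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(153355, 154215)
print(length)                     # 861
print(protein_length_aa(length))  # 286
```

With these conventions, 154215 - 153355 + 1 gives the 861 bp gene length, and 861 / 3 - 1 gives the 286 aa protein length listed in the table.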
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 0.984934 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACCG AGGAAGGAAA ACAGTTCCCG CCGCAGCGTC AGGCGCAGCC GGGAAAAGAA GCAGAGATGA CGCCGAGGCC CAAAAGCGGC GAGTTCGAAT ACCGGGGGGC CGGGAAGCTG CAGGGGAAAA CGGCCCTCAT CACCGGCGGC GACAGCGGCA TCGGGCGTGC CGTCGCCATC GCCTTCGCCC GCGAGGGGGC GAACGTCGCT TTCGGATACC TGGAGGAAGA CCAGGACGCG AAAGAGACCC GGGACATCGT GGAGCGGGAG GGGGGGCGCT GCCTCGCCTT CCGCGGCGAC GTGGGTCAGG AGCAGTTCTG CCTCGACATT GTCAAAAAGA CGTTGGAGGC ATTCGGCCGG CTGGACATAG TGGTGAACAA CGCGGCCGAG CAGCATTACC GCGAGGGCAT CGAAGAGATC TCCTCGGAGC AGTTGGAGCG GACCTTCAGG ACCAACATCT TTTCCTATTT CTATCTGGTT AAGGCCGCGC TCAAGCACCT GCAAGAGGGA TCCCGGATCA TCAACACCAC CTCGGTCACC GCCTACAAGG GAAACCCCAA CCTCCTCGAT TACTCCTCCA CCAAGGGGGC CATCGTCGCC TTCACCCGCT CCCTCGCGCT GTCGCTCGCC GACAAGGGGA TCCTGGTGAA CGCCGTCGCC CCCGGTCCCA TCTGGACCCC GCTCATCCCC GGAACCTTCC CGGAGGAAAA GACGGAGCAG TTCGGCGAGA ACGTGCTTTT GAAGCGGGCG GGACAGCCGG TGGAAGTGGC CCACAGCTAC GTCTTCCTCG CCTCCGAAGG AGGCTCCTAC ATGACCGGGC AGGTGCTGCA CCCAAACGGC GGAACAATCG TCGGGGGTTA G
|
Protein sequence | MPTEEGKQFP PQRQAQPGKE AEMTPRPKSG EFEYRGAGKL QGKTALITGG DSGIGRAVAI AFAREGANVA FGYLEEDQDA KETRDIVERE GGRCLAFRGD VGQEQFCLDI VKKTLEAFGR LDIVVNNAAE QHYREGIEEI SSEQLERTFR TNIFSYFYLV KAALKHLQEG SRIINTTSVT AYKGNPNLLD YSSTKGAIVA FTRSLALSLA DKGILVNAVA PGPIWTPLIP GTFPEEKTEQ FGENVLLKRA GQPVEVAHSY VFLASEGGSY MTGQVLHPNG GTIVGG
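
Although the gene lies on the minus strand, the gene sequence above is listed in coding orientation: it begins with ATG and its first codons translate to the start of the protein sequence. A minimal sketch of that translation follows, along with a helper for GC fraction. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ only in permitted start codons), and that the 10-base blocks are purely a display convention. The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional TCAG order; amino acid assignments are
# those of the standard code, assumed here to apply to table 11 as well.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the last four codons of the gene sequence above.
print(translate("ATGCCGACCG AGGAAGGAAA ACAGTTCCCG"))  # MPTEEGKQFP
print(translate("GTCGGGGGTTAG"))                      # VGG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.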