Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1231 |
Symbol | |
ID | 8136556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1436997 |
End bp | 1438307 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868845 |
Product | homoserine dehydrogenase |
Protein accession | YP_003021050 |
Protein GI | 253699861 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.000207213 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGAGA TAAAGATTGG CCTTCTCGGT TTCGGCACCA TCGGTGCCGG TGTGGTAAAG CTTCTTGCCG CGAACAGCGC GCTTTTGACG GAGAAAACCG GCACCCGGAT TTCCCTGAAG AGCATCGTCG ACCTCGACAT CACCACCGAC CGCGGCGTCA GCACCGAAGG GATCATTCTC ACCCGCAACG CCGAAGAAGT GCTGACCGAT CCGGAAATCT CCGTAATCAT CGAGCTGATC GGCGGCTACG AACCCGCCCG CAGCTTCGTC CTTAAGGCCA TCTCTCACGG CAAGCACGTG GTAACCGCCA ATAAGGCACT GCTGGCTGTT CACGGCGAGG AAATATACGC TGCCGCCAAG GCAAACGGCG TCGAAATCCT TTTCGAGGCT GCCGTGGGCG GGGGCATCCC GGTCCTTTCC GCCATCAAGG GGAACCTGGC GGGGAACCGT TTCTCGACCG TGCTCGGCAT CGTGAACGGC ACCTGCAACT ACATCCTGAC CCGGATGACC CATGATGGAG CGGACTTCTC CGAAGTGCTG AAGAGCGCCC AGGAATTAGG CTACGCCGAA GCCGACCCCA CCTTCGATAT AGAGGGCGTC GACACGGCGC ACAAACTCTG CCTGCTGCTC TCCCTTTGTT TCGGAACCAA GATCGACCTG AAAGACATCT ATTCCGAAGG GATCACCTCT ATTTCCTCGG AAGACGTCGA CTTCGCCAAA TCCTTCGGCT ATCGCATCAA GCTTTTGGCC ATAGGGAAGA TGGATGGCGG GCAGATCGAG GCGCGCGTGC ATCCGACCAT GATCCCGATC GACTACCCGC TGGCCGACGT GCACGGCGTG TTCAACGCCA TCAGGCTCAC CGGCGACTTC ATCGGCCCGG TGATGTTCTA TGGCAGGGGC GCGGGGATGG ACGCAACCGC GAGCGCCGTA GTAGGCGACG TCATCGACCT TTCCCGGAGC ATGGGGGCCG GCATCTCCCG CCGCTGCGCG CCGCTTGGGT ACCTCGACGA GAAAGTCAGC ACCCTGCCGA TCAAGCCCAT GGGCGAGATC GTGAGCAAGT ACTACATCCG TTTCCAGGCG CTGGATCGTC CCGGCGTGCT GGCGCGCATT GCTGGAGCCC TGGGGGCCAG CGGCATCAGC ATCGCCTCCA TGCTGCAGAG CGCCAGAAGC GCGAGCGAAA TAGTTCCCAT CGTCATCATG ACCCACGAGG CGCGCGAAGC AGACGTGCGC CGGGCGCTTG CCGAGATCGA CACCTACGAG GTCATCCGCG GCAAGAGCAC CTTCATCAGG ATCGAGGACA ACCTCGAATA A
|
Protein sequence | MKEIKIGLLG FGTIGAGVVK LLAANSALLT EKTGTRISLK SIVDLDITTD RGVSTEGIIL TRNAEEVLTD PEISVIIELI GGYEPARSFV LKAISHGKHV VTANKALLAV HGEEIYAAAK ANGVEILFEA AVGGGIPVLS AIKGNLAGNR FSTVLGIVNG TCNYILTRMT HDGADFSEVL KSAQELGYAE ADPTFDIEGV DTAHKLCLLL SLCFGTKIDL KDIYSEGITS ISSEDVDFAK SFGYRIKLLA IGKMDGGQIE ARVHPTMIPI DYPLADVHGV FNAIRLTGDF IGPVMFYGRG AGMDATASAV VGDVIDLSRS MGAGISRRCA PLGYLDEKVS TLPIKPMGEI VSKYYIRFQA LDRPGVLARI AGALGASGIS IASMLQSARS ASEIVPIVIM THEAREADVR RALAEIDTYE VIRGKSTFIR IEDNLE
|
| |