Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0329 |
Symbol | |
ID | 8135636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 409832 |
End bp | 410962 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644867946 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_003020168 |
Protein GI | 253698979 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.00000000115964 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGCGCA TCATTTACGA GAGAAACATG TCCTACGGCA TAGTGACAGA ACAGATCGCC ACCTTCGATA CGGAACTGCG CCTGGAAAGC GGGCGTATCC TGGGGCCGAT CGACATAGCC TACGAGACCT ACGGCACCCT GAACGAGTCG CGCTCCAACG CCATCCTGGT GACCCATGCC TGGACCGGCA GCGCGCATCT GGCCGGGCGC TACAGCGAGG ACGAGAAGCG GGCGGGTTGG TGGGACGAGA TCGTCGGCCC CGGCTGTCTT TTGGACACCG ACCGCTACTT CGTGATCTGC TCCAACGTGA TCGGCTCCTG CTTCGGCTCC ACCGGCCCCA CCTCGATCAA CCCGAAGACC GGCAAACGCT ACAACCTGGC CTTCCCGGTG ATCACGGTGC GCGACATGGT GAAGGCACAG GCCCTGCTCA TGGACCGGTT GGGGATCGAG AAGCTTCACT GCGTGCTGGG TGGGAGCATG GGGGGGATGC AGGCACTGGA GTGGGCGACC CAGTTCCCGG AGCGGGTCGG CTCGGCCGTG GTGCTCGCCA CGACGCCGCG CCCCTCGGCG CAGGCGATCT CGCTCAACGC CGTGGCGCGC TGGGCCATCT TCAACGACCC TAACTGGAAA AAAGGGGAAT ACCGGAAAAA TCCCAAGGAC GGCCTGGCTT TAGCCCGCGG CATCGGGCAC ATCACCTTTC TCTCGGACGA GTCGATGACG GCCAAGTTCG ACCGCCGTTT CTCCGCCCGC GACGGGCAGT TCGACTTCTT CGGGCAGTTC GAGGTGGAGC GCTACCTGAC CTACAACGGC TACAACTTCG TCGACCGCTT CGACGCCAAC TCCTTCCTCT ACCTCGCCAA GGCGCTCGAT CTCTACGACG TCGCCTCGGG GTGCGAATCG CTGGAGGAAG CCTTCGCGCC GGTGACCGCC CCGATACAGT TCTTCGCCTT CACCTCGGAT TGGCTCTACC CGCCGGCCCA GACCGAGGAG ATGGTCGCGT CGCTAAAGAA GCTGGGAAAA GAGGTGGAGT ACCACCTGAT CACCTCGGCC TACGGCCACG ACGCCTTCCT GCTGGAGCAC CAGACCTTCA CCCCGCTGGT CGAGTCGTTC CTGGACCGGG TCGGGGTTTA G
|
Protein sequence | MRRIIYERNM SYGIVTEQIA TFDTELRLES GRILGPIDIA YETYGTLNES RSNAILVTHA WTGSAHLAGR YSEDEKRAGW WDEIVGPGCL LDTDRYFVIC SNVIGSCFGS TGPTSINPKT GKRYNLAFPV ITVRDMVKAQ ALLMDRLGIE KLHCVLGGSM GGMQALEWAT QFPERVGSAV VLATTPRPSA QAISLNAVAR WAIFNDPNWK KGEYRKNPKD GLALARGIGH ITFLSDESMT AKFDRRFSAR DGQFDFFGQF EVERYLTYNG YNFVDRFDAN SFLYLAKALD LYDVASGCES LEEAFAPVTA PIQFFAFTSD WLYPPAQTEE MVASLKKLGK EVEYHLITSA YGHDAFLLEH QTFTPLVESF LDRVGV
|
| |