Gene GM21_0329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0329 
Symbol 
ID8135636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp409832 
End bp410962 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content64% 
IMG OID644867946 
Producthomoserine O-acetyltransferase 
Protein accessionYP_003020168 
Protein GI253698979 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00000000115964 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGCGCA TCATTTACGA GAGAAACATG TCCTACGGCA TAGTGACAGA ACAGATCGCC 
ACCTTCGATA CGGAACTGCG CCTGGAAAGC GGGCGTATCC TGGGGCCGAT CGACATAGCC
TACGAGACCT ACGGCACCCT GAACGAGTCG CGCTCCAACG CCATCCTGGT GACCCATGCC
TGGACCGGCA GCGCGCATCT GGCCGGGCGC TACAGCGAGG ACGAGAAGCG GGCGGGTTGG
TGGGACGAGA TCGTCGGCCC CGGCTGTCTT TTGGACACCG ACCGCTACTT CGTGATCTGC
TCCAACGTGA TCGGCTCCTG CTTCGGCTCC ACCGGCCCCA CCTCGATCAA CCCGAAGACC
GGCAAACGCT ACAACCTGGC CTTCCCGGTG ATCACGGTGC GCGACATGGT GAAGGCACAG
GCCCTGCTCA TGGACCGGTT GGGGATCGAG AAGCTTCACT GCGTGCTGGG TGGGAGCATG
GGGGGGATGC AGGCACTGGA GTGGGCGACC CAGTTCCCGG AGCGGGTCGG CTCGGCCGTG
GTGCTCGCCA CGACGCCGCG CCCCTCGGCG CAGGCGATCT CGCTCAACGC CGTGGCGCGC
TGGGCCATCT TCAACGACCC TAACTGGAAA AAAGGGGAAT ACCGGAAAAA TCCCAAGGAC
GGCCTGGCTT TAGCCCGCGG CATCGGGCAC ATCACCTTTC TCTCGGACGA GTCGATGACG
GCCAAGTTCG ACCGCCGTTT CTCCGCCCGC GACGGGCAGT TCGACTTCTT CGGGCAGTTC
GAGGTGGAGC GCTACCTGAC CTACAACGGC TACAACTTCG TCGACCGCTT CGACGCCAAC
TCCTTCCTCT ACCTCGCCAA GGCGCTCGAT CTCTACGACG TCGCCTCGGG GTGCGAATCG
CTGGAGGAAG CCTTCGCGCC GGTGACCGCC CCGATACAGT TCTTCGCCTT CACCTCGGAT
TGGCTCTACC CGCCGGCCCA GACCGAGGAG ATGGTCGCGT CGCTAAAGAA GCTGGGAAAA
GAGGTGGAGT ACCACCTGAT CACCTCGGCC TACGGCCACG ACGCCTTCCT GCTGGAGCAC
CAGACCTTCA CCCCGCTGGT CGAGTCGTTC CTGGACCGGG TCGGGGTTTA G
 
Protein sequence
MRRIIYERNM SYGIVTEQIA TFDTELRLES GRILGPIDIA YETYGTLNES RSNAILVTHA 
WTGSAHLAGR YSEDEKRAGW WDEIVGPGCL LDTDRYFVIC SNVIGSCFGS TGPTSINPKT
GKRYNLAFPV ITVRDMVKAQ ALLMDRLGIE KLHCVLGGSM GGMQALEWAT QFPERVGSAV
VLATTPRPSA QAISLNAVAR WAIFNDPNWK KGEYRKNPKD GLALARGIGH ITFLSDESMT
AKFDRRFSAR DGQFDFFGQF EVERYLTYNG YNFVDRFDAN SFLYLAKALD LYDVASGCES
LEEAFAPVTA PIQFFAFTSD WLYPPAQTEE MVASLKKLGK EVEYHLITSA YGHDAFLLEH
QTFTPLVESF LDRVGV