Gene Gdia_3134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3134 
SymbolmetX 
ID6976568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3431439 
End bp3432602 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content68% 
IMG OID643392642 
Producthomoserine O-acetyltransferase 
Protein accessionYP_002277479 
Protein GI209545250 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.289102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.106399 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGA CCCTTCCCAC CCCGCTGGAA CACGACCATC TGCTGTTTCC CGAAGGTTTG 
GCGCTGGAAT GCGGATTCCG CCTGGCGCCG GTGCGGGTCG CCTACCGGAC CTACGGCACC
CTGTCGGCGG CGCGCGATAA CGCGATCGTC GTCTGCCATG CCCTGACGGG CGACCAGTAC
CTGGCCGATA CCCAGCCCCT GACCGGCAAG CCCGGCTGGT GGAGCCGCAT GGTGGGGCCC
GGGTTGCCGA TCGACACCGA CCGGTTCTTC GTCATCTGCA TGAACGTGCT GGGCGGGTGC
ATGGGCTCGA CCGGGCCGCG GTCCTCGCGC ACCGGAATGG AAGGCGAGGG GGCGGAGCCG
TGGGGCACCG ATTTTCCGCC GATTACCATC CGCGACATGG TCCGCGCGCA GAAGCTGGTC
GTCGACCATC TGGGCATCCG GCGGCTGTTC GCCGTCGTCG GCGGGTCGAT GGGCGGGATG
CAGGTGCTGG AATGGGCCGC GACCTTCCCC GACATGGTGT TCGCGGCGAT GCCGATCGCG
ACCTCGCCGT TCCATTCGGC CCAGAACATC GCGTTCAACG AGGTCAGCCG CCAGGCCATC
TTCGCCGATC CCGACTGGCA TGGTGGCCGC TACTGGGAAC GCGAGGCCGT CCCGGCGCGG
GGGCTGGCGG TCGCGCGGAT GATGGCGCAC ATCACCTATC TGTCCGAAGA GGCGCTGACG
CGGAAATTCG GCCGGCGGGT GCGGCGCGAC CCGTACGGTC CGGCCAACCC GCTGTCCCTG
TTCGGCGAGA TGTTCGAGGT CGAGAGCTAT CTGCGGCACC AGGGCTCGTC CTTCGTGCGG
CGCTTCGACG CCAATTCCTA CCTGACCATC ACGCGGGCCA TGGATTATTT CGACCTGGGA
GCCGATCATG ACGGCGACCT GTCGCGGCCG TTCCAGGGAA CGCGCACGCG TTTCTGCATC
GTCTCGTTCT CGTCCGACTG GCTGTTCCCG ACCTCGCAGG CGCGGCTGCT GGCGCGCGCG
CTGAACCGCG CCGCCGCCAA CGTGTCGTTC GTCGAGATCG AGAGCGACAA GGGCCATGAC
GCCTTCCTGC TGGACGAGCC GGATTTCGAT CGCACGGTGC GCGGCTTCCT GTCCGGCGCC
GCCGAACATG CGCGGATCGG CTGA
 
Protein sequence
MDQTLPTPLE HDHLLFPEGL ALECGFRLAP VRVAYRTYGT LSAARDNAIV VCHALTGDQY 
LADTQPLTGK PGWWSRMVGP GLPIDTDRFF VICMNVLGGC MGSTGPRSSR TGMEGEGAEP
WGTDFPPITI RDMVRAQKLV VDHLGIRRLF AVVGGSMGGM QVLEWAATFP DMVFAAMPIA
TSPFHSAQNI AFNEVSRQAI FADPDWHGGR YWEREAVPAR GLAVARMMAH ITYLSEEALT
RKFGRRVRRD PYGPANPLSL FGEMFEVESY LRHQGSSFVR RFDANSYLTI TRAMDYFDLG
ADHDGDLSRP FQGTRTRFCI VSFSSDWLFP TSQARLLARA LNRAAANVSF VEIESDKGHD
AFLLDEPDFD RTVRGFLSGA AEHARIG