Gene Amir_6239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_6239 
Symbol 
ID8330450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp7318588 
End bp7319619 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content77% 
IMG OID644946670 
Producthomoserine O-acetyltransferase 
Protein accessionYP_003103889 
Protein GI256380229 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGGGT GGCGCGACGG CGATCCGGCC GGTGGGAGGA AGTGGCACCG GGGCGCGCTG 
CCCGGTCTGC CGTTCCGGCT CGCCTACGAG ACCTGGGGCG AGCCTGACGA CGACCGGTCC
AACGCGGTGC TCGTGCTGCA CGCCCTCACC GGGGACAGCC ACGTCGCCGG ACCCGCCGGT
CCCGGCCACC CGACCGCCGG GTGGTGGGAC GGGCTGGTGG GGCCCGGCCT GGCGCTGGAC
ACCGACCGGT GGTTCGTCGT CGCGCCCAAC GCGCTGGGCG GCTGCCAGGG CAGCACCGGC
CCGTGGGACA CCGCGCCCGA CGGGCGGCCC TGGGGCGAGC GGTTCCCCGC CGTCGGCATC
CGCGACCAGG TGCGGGCCGA GCTGGGGCTG GCCGACGCGC TCGGCGTGCG CTCGTGGGCG
GCGGTCGTCG GCGGGTCCAT GGGCGGGATG CGGGCGCTGG AGTGGGCGGT GACCGCGCCC
GAGCGGGTGC GGTCGCTGCT GGTGCTCGCG GCCCCCGCCG CGTCCGGCGC CGACCAGATC
GCGCTCGCCT CGGCCCAGCT GCACGCCCTC AAGCTGCACC CGCAGGAGGG GATGGCGGTG
GCCAGGCGGA TCGCCCACCA CGGGTACCGC ACCGCCGCCG AGCTCAACGC CCGGTTCGGG
CGGAGCGTCC AGGGGGACGG GCGGTTCGCC GTCGAGTCCT ACCTGGACCA CCAGGCGGAC
AAGCTGGCCG GGAGGTTCGA CCCCGGCAGC TACCGGGTGC TCACCGAGGC CATGAACGGC
CACGACGTGG GCCGGGGCCG GGGAGGGGTG CGCGCGGCGC TGGGCGCGGT GACCGCGCGC
ACCCTCGTCG CCGGGATCGA CACCGACCGG CTCTACCCGC TGGAGCAGCA GCGGGAGCTG
GCCGAGGCGA TCCCCGCAGC GGGCGACCTG CGCGTCGTGG CCTCGCCGTA CGGCCACGAC
GGGTTCCTCG TCGAGGAGGA GCAGGTCGCC GCGCTGCTGG GGGAACTGCT GCGGGTCAGA
AGCCCGCGGT GA
 
Protein sequence
MTGWRDGDPA GGRKWHRGAL PGLPFRLAYE TWGEPDDDRS NAVLVLHALT GDSHVAGPAG 
PGHPTAGWWD GLVGPGLALD TDRWFVVAPN ALGGCQGSTG PWDTAPDGRP WGERFPAVGI
RDQVRAELGL ADALGVRSWA AVVGGSMGGM RALEWAVTAP ERVRSLLVLA APAASGADQI
ALASAQLHAL KLHPQEGMAV ARRIAHHGYR TAAELNARFG RSVQGDGRFA VESYLDHQAD
KLAGRFDPGS YRVLTEAMNG HDVGRGRGGV RAALGAVTAR TLVAGIDTDR LYPLEQQREL
AEAIPAAGDL RVVASPYGHD GFLVEEEQVA ALLGELLRVR SPR