Gene Dole_0752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0752 
SymbolmetX 
ID5693587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp875780 
End bp876955 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content60% 
IMG OID641263349 
Producthomoserine O-acetyltransferase 
Protein accessionYP_001528639 
Protein GI158520769 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00772131 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAT ATATCGAACA CGATAAAAGC GGTGTGTCCG TGGGGCTGGT GGAAAAACAG 
TTTTTCACCT TTGCCGAACC CCCGAACCCG ATGAAACTGG ACAGCGGCGC CGTTCTGGGG
CCGGTGACCA TCGCCTATGA GACCTATGGC CGACTGAACG AAGACGCAAG TAACGCCGTG
CTGGTGGCCC ACGCCCTTAC CGGCGATTCC CACGCCGCCG GCTGCTACAG CAAAACCGAC
CCCAAGCCCG GCTGGTGGGA CATCATGGTC GGCCCGGGCA AGGGGATTGA CACCAATAAA
TACTTTGTGA TCTGCTCCAA CGTTCTCGGC GGGTGTATGG GCTCCACCGG GCCGTCATCG
GTGAACCCGA CCACAAAAAA ACCCTACGGC GCAAGCTTTC CGGTGATCAC CATCGGTGAC
ATGGTCCGGG CACAGAAGGC CCTGACAGAC CACCTGGGCG TCAAACGGCT CCTGGCCGTG
GTAGGCGGCT CCATGGGCGG CATGCAGGTA ATGGAGTGGT GCGTCCGCTA CCCGGAGATG
GTAACGTCGG CCATTCCCCT GGCCACCACC ACCCGCCATT CGGCCCTGGC CATTGCCTTT
AACGAGGTGG CCCGGCAGGC CATCATGACC GACCCCAACT GGAGCAGCGG TGATTACTAC
GGTGGCAACA AGCCGGCCAT GGGCCTGGCC GTGGCCCGCA TGATCGGCCA CATCACCTAT
CTTTCGGATG AGGCCATGCG CCAGAAGTTC GGCCGCCGGC TGCAGGACAA GGCAGCGGTC
TCTTTTGATT TCGGCGCCGA CTTCCAGGTG GAGAGCTACC TGCGCCACCA GGGCGCCAAG
TTTGTGGAGC GGTTTGACGC CAACACCTTT CTTTACATCA CCAAGGCCGC CGACTACTTT
GACCTGGAGG CCCAGCACGG GAACGGATCA GCGGTGGAGG CCTTTTCAAA GGCCCGTGCC
CGGTTCCTGG TGGTCTCCTT TACATCGGAC TGGCTTTACC CCACCTACCA GTCCCGCGCC
ATGGTCACGG CCATGAAGAA AAACGCCCTG GATGTCAGCT TCTGCGAAAT CGAAGCCGAC
TGCGGCCATG ACGCGTTTCT GATTCCCAAC CCGCGCCTGA GCGCCCTGAT TAAAGGATTT
TTAGAAAGTG TATCCACCGG ACAACAGCAC CCATAA
 
Protein sequence
MSEYIEHDKS GVSVGLVEKQ FFTFAEPPNP MKLDSGAVLG PVTIAYETYG RLNEDASNAV 
LVAHALTGDS HAAGCYSKTD PKPGWWDIMV GPGKGIDTNK YFVICSNVLG GCMGSTGPSS
VNPTTKKPYG ASFPVITIGD MVRAQKALTD HLGVKRLLAV VGGSMGGMQV MEWCVRYPEM
VTSAIPLATT TRHSALAIAF NEVARQAIMT DPNWSSGDYY GGNKPAMGLA VARMIGHITY
LSDEAMRQKF GRRLQDKAAV SFDFGADFQV ESYLRHQGAK FVERFDANTF LYITKAADYF
DLEAQHGNGS AVEAFSKARA RFLVVSFTSD WLYPTYQSRA MVTAMKKNAL DVSFCEIEAD
CGHDAFLIPN PRLSALIKGF LESVSTGQQH P