Gene Moth_1308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1308 
Symbol 
ID3831794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1351668 
End bp1352792 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content58% 
IMG OID637829244 
Producthomoserine O-acetyltransferase 
Protein accessionYP_430164 
Protein GI83590155 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00994406 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.346812 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGAG TCGGTATTGT AACAACCAGG TTTTATGAAT GGTCGCAATG CCTCCACCTG 
GAAAGCGGGG CCCAGCTGGG CTCCCTTACC ATAGCCTATG AGACCTATGG GGAACTGAAC
GCGGCCGGAA ATAACGCCAT CCTGGTCCTC CACGCCCTTA CCGGTAATGC CCATATCGCC
GGCCGAAATT TTCCGGACGA GAGGTATCCC GGCTGGTGGG ACCCCCTGGT AGGGCCGGGC
CGGGCCCTGG ACACCAGGCG CTATTTCATT GTCTGTGCCA ACGTCCTGGG AAGCTGCTAT
GGTACCACCG GGCCGGCCAG CATTAATCCA GCCACCGGCA AGCCCTACGG GATGGATTTT
CCGGCCATCA CTATCCGCGA TATGGTACGG GCACAAAAAA TCCTCCTTGA CTATCTGGGG
GTCAAGCGCC TGGTGGCGGC CATCGGTGGT TCCATGGGCG GGATGCAGGT CCTGGAGTGG
GGTTTTCTTT ATCCTCAGAT GCTGGACGCC ATTATTCCCA TTGCCACCTG CGGCCGGACT
ACTCCCATGC AGATTGCCTT TCACCACGTG CAGCGGGAAG CCATTTACGC CGACCCCGAC
TGGCAGGGAG GCAATTATTA CGGCACTGCC GGGCCCCGGC GGGGACTGGC CCTGGCCCGG
CAGATCGGGA TTATTACTTA TAAAAGCGAC CCCTCCTGGA ACATGAAATT TGGCCGCAAC
CTGGTGGACC CCCGGAAATA CTTCCAACTG GAAGGGCAGT TCGAAGTAGA GAGCTACCTG
GCCTACCAGG GGAGGAAGCT GGTAGATCGT TTCGACGCCA ACTCTTACCT GTACCTTACC
AAAGCAGTAG ACCTCCACGA TGTGAGCCAG GGACGGGGAA GCTATAATGA AGTCTGGCGG
GATTTCCCCT GCCCCTGCCT GGGTATAGGC ATATCAAGCG ATTTTCTTTT CCCTCCCTAT
CAGGTGCAGG AGATTGTCCG GATGATTAAC GACGGCGGCG GCCATGCCCG TTACGCAGAG
ATTGATTCCC CCTATGGCCA CGACGCCTTT TTAATCGAGT TTAACCAGCT GGCAGCCATT
ATCCAGCCGT TTCTGAAAGA GTTGCGCCCG GACCTGGCCG CTTGA
 
Protein sequence
MDGVGIVTTR FYEWSQCLHL ESGAQLGSLT IAYETYGELN AAGNNAILVL HALTGNAHIA 
GRNFPDERYP GWWDPLVGPG RALDTRRYFI VCANVLGSCY GTTGPASINP ATGKPYGMDF
PAITIRDMVR AQKILLDYLG VKRLVAAIGG SMGGMQVLEW GFLYPQMLDA IIPIATCGRT
TPMQIAFHHV QREAIYADPD WQGGNYYGTA GPRRGLALAR QIGIITYKSD PSWNMKFGRN
LVDPRKYFQL EGQFEVESYL AYQGRKLVDR FDANSYLYLT KAVDLHDVSQ GRGSYNEVWR
DFPCPCLGIG ISSDFLFPPY QVQEIVRMIN DGGGHARYAE IDSPYGHDAF LIEFNQLAAI
IQPFLKELRP DLAA