Gene Moth_1309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1309 
Symbol 
ID3831795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1352794 
End bp1354080 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content58% 
IMG OID637829245 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_430165 
Protein GI83590156 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0637456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.233255 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCA ATCACCAGTA TCGTTTTGCC ACCCTGGCCG TTCAAGCCGG CCAGGAGCCC 
GATCCGGCTA CGGGAGCTCG CGCCGTGCCT ATCTACCAGA CTACTTCCTA TGTCTTCCGG
GATGCCGATC ATGCCGCCGC CCTCTTCGGC CTGAGGGAAG AAGGTAATAT ATATACCCGG
ATTATGAATC CCACTACCGA TGTCTTTGAA AAAAGGATGG CTGCCCTGGA GGGCGGTGTC
GGCGCCCTGG CAACGGCTTC CGGCCAGGCG GCCATCACCC TGGCCATAGC TAATATCGCC
ACCGCCGGAC AGGAGATTGT TGCTTCCACC AGCCTTTACG GCGGCACCTT CAGTCTTTTT
AATTCAACCC TGCCCAAGTT TGGTATTAAA ACCCGTTTTG TCGACAGCGC CGAACCGGAG
GATTTCCGGG CTGCCATTAC AGACCGGACC AGGGCCCTTT ATGTGGAAAT CCTGGGCAAT
CCCAAGCTGG ACGTGCCGGA CCTGGAGGCC CTGGCTGCCA TCGCCCACGA GGCTGGTATT
CCTTTAATCG TTGATAACAC CTTCGCCACG CCTTACCTGT GCCGTCCGTT TGAATTCGGA
GCCGATATCG TCGTGCACTC GGCGACCAAG TTTATCGGCG GCCATGGTAC TTCCATTGGC
GGCATAATTG TCGATTCCGG TAAATTCAAT TGGGATAACG GTAAATTTCC CGGACTGGTA
GAACCCGATC CCAGTTATCA CGGCCTCAGT TACGTCCAGT CCTTTGGCCC GGCGGCCTAC
ATCGTCAAGG CGCGGGTCCA GCTCTTGCGG GACCTGGGAC CGGCCTTAAG TCCTTTCAAT
GCCTTCCTTT TCCTGCAGGG ACTGGAAACT CTGCACCTGC GGATGGAGCG CCACGTCCAA
AATGCTACCA GGATCGCCGG CTGGCTGGCA GAGCACCCGG CTGTCGCCTG GGTGAGCTAT
CCGGGCCTAC CCGGCCATCC CTACTACGAA CGGGCCCGAA AATACCTGCC TAAAGGAGCG
GGGGCCATTT TGACCTTTGG TATTAAGGGC GGCAAGGAGG CCGGTAAGAA GTTTATCAAC
AGCGTGAAAC TCTTCTCCCT CCTGGCCAAC GTGGGTGATG CCCATTCCCT GGTCATTCAC
CCGGCCAGTA CCACCCATCA GCAGCTGACA CCGGAGGAAC AGCTGGCCTC GGGTGTTACC
GAAGATCTGG TCCGCATCTC CGTGGGCCTG GAGGACGTAG AAGACCTGAT TGCCGACCTG
GACCAGGCAT TAAACAGGAG CCGGTAG
 
Protein sequence
MTTNHQYRFA TLAVQAGQEP DPATGARAVP IYQTTSYVFR DADHAAALFG LREEGNIYTR 
IMNPTTDVFE KRMAALEGGV GALATASGQA AITLAIANIA TAGQEIVAST SLYGGTFSLF
NSTLPKFGIK TRFVDSAEPE DFRAAITDRT RALYVEILGN PKLDVPDLEA LAAIAHEAGI
PLIVDNTFAT PYLCRPFEFG ADIVVHSATK FIGGHGTSIG GIIVDSGKFN WDNGKFPGLV
EPDPSYHGLS YVQSFGPAAY IVKARVQLLR DLGPALSPFN AFLFLQGLET LHLRMERHVQ
NATRIAGWLA EHPAVAWVSY PGLPGHPYYE RARKYLPKGA GAILTFGIKG GKEAGKKFIN
SVKLFSLLAN VGDAHSLVIH PASTTHQQLT PEEQLASGVT EDLVRISVGL EDVEDLIADL
DQALNRSR