Gene Athe_1190 details

Gene Information

Locus tag: Athe_1190
Symbol:
ID: 7408772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1281259
End bp: 1282488
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 37%
IMG OID: 643715555
Product: O-acetylhomoserine aminocarboxypropyltransferase
Protein accession: YP_002573063
Protein GI: 222529181
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID:
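
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the function name `cds_lengths` is a hypothetical helper, not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes inclusive 1-based coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # whole codons in the span
    protein_length = codons - 1           # stop codon is not translated
    return gene_length, protein_length


# Coordinates copied from the record above.
print(cds_lengths(1281259, 1282488))  # (1230, 409)
```

Under these assumptions the result agrees with the listed Gene Length (1230 bp) and Protein Length (409 aa).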


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGTTTA ATACTTCTTT AATTCATGGA GGTATTGGTC AAAAAGAAAA TAAGGGGGCA 
ACTAACATTC CGATATACCA ATGTAATTCT TTCCAATATG AAACTGCACG TGAGTTGGAA
GAGGTTTTTT CTGGCAAAAA GCCAGGCTTT ATATATACAA GGATTAACAA TCCCACAGTT
GAAGCGTTTG AAAGAAGAAT AGCATTTTTA GAAGAAGGCA TTGCTGCTGT TGCCACATCA
TCTGGCATGG CAGCCGTTGC CTTAGCAATA TTGAATTTAG TAAGAAATGG AGATGAAATT
GTTTCAGCAA GCGGGATTTT TGGTGGCACA TATTCATTGT TCAGATCATT TGAAAACCTT
GGTATTAAGA CAAGATTTGC AGAAGATAGC AGCCTTGAGA GCTTTGAAAA GCATATAACA
GAGAAAACTA AAGTAATTTT TGTAGAAACA ATAGGAAATC CAAAACTGGA TGTGCCAAAT
ATCAAACAAA TAGCTGAGCT TGCGCATGAG CATGGTATTG CACTCATTGT TGATAGCACT
GTCACAACAC CGTACCTTGT AAAACCCATA AAACTCGGTG CTGATATAGT GGTTCATTCT
ACATCAAAGT TTATAAATGG AAGCGGCAGC TGTATTGGCG GAGTTATAGT TGCAAGCAGC
AACATGAAAA TTGATTATGA TAGGTATCCG CTTATTAAGG AATACAAAAA GTATGGTGAA
TTTGCGTACA TTGCACGACT TCGAAATAAT TTGCTTAAAG ATTTTGGCGC CTGTATATCG
CCTTTTAATG CATTTTTAAA TACAATCGGG CTTGAAACCC TCGGTGTTCG TATGCAAAAG
ATTTGCGAGA ATGCTCTTTG CCTTGCCAAA GCCCTAAAAG AAAATAAGAA GGTTGTTTCA
GTAAATTACC CTGGGCTTGA TGAAAGTAGT TACTTTAGAG TTGCAACAGA ACAGTTTGGA
GGCAAATATG GAGCAATTTT GACAATACGG GTTGGAACAA AGGTGAATGC CTTTAAAGTG
ATAGATTCAT TGCGATATGC CATAAATTCA ACCAATATAG GAGATGTAAG GACACTTGTT
GTACATCCTG CGTCAACTAT ATATGCAAGC TTTTCTGTTG AAGAAAAAGA ATCTATGGGT
GTTTATGAAG ATATGATAAG AATATGTGTT GGCCTTGAGG ATGTAGAAGA CATAATAGAA
GATTTTTACC AGGCACTTGA AAAGATTTAA
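
The record lists a GC content value but does not say how it was computed. A minimal sketch, assuming it is the share of G and C among the A/C/G/T bases of the gene sequence, is given here; `gc_percent` is a hypothetical helper name, and whitespace between the 10-base blocks is ignored.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces, newlines and any other characters are skipped.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 3 G, 0 C out of 10 bases.
print(gc_percent("ATGAGGTTTA"))  # 30.0
```

Pasting the full gene sequence into `gc_percent` and rounding to a whole percent would give a value comparable to the GC content field, provided the database uses the same definition.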
 
Protein sequence
MRFNTSLIHG GIGQKENKGA TNIPIYQCNS FQYETARELE EVFSGKKPGF IYTRINNPTV 
EAFERRIAFL EEGIAAVATS SGMAAVALAI LNLVRNGDEI VSASGIFGGT YSLFRSFENL
GIKTRFAEDS SLESFEKHIT EKTKVIFVET IGNPKLDVPN IKQIAELAHE HGIALIVDST
VTTPYLVKPI KLGADIVVHS TSKFINGSGS CIGGVIVASS NMKIDYDRYP LIKEYKKYGE
FAYIARLRNN LLKDFGACIS PFNAFLNTIG LETLGVRMQK ICENALCLAK ALKENKKVVS
VNYPGLDESS YFRVATEQFG GKYGAILTIR VGTKVNAFKV IDSLRYAINS TNIGDVRTLV
VHPASTIYAS FSVEEKESMG VYEDMIRICV GLEDVEDIIE DFYQALEKI
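
The relationship between the two sequences above can be illustrated with a small translation routine. This is a sketch, not the database's own pipeline: it builds the codon table from the compact 64-letter form used for NCBI translation table 11, whose amino acid assignments match the standard code, and it does not handle alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are chosen for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; the amino acid string follows
# the same order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first; a codon outside A/C/G/T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAGGTTTA ATACTTCTTT AATTCATGGA"))  # MRFNTSLIHG
# Last line of the gene sequence, ending in the TAA stop codon.
print(translate("GATTTTTACC AGGCACTTGA AAAGATTTAA"))  # DFYQALEKI
```

Both outputs match the start and end of the protein sequence listed above.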