Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1125 |
Symbol | |
ID | 7408707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1217022 |
End bp | 1218167 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643715491 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_002572999 |
Protein GI | 222529117 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000197892 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAAAAAT TTGAGTTCTG GGAAGAGGGC TACAAAAAAT TTGTACGATT TGCTCACAAC AAAGATTTTG TGTTAGAAAG TGGAAAAACA TTTGGACCAA TAACAGTTGC TTATGAAGTC TATGGAGAGA TAAATAAAGA AAAAAATAAT ATCATCCTTA TAACCCACGC TCTAACAGGT GATTCTCACG TTGCAAAGCA TTCAGAAGAT GACCCTAAGC CAGGATGGTG GGACAAGTTT GTTGGCCCTG GCAAGATGAT TGACACAAAT AAATATTTTG TAATATGTTC AAATGTGTTT GGGGGCTGTC AGGGGACAAC AGGTCCTTCT TCTATTGACC CTGAGACAGG AAAACCTTAT GGTGCTAGGT TTCCTATAAT AACCATAAAA GACATGGTGA ATGTTCAGAA AAAGCTTTTA GAAGCTCTAA AGATAGACCA TATACTATGT GTTGTTGGCG GTTCCATGGG CGGAATGCAG GCTTTGGAGT GGGCTGTGAG CTACCCGGAT TTTATGGACG GGGTTATAAA CATTGCATCG CCGCTCAAAC TCAACGCACA GTCCATTGCT TTCAATGAAG TTATGAGAAG GGCTATCATG GCAGACCCAA ACTGGCACGG TGGAGACTAC TATGACAAGA CAGGACCTTC TCAGGGACTT TCAATTGCCA GAATGCTTGG CATGATAACA TACCAGTCTG ACAAGCTTAT GGATAAAAAG TTTAATCGAC GAATGAAAGA CCCTGTTGAG AGCTTTTTTG AGTCGTTTAA CACCGAGTTT GAAGTCGAAA GTTATTTGCA CTATCAGGGT ATGAAACTTG TTCAGAGGTT TGATGCAAAC ACATATCTTT ATCTTACACG TGCCATGGAC CTTTATGATT TGGGAAGGAC ATATGGCAGT GAAGAGGAGG CTTTAAGACG AATAAAGGCA AAATTTTTGC TCATTGCTAT AACATCGGAT ATACTTTTTC CGCTATCTCA GATGAGATAT ATGAGAGACA AGCTTTTAGA AGCTGGGGTT GACCTTGTTT ACAAAGAAAT TGAGTCTGAT TATGGTCACG ACTCTTTTTT GGTTGAAGAG GAAAAATACA GGAATATTAT ATCTGAATTT TTAGAATTAC TTTATAATAA GGAGATGATA AGATGA
|
Protein sequence | MEKFEFWEEG YKKFVRFAHN KDFVLESGKT FGPITVAYEV YGEINKEKNN IILITHALTG DSHVAKHSED DPKPGWWDKF VGPGKMIDTN KYFVICSNVF GGCQGTTGPS SIDPETGKPY GARFPIITIK DMVNVQKKLL EALKIDHILC VVGGSMGGMQ ALEWAVSYPD FMDGVINIAS PLKLNAQSIA FNEVMRRAIM ADPNWHGGDY YDKTGPSQGL SIARMLGMIT YQSDKLMDKK FNRRMKDPVE SFFESFNTEF EVESYLHYQG MKLVQRFDAN TYLYLTRAMD LYDLGRTYGS EEEALRRIKA KFLLIAITSD ILFPLSQMRY MRDKLLEAGV DLVYKEIESD YGHDSFLVEE EKYRNIISEF LELLYNKEMI R
|
| |