Gene Information |
Locus tag | Athe_1959 |
Symbol | |
ID | 7407373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2071659 |
End bp | 2072915 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716331 |
Product | Homoserine dehydrogenase |
Protein accession | YP_002573819 |
Protein GI | 222529937 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
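The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(2071659, 2072915)
    print(length)                  # 1257
    print(protein_length(length))  # 418
```

With the values listed for Athe_1959, this gives 1257 bp and 418 aa, matching the Gene Length and Protein Length rows.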
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000138202 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCAAAGG TTGCAATAAT GGGATTTGGC GTTGTTGGAT CAGGTGTTTG GGAAGTTTTA ACAAAAAATG CATCATCAAT TGCAAAAAGG GCAGGAGAAG AAATATCGGT AAAATACATT CTTGACATTC GTGATTTCCC TGACCACCCT GCAAAAGATT TGATGATAAA AGATTTTGAC ATCATATTGA ACGACCCCGA AGTTTCAATT GTTGTTGAGA CAATAGGCGG GCTTGAACCT GCATACACTT ATACAAAAAA GCTACTTTTG AATGGTAAAC ACGTTGTAAC ATCTAACAAA GAACTTGTTG CAAAGTACGG TCCAGAACTT TTGAAGATTG CAAAAGAGAA CAATATAAAC TACTTTTTTG AAGCAAGTGT TGGTGGAGGA ATTCCCATTA TAAGACCTCT TCAAAACTGT TTAGCAGGAA ATCAAATTAC AGAAATTGCA GGAATTTTAA ACGGTACAAC AAACTATATT CTCACTCAGA TGAAAAAGTA TTCACTTTCG TTTGAAGATA CTCTAAAAGA AGCTCAGGAA AAAGGATACG CAGAAAGAAA TCCGAGCAAC GATATAGAAG GTCATGATGC ATGTAGAAAG ATTGCAATTC TATCATCTAT TGCATACTCC CATTATGTAA ACTATGAAAA CATTTATACA GAAGGGATAT CCAAGATAAC AAAAGAGGAT ATGGAGTATG CTGAAGAGCT TGGATGTACA ATAAAGCTCA TTGCAATGAG TAAAAAGATT GACGACAAAA AAGTATTTGC AAGAGTTTCG CCCCTTATGA TATCTTACAA AAGCCCGTTT GCAAATGTGG ATGATGTGTT CAATGCAATT TTAGTAAAAG GCGATGCAAT TGGTGATGTG ATGTTTTACG GCCAAGGCGC CGGAAAGCTT CCAACAGCGA GTGCTGTTGT TGGCGACATC ATAGACATTG TAAAACACAT TGATAAGTCT TATGTCTACA CATGGGCAAT CTCTGGGGAT ATTGAAGTTG TGGACATTGA AAATACATCC TGCAGATTCT TTGTAAGAGT CAAATACAAA GACTACACCA AGGCAAAAGA TGCTGTTTCT CTCATATTCA ATGACTGCAT GATAGTAAAC ACACATAGAC CGATAGGCAC AAACGAATTT GCGTTTGTAA CACATGAGAT GAAAGAAAGT GAGTTCAAAG AAAAGATTTC TCAGCTTGAA AAGATTTCTG TTGTAGAAAA GGTCTTGTCT ATTATTAGAT ATGATGAAAA CATATAA
|
Protein sequence | MAKVAIMGFG VVGSGVWEVL TKNASSIAKR AGEEISVKYI LDIRDFPDHP AKDLMIKDFD IILNDPEVSI VVETIGGLEP AYTYTKKLLL NGKHVVTSNK ELVAKYGPEL LKIAKENNIN YFFEASVGGG IPIIRPLQNC LAGNQITEIA GILNGTTNYI LTQMKKYSLS FEDTLKEAQE KGYAERNPSN DIEGHDACRK IAILSSIAYS HYVNYENIYT EGISKITKED MEYAEELGCT IKLIAMSKKI DDKKVFARVS PLMISYKSPF ANVDDVFNAI LVKGDAIGDV MFYGQGAGKL PTASAVVGDI IDIVKHIDKS YVYTWAISGD IEVVDIENTS CRFFVRVKYK DYTKAKDAVS LIFNDCMIVN THRPIGTNEF AFVTHEMKES EFKEKISQLE KISVVEKVLS IIRYDENI
|
| |