Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0698 |
Symbol | |
ID | 7407122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 782881 |
End bp | 783660 |
Gene Length | 780 bp |
Protein Length | 259 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643715070 |
Product | sorbitol-6-phosphate dehydrogenase |
Protein accession | YP_002572586 |
Protein GI | 222528704 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00117968 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTGCA AACGATTAGA GGGACAGGTT GCGATTGTTA CAGGGGCTGC CCAGGGGCTT GGCGAAGCTT TAGCAAGAAG ACTTGACAAA GAAGGATGCA AGGTTGTTGT TGCAGATATA AACTTCGAAG GTGCTCAAAA AGTTGCAAGT GAGCTATCTG AAGCTATTGC TGTAAAGTGT GACGTTACAA ACGAAGAAGA TGTCGAAACA ATGGTCGACA AGACAATTGA AACTTTTGGC CAGCTTGATT TGATGGTTGC AAATGCCGGA ATACTGATTG CAAAGCCTAT TACAGAATTT TCACTTGCTG AGTGGAAAAA GGTAATCGAT GTAAACCTCA TTGGATATTT CTTGTGTGCA AGAGCCGCAG CAAGAGTGAT GATTCCACGC CGAAAAGGAA ATATAATCCA GATAAATAGT AAGTCTGGAA AGAAAGGATC ATACAAAAAC TCCGCATACT CTGCATCAAA GTTTGGTGGC ATTGGTCTTA CCCAGAGCTT GGCACTTGAG CTTGCAGAGT ACGGGATTAG AGTAAATGCT ATATGCCCGG GAAATTTGCT TGACTCACCT TTGTGGGTAA ACAGCCTTTA TGAGCAGTAC TCCAAAAATC AAGGACTTAC ACCAGAACAG ATAAGAGAAA AATATTTGAG CCAAGTACCA CTCAGGCGTG CATGCACATA TGACGATGTT GCAAATGTTC TCGTATTTTT AGCATCTGAT GAGGCAAGCT ATATGACAGG GCAAGCTATA AATGTAACAG GTGGTCAGGA AATGAGATAA
|
Protein sequence | MICKRLEGQV AIVTGAAQGL GEALARRLDK EGCKVVVADI NFEGAQKVAS ELSEAIAVKC DVTNEEDVET MVDKTIETFG QLDLMVANAG ILIAKPITEF SLAEWKKVID VNLIGYFLCA RAAARVMIPR RKGNIIQINS KSGKKGSYKN SAYSASKFGG IGLTQSLALE LAEYGIRVNA ICPGNLLDSP LWVNSLYEQY SKNQGLTPEQ IREKYLSQVP LRRACTYDDV ANVLVFLASD EASYMTGQAI NVTGGQEMR
|
| |