Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2872 |
Symbol | |
ID | 7873774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3108790 |
End bp | 3110310 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699793 |
Product | Aldehyde dehydrogenase (NAD(+)) |
Protein accession | YP_002889848 |
Protein GI | 237653534 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTACG CACTGCCCGG CCAGGCCGAA GCCAAGGTTC AGTTCAAGAC CCGCTACGAC AACTTCATTG GTGGCAAGTG GGTGGCCCCG GTCAAGGGCC AGTATTTCGA TGTGGTGACC CCGATCACCG GCCAGAAGTA CACCCAGGCC GCGCGTTCCA CCGCCGAGGA CATCGAGCTC GCCCTCGACG CCGCGCACGC AGCCTTCCCC AAGTGGGGCA AGGCCGACGC CACCACCCGT TCGAACATCC TGCTCAAGAT CGCCGACCGC CTCGAGGCCA ACCTCGAGCT GCTCGCCTAC GCCGAGACCG TCGATAACGG CAAGCCGATC CGCGAGACGC TCAACGCCGA CATCCCGCTC GCCGTCGACC ACTTCCGCTA CTTCGCCGGC TGCCTGCGCT CGCAGGAAGG CGGCATCTCG GAGATCGACG AGAACACCAT GGCCTATCAC ATCCACGAGC CCCTGGGCGT GGTCGGCCAG ATCATCCCGT GGAACTTCCC CATCCTGATG GCGGCGTGGA AGCTGGCGCC GGCGCTGGGT GCGGGCAACT GCGTGGTGCT CAAGCCCGCC GAGTCGACCC CGATCTCGAT CCTGGTGCTG ATGGAGCTGA TCGCCGACCT GCTGCCGCCG GGCGTGCTCA ACATCGTCAA CGGCTACGGC CGCGAGGCCG GCATGCCGCT CGCCACCAGC AAGCGCATCG CCAAGATCGC CTTCACCGGC TCCACCTCCA CCGGCCGCGT GATCGCCCAG GCCGCCGCCA ACAACCTGAT CCCGGCCACG CTCGAACTGG GCGGCAAGTC GCCCAACATC TTCTTCGCCG ACGTCATGGA CAAGGACGAC GCCTTCCTCG ACAAGGCGAT CGAGGGCCTG GTGCTGTTCG CCTTCAACCA GGGCGAGGTG TGCACCTGCC CGAGCCGCGC GCTGATCCAG GAATCGATCT ACGACCGCTT CATGGAGCGT GCTCTGAAGC GCGTCGCCGC GATCAAGCAG GGCAGCCCGC TCGACACCGA CACCATGATG GGCGCGCAGG CCTCCGCCGA GCAGATGAGC AAGATCCAGT CGTATCTGCA GCTCGGCAAG GAAGAGGGCG CCCAGGTGCT GATCGGCGGT GCGCGCGCCC AACTCGGCGG CGACCTGGCC GACGGCTTCT ATATCCAGCC GACGCTGTTC AAGGGCCACA ACAAGATGCG CATCTTCCAG GAGGAGATCT TCGGGCCGGT GCTCGCGGTG ACCACCTTCA AGGACGAGGC CGAGGCCCTG GCGATCGCAA ATGACACCCT GTATGGTCTG GGCGCCGGCG TGTGGAGCCG CAACGGCAAC GTCGCCTACC GCATGGGCCG CGCCATCCAG GCCGGGCGCG TGTGGACCAA CTGCTACCAC GCCTACCCGG CGCACGCGGC CTTCGGCGGC TACAAGGAGT CGGGCATCGG CCGCGAGACG CACAAGGTCA TGCTCGACCA CTACCAGCAG ACCAAGAACC TGCTGGTGAG CTACAGCGAG AACAAGCTCG GCTTCTTCTG A
|
Protein sequence | MLYALPGQAE AKVQFKTRYD NFIGGKWVAP VKGQYFDVVT PITGQKYTQA ARSTAEDIEL ALDAAHAAFP KWGKADATTR SNILLKIADR LEANLELLAY AETVDNGKPI RETLNADIPL AVDHFRYFAG CLRSQEGGIS EIDENTMAYH IHEPLGVVGQ IIPWNFPILM AAWKLAPALG AGNCVVLKPA ESTPISILVL MELIADLLPP GVLNIVNGYG REAGMPLATS KRIAKIAFTG STSTGRVIAQ AAANNLIPAT LELGGKSPNI FFADVMDKDD AFLDKAIEGL VLFAFNQGEV CTCPSRALIQ ESIYDRFMER ALKRVAAIKQ GSPLDTDTMM GAQASAEQMS KIQSYLQLGK EEGAQVLIGG ARAQLGGDLA DGFYIQPTLF KGHNKMRIFQ EEIFGPVLAV TTFKDEAEAL AIANDTLYGL GAGVWSRNGN VAYRMGRAIQ AGRVWTNCYH AYPAHAAFGG YKESGIGRET HKVMLDHYQQ TKNLLVSYSE NKLGFF
|
| |