Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0428 |
Symbol | |
ID | 7084938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 488936 |
End bp | 490396 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697460 |
Product | succinic semialdehyde dehydrogenase |
Protein accession | YP_002354103 |
Protein GI | 217968869 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGA ACCTGAAAGA TCCCGACCTC TTCCGCACCC GCTGCCATGT CGATGGGCAG TGGATCGACG CCGACGACGG CGCCACCACG AGCATCCGCA ACCCGGCCAC GGGCGAGGTG CTCGGCACCA TCCCGCGCAT GGGCGCGGCC GAGACCCGGC GCGCCATCGC GGCCGCCAAC GCCGCATGGC CGGCGTGGCG TGCGCTGACC GCCGGCGCGC GCGCGAAGAT CCTGCGGCGC TGGTTCGAGC TCATCCTCGC CAACCAGGAA GACCTCGCCG TGCTGATGAC CAGCGAGCAG GGCAAGCCGC TCGCCGAGGC GCGCGGCGAG GTGCTCTACG CCGCCTCCTT CATCGAGTGG TTCGCCGAGG AAGGCAAGCG CATCTACGGC GACGTGATCC CCGGCCACCA GCCCGACAAG CGCATCGTCG TCACCAAGGA GCCGATCGGG GTGTGCGCGG CGATCACGCC GTGGAACTTC CCCGCGGCGA TGATCACGCG CAAGGCCGGT CCGGCGCTCG CGGCCGGATG CACGATGGTG CTCAAGCCCG CCACCCAGAC CCCTTACTCG GCGCTCGCGC TGGCGGTGCT CGCCGAGCGC GCGGGGGTGC CGAAGGGCGT GTTCAGTGTG GTCACCGGCG GCGCGGCCGA GATCGGCGGC GAGCTGACCG CCAACCCGAT CGTCCGCAAG CTCACCTTCA CCGGCTCCAC CGAGATCGGC GTCAAGCTGA TGGCGCAGTG CGCGCCGAGC GTCAAGAAGC TCTCGCTCGA GCTCGGCGGC AATGCGCCCT TCATCGTCTT CGACGATGCC GACCTCGACG CCGCGGTCGA GGGCGCCATC GCTTCCAAAT ACCGCAACAC CGGCCAGACC TGCGTGTGCG CCAACCGCCT GCTGGTGCAG GACGGTGTGT ACGACGCCTT CGCCGCCAGG CTCGCCGCCG CGGTGGCGCG CCTGAAGGTG GGCAACGGCC TCGCCGAGGG CAGCACCCAG GGCCCGCTGA TCGACATGAA CGCGGTGGCC AAGGTCGAGG AGCACATCGC CGACGCGGTG GAGAAGGGCG CGCGCGTGCT CGCCGGCGGC AAGCGCCACG CGCTCGGCGG CAGCTTCTTC GAGCCCACCA TCCTGGTCGA CGTGACCCCG GCGATGAAGG TGGCGCGCGA GGAGACCTTC GGCCCGGTGG CGCCGCTGTT CCGCTTCAAG GACGAGGCCG AGGCGATCCG CATGGCCAAC GACACCGAGT TCGGCCTCGC CGCCTATTTC TACGCCAGCT CGATGAACCG CGTGTGGCGG GTCGGGGAGG CGCTCGAGTA CGGCATCGTC GGCATCAACA CCGGAATCAT CTCGACCGAG GTCGCGCCCT TCGGCGGCAT GAAGTCCTCC GGCCTCGGCC GCGAAGGTTC CAAGTACGGC ATCGAGGACT ACCTCGAGGT CAAGTATCTG TGCATGGGCG GCGTGCAGTG A
|
Protein sequence | MSLNLKDPDL FRTRCHVDGQ WIDADDGATT SIRNPATGEV LGTIPRMGAA ETRRAIAAAN AAWPAWRALT AGARAKILRR WFELILANQE DLAVLMTSEQ GKPLAEARGE VLYAASFIEW FAEEGKRIYG DVIPGHQPDK RIVVTKEPIG VCAAITPWNF PAAMITRKAG PALAAGCTMV LKPATQTPYS ALALAVLAER AGVPKGVFSV VTGGAAEIGG ELTANPIVRK LTFTGSTEIG VKLMAQCAPS VKKLSLELGG NAPFIVFDDA DLDAAVEGAI ASKYRNTGQT CVCANRLLVQ DGVYDAFAAR LAAAVARLKV GNGLAEGSTQ GPLIDMNAVA KVEEHIADAV EKGARVLAGG KRHALGGSFF EPTILVDVTP AMKVAREETF GPVAPLFRFK DEAEAIRMAN DTEFGLAAYF YASSMNRVWR VGEALEYGIV GINTGIISTE VAPFGGMKSS GLGREGSKYG IEDYLEVKYL CMGGVQ
|
| |