Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1774 |
Symbol | |
ID | 5104774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1716239 |
End bp | 1717681 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507672 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001191853 |
Protein GI | 146304537 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00658035 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCCAA TTATTCTAGG CGGAGAGAAG GTAGTAACTC AGGCAAAGAT AACCGTGATG GATCCGGGGA AAGGAAAACC CCTGAACGAG GTTTCCGTTG CTGGAAGGGA AGAAACGAGA CACGCCATAG AGCTAGCTGA CCAGGCCTTC GATCACTTCT CTAGGCTCGC ATTGAAGGAG AGAACAAAGA TTCTTGAGAG GGCGGCTGAA ATCATGGAGA AGAGGAGCGA GGAACTGGCT AGAACCCTGA CAGCCGAGTC CGGGAAGCCC ATCAGGGATG CTAGGGTTGA GGTAACTAGG GCAATCCATC TCTTCAGGTC AGCAGCGCAG GAAGTTAGAC TAGTTCTTGA GGGAGCTACG TTCAGGGTAG ACGGATATGA GTATCCTCCC GGTAACGAGA GGAGAATGGT GATGTCTGTG AGAGAGCCAC TGGGAGTGGT TGGGGCAATC CTCCCCTTCA ATTTCCCGGC AAATAGTTTC GCACACAAGG TTGCACCGAA TTTAGCGGTC GGAAACACCG TGGTAGTAAA GCCCTCAACC TCAACCCCCA TTACGGGAGT CCTACTGGGG GAGATCCTGT ATGAGGCAGG ACTTCCCAAG GGTGTGTTAA GCGTTCTGCC AGGGGGTGGA GATACCGTGG GGGCAGAGAT AGTGGAGAAC AGGAGGGTGA AAGGAATCAC GTTTACCGGC TCAACTCCCG TGGGGACGTC CATTGCCTCA AAGGCCGTGA CAACGGGAAA GAGGGTTATG ATGGAGATGG GCGGATCTGA TCCAATCGTA GTGTTCAAGG ATGCCGATCT CGAGAGGGCG GTCTCAATAT CCGTGAGGGC CAGGTTTGAG TATGCAGGGC AGAATTGTAA TGCCGGGAAG AGGATACTGG TTCAGGAGGA GATTTACGAG AAATTCCTCA AGGAGTACGT GAATAGGGTG AAATCCCTGC GGGTTGGTGA TCCCATGGAC GAGAGCACTG AGATGGGACC TGTTATCTCG GCGAGCGTGG TTAAGGACCT GGAGAACGCG GTGTTAGACT CCGAGACGAA GGGAGGGGTG ATTCACAGGG GTATGGCGGG CACTGGCGGT TATTATTTTG CTCCCACAGT GGTCGAGAAC GCTAACTTGG ACATGGTCAT CATGAAGAGG GAGATCTTTG GTCCCGTTGC ACCAATAGCT AAGTTCTCTG ATTGGAAGGA GGCCGTGGAA TTGGCAAACT CGACCGAATA TGGCCTGCAG GCCTCTATCT TCACCTCAGA TTTGAAATTG GCACTAAAGA TGAGCCGGGA AATCAGGGCC GGAGCAGTGA TAGTGAATGA CAGTACCAGG TTGAGGTGGG ACTCCCTTCC CTTCGGCGGG GTAGGTCTCA GCGGGATAGG GAGCAGGGAG GGGGTGAGAA GTACCATGTT AGCCATGACA GAGGAGAAGC TGATCTCCTT GGACCTCTCC TAG
|
Protein sequence | MIPIILGGEK VVTQAKITVM DPGKGKPLNE VSVAGREETR HAIELADQAF DHFSRLALKE RTKILERAAE IMEKRSEELA RTLTAESGKP IRDARVEVTR AIHLFRSAAQ EVRLVLEGAT FRVDGYEYPP GNERRMVMSV REPLGVVGAI LPFNFPANSF AHKVAPNLAV GNTVVVKPST STPITGVLLG EILYEAGLPK GVLSVLPGGG DTVGAEIVEN RRVKGITFTG STPVGTSIAS KAVTTGKRVM MEMGGSDPIV VFKDADLERA VSISVRARFE YAGQNCNAGK RILVQEEIYE KFLKEYVNRV KSLRVGDPMD ESTEMGPVIS ASVVKDLENA VLDSETKGGV IHRGMAGTGG YYFAPTVVEN ANLDMVIMKR EIFGPVAPIA KFSDWKEAVE LANSTEYGLQ ASIFTSDLKL ALKMSREIRA GAVIVNDSTR LRWDSLPFGG VGLSGIGSRE GVRSTMLAMT EEKLISLDLS
|
| |