Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4061 |
Symbol | |
ID | 8744689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 314462 |
End bp | 315979 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 646514626 |
Product | Aldehyde dehydrogenase (NAD(+)) |
Protein accession | YP_003405573 |
Protein GI | 284167295 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATACG ATAAATTCAC GGATACGATC CGCCAGAATC ACGAGCAAGC GATAGAAGAG GTGACGCGTG GACGTCAATT CGATGCATGG ATCAAGGGAT CTTCCCGGAC TTCGATCCGA GGTGAGAGGT TCGAAACGAA AGACCCTGCC ACAGAGAAAA CGATAACGAC GGTACCTCGC TGTCAGGAAG AAGACGTTGA TAAAGCCGTC GAGGCAGCGA AGAAAGCCTT CAATCAAACT TGGGGTTCAC TACCGGTAGA CGAACGATCT GAGGCCCTTC TCGATTGGGT CGAGACCCTG CGTGCGAACA GCGAGGAATT AGCGTTACTC GAGAGTCTGG ATACGGGTAA GCCCCTTGCA GACGCAGAAT ACGAAGTGAG TGAGGCGATT AATTACATCG AATATTACGC GCAGATAGCG AGGGGAGAAC AAGGAGACCA GCTCCCGATC GGAGACGATA CGCACGCGTT TACGAAGCAC GAACCATATG GTGTGGCTGG TCTTATCGTT CCATGGAACT ATCCCCTCTT ACTGGCATCG TGGAAGCTCG GTCCCGCGCT TTCCGCGGGG AATACCGTTG TTCTAAAGCC TGCGGAAGAT TCGCCGCTTT CGGTGACGCG AATCGCTCAG CTCTCTGAAG ATGTCCTTCC GCCGGGAACG CTCAATATCG TCCACGGCTT TGGCGATGAA GCCGGTGCTC CCCTTACACG GCACGAGGAC ATCTCGAAGC TCTCGTTCAC GGGTGAGGGG CAAACGGGCG AAACAATAAT GAAGGCCGCT GCTGAGCAGA TAACGCCTGT CACACTCGAA TTGGGTGGCA AATCACCCTT TATCGTTTTT CCAGACGCAG ACTTAGAGAA CGCGGCGGAG ATCGCTGCGG AAGGAATCTT CTACAATACC GGCCAATCGT GCGATGCCTT CTCCAGGACA TTAGTACATG AGGATATTCA TGCTGAATTC CTGGAACTGT TCCGAGCTGA GGCAGAAGAG AGGATTGTCG GTGACCCACT TGCGAAAGAG ACCACGACCG GTCCCCTCGC GTCGAAGCAG CAATTCGAAA AGGTACGGGA GTATATCGAG ATAGGTAAAA AAGAAGGAGC TGCTTTAGTG CACGGGGGAG AATCCATCTC CATTACAGAT AGCGATGACG GTTGGTTCGT AGAACCGACT ATCTTTGACG GAGTTGAAAA TGATATGAGA ATTGCTCAAG AAGAGATATT CGGCCCGGTG GCTTCGGTGA TTCAGTTCGC GGATTATAAC GAAGCCGTTT CTATTGCAAA TGACATAGAC TTCGGACTAG CAGCAGGGGT TGCGACGACT GACCTCTCGA TCGCTCATCG GGCTTCGGAT GATATTCAAG CAGGTACAGT CTGGGTTAAT CAGTATGCGG ATCTGGTCCC AGGTACACCG TTCGGAGGGT TCAAGCGATC TGGGATAGGA CGCGAATGCG CAAAAGATAC CCTCCGAGAA TATCAACAGA CGAAGACGGT CAATATAAGT CTGGGACAGA TGGACTGA
|
Protein sequence | MSYDKFTDTI RQNHEQAIEE VTRGRQFDAW IKGSSRTSIR GERFETKDPA TEKTITTVPR CQEEDVDKAV EAAKKAFNQT WGSLPVDERS EALLDWVETL RANSEELALL ESLDTGKPLA DAEYEVSEAI NYIEYYAQIA RGEQGDQLPI GDDTHAFTKH EPYGVAGLIV PWNYPLLLAS WKLGPALSAG NTVVLKPAED SPLSVTRIAQ LSEDVLPPGT LNIVHGFGDE AGAPLTRHED ISKLSFTGEG QTGETIMKAA AEQITPVTLE LGGKSPFIVF PDADLENAAE IAAEGIFYNT GQSCDAFSRT LVHEDIHAEF LELFRAEAEE RIVGDPLAKE TTTGPLASKQ QFEKVREYIE IGKKEGAALV HGGESISITD SDDGWFVEPT IFDGVENDMR IAQEEIFGPV ASVIQFADYN EAVSIANDID FGLAAGVATT DLSIAHRASD DIQAGTVWVN QYADLVPGTP FGGFKRSGIG RECAKDTLRE YQQTKTVNIS LGQMD
|
| |