Gene Htur_4061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4061 
Symbol 
ID8744689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp314462 
End bp315979 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content52% 
IMG OID646514626 
ProductAldehyde dehydrogenase (NAD(+)) 
Protein accessionYP_003405573 
Protein GI284167295 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATACG ATAAATTCAC GGATACGATC CGCCAGAATC ACGAGCAAGC GATAGAAGAG 
GTGACGCGTG GACGTCAATT CGATGCATGG ATCAAGGGAT CTTCCCGGAC TTCGATCCGA
GGTGAGAGGT TCGAAACGAA AGACCCTGCC ACAGAGAAAA CGATAACGAC GGTACCTCGC
TGTCAGGAAG AAGACGTTGA TAAAGCCGTC GAGGCAGCGA AGAAAGCCTT CAATCAAACT
TGGGGTTCAC TACCGGTAGA CGAACGATCT GAGGCCCTTC TCGATTGGGT CGAGACCCTG
CGTGCGAACA GCGAGGAATT AGCGTTACTC GAGAGTCTGG ATACGGGTAA GCCCCTTGCA
GACGCAGAAT ACGAAGTGAG TGAGGCGATT AATTACATCG AATATTACGC GCAGATAGCG
AGGGGAGAAC AAGGAGACCA GCTCCCGATC GGAGACGATA CGCACGCGTT TACGAAGCAC
GAACCATATG GTGTGGCTGG TCTTATCGTT CCATGGAACT ATCCCCTCTT ACTGGCATCG
TGGAAGCTCG GTCCCGCGCT TTCCGCGGGG AATACCGTTG TTCTAAAGCC TGCGGAAGAT
TCGCCGCTTT CGGTGACGCG AATCGCTCAG CTCTCTGAAG ATGTCCTTCC GCCGGGAACG
CTCAATATCG TCCACGGCTT TGGCGATGAA GCCGGTGCTC CCCTTACACG GCACGAGGAC
ATCTCGAAGC TCTCGTTCAC GGGTGAGGGG CAAACGGGCG AAACAATAAT GAAGGCCGCT
GCTGAGCAGA TAACGCCTGT CACACTCGAA TTGGGTGGCA AATCACCCTT TATCGTTTTT
CCAGACGCAG ACTTAGAGAA CGCGGCGGAG ATCGCTGCGG AAGGAATCTT CTACAATACC
GGCCAATCGT GCGATGCCTT CTCCAGGACA TTAGTACATG AGGATATTCA TGCTGAATTC
CTGGAACTGT TCCGAGCTGA GGCAGAAGAG AGGATTGTCG GTGACCCACT TGCGAAAGAG
ACCACGACCG GTCCCCTCGC GTCGAAGCAG CAATTCGAAA AGGTACGGGA GTATATCGAG
ATAGGTAAAA AAGAAGGAGC TGCTTTAGTG CACGGGGGAG AATCCATCTC CATTACAGAT
AGCGATGACG GTTGGTTCGT AGAACCGACT ATCTTTGACG GAGTTGAAAA TGATATGAGA
ATTGCTCAAG AAGAGATATT CGGCCCGGTG GCTTCGGTGA TTCAGTTCGC GGATTATAAC
GAAGCCGTTT CTATTGCAAA TGACATAGAC TTCGGACTAG CAGCAGGGGT TGCGACGACT
GACCTCTCGA TCGCTCATCG GGCTTCGGAT GATATTCAAG CAGGTACAGT CTGGGTTAAT
CAGTATGCGG ATCTGGTCCC AGGTACACCG TTCGGAGGGT TCAAGCGATC TGGGATAGGA
CGCGAATGCG CAAAAGATAC CCTCCGAGAA TATCAACAGA CGAAGACGGT CAATATAAGT
CTGGGACAGA TGGACTGA
 
Protein sequence
MSYDKFTDTI RQNHEQAIEE VTRGRQFDAW IKGSSRTSIR GERFETKDPA TEKTITTVPR 
CQEEDVDKAV EAAKKAFNQT WGSLPVDERS EALLDWVETL RANSEELALL ESLDTGKPLA
DAEYEVSEAI NYIEYYAQIA RGEQGDQLPI GDDTHAFTKH EPYGVAGLIV PWNYPLLLAS
WKLGPALSAG NTVVLKPAED SPLSVTRIAQ LSEDVLPPGT LNIVHGFGDE AGAPLTRHED
ISKLSFTGEG QTGETIMKAA AEQITPVTLE LGGKSPFIVF PDADLENAAE IAAEGIFYNT
GQSCDAFSRT LVHEDIHAEF LELFRAEAEE RIVGDPLAKE TTTGPLASKQ QFEKVREYIE
IGKKEGAALV HGGESISITD SDDGWFVEPT IFDGVENDMR IAQEEIFGPV ASVIQFADYN
EAVSIANDID FGLAAGVATT DLSIAHRASD DIQAGTVWVN QYADLVPGTP FGGFKRSGIG
RECAKDTLRE YQQTKTVNIS LGQMD