Gene Htur_4746 details

Gene Information

Locus tag: Htur_4746
Symbol:
ID: 8745337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013745
Strand:
Start bp: 352177
End bp: 355512
Gene Length: 3336 bp
Protein Length: 1111 aa
Translation table: 11
GC content: 65%
IMG OID: 646515245
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_003406192
Protein GI: 284172810
COG category: [R] General function prediction only
COG ID: [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type
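
The coordinate and length fields in this record are mutually consistent, which the short Python sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(352177, 355512)
print(length_bp)                  # 3336
print(protein_length(length_bp))  # 1111
```

Under these assumptions, 3336 bp corresponds to 1112 codons, of which the last is the stop codon, leaving the 1111 aa reported above.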


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.879601
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACTG ACGACACACT TCCAGGCGTT CCCGATATCG AAGACTCGCA ACCAAATACG 
CCGGTCTCCG CCACGTTCGA GACCGGCACC GCGAACGACC CCGCCGTCGG TACCGACGGC
GACGAGCCGA CGACGGTGAC GATCAACGGC GAACCGGTGA CCGTCCCGCC GGGTTCGACC
GTTATCGACG CGATGCAGGC CGTCAGCGAC GAGACGGTCA GCGTCGACCC CGGCGAGGAC
GGCGTTGATG ACGACGCCGA TGTCCCTGCA CTCTGCTACT ACGACCGCGA CGGCGACTGC
AGCGGCGAGA TCGGTCCGCG GAGCGAGTGC CGGACCTGCA TGGTCGAGAC CGACGAGCAC
GGCCTCGTTC CCTCGTGTTC GTTCCCGGCC GAGGAGGGGC TCTCCGTGCA GACGGATACC
CCCGATGCCG AAGAGACGCG CAGCGTCAAC CTCGATCTGG TCCTCTCGAA CCACAACCTC
CGGTGTACGA CCTGCAACGG CAACGGCCGC TGTGAACTAC AGGACACCGC AATCAGCGAG
GGCGTCGACC ATCCGCGCTA TGGCGTCTTT GACGATCGTG ACGAGTACGA ACCGCTCGAC
GACAGTTCGT CGTTCATCCA GATCGACCGG AACAAGTGCA TCCTCTGTAA CCGGTGTGTC
GAGGGATGTA ACGACGTCCA GGTCGAGGGC GTGCTCCGCA TCGAGGGCCA CGGCGAGGAT
ACCCGGATCG GCTTCCAGTC CGACGCCGAG ACGATGGCCG AGTCCGACTG CGTTTCCTGC
GGTCACTGCG CGACTGTCTG TCCGACAGGC GCGCTGACAG AGAAGGACAT CGGCGGCGCG
GCGACGCTTC CGCTTCCAGG GTTCACCCAG CGCAACTCCA TCGGGACGGT TATCGAGCAT
GAAGAAGTCG AAACGCTCGA CGACACGACG GCACCGAATC GCTCGCCCGA CCCCGGCGGA
GAGAGTCGGA TCGGAGCCGG CCCCAGCGTG AGTCCAGACG ATCGAAGCGG CGTCGCACGG
TTTATGTCAC AGAGCAGGCG CCGTGCGATG GACATCGCCA CCGAGTACGG TCACAAAGCC
ATGTTGGCGG GCGAACACAC CGCCGAAAAC ATCGCGACGA AGGTGCTCCC CGAGGGGAGA
TTGTTCGACG TCGCGTCGAC CGTGAGCGAC TACCGCCTCG GCAAGATCGA CAAGGAGGAG
ACGACGTGCG GGTTCTGTGC TGTTGGCTGC CGATTCGAGA TGTGGGGCAA AGACGGCGAC
GCGATCGGTG TCCAACCCGT CGAAGACCCC GCGAAGGCGC CCGCTAACAA CTTCTCGACC
TGCGTGAAAG GGAAGTTCGG CCACGAGTTC GCCAACAGCG ACGAGCGGAT CACGGAACCG
CTTGTTCGCA ACGAGGACGG CGAGTTCGAG ACGGCGTCCT GGGACGAGGC GCTCGATCGC
GTCGCGAGCG GGCTGCGCGC GATCCAGGAC GAACACGGCA TCGACGCCGT CGGCTGTCTC
GCGTCCTCGA AGGGGAGCAA CGAAGAGGCG TATCTCGTCC AGAAGTTCGC GCGACAGGTT
CTCGGAACGA AAAACATCGA CAACTGTGCG CGGCTCTGTC ACTCGACGAC GGTGGCGGCG
CTACAACAGA CGCTCGGGTA CGGCGCTATG ACCAACCGCA TCAACGAGGA CGTCGGCGAG
GCCGACGCCT ACCTCATCAC CGGGTCGAAC ACGACGGAGT CGCACCCGGT TTTGGCGACC
CGTATCAAGC AAAACGTCCG AGACGGCGCC GACCTGATCG TCTTCGATCC CCGTGAGGTC
AACATCGCAG AGCACGCCGA CCAGTACACA CGGACCAGAC CTGGGTATGA CGTCGCCTGG
ATCAACGGTC TCATCCGATA TATCATTGAG AACGACCTCC ACGATGAGGC GTTCATCGAA
CGTAACACGA AGGGATTCGA GAAAGTCAAG GAGAAGGTAC AGGCGTTCAC ACCCGAGAAC
GTCGAAGAGC TGGCTGGCGT TCCGCCTGCA GAGCTGAAGT CCGCCGCCGA GACGCTCGCC
GAGGCCGATA CCGTCGTCTT CGGCTGGGCG ATGGGAATGA CCCAGTCCAG CCACGGCACG
CAGAACCTCC TCGCGATGGC CGACCTCGCC CTCACGCTCG GCCAGGTCGG CAAGCCCGGG
GCCGGGCTCT CACCTTTCCG GGGGCAGAAC AACGTGCAGG GCGGCGGCGG CGACATGGGA
ACGCTTCCTG GAAGCCTGCC GGGCTATCAG GATCCAGCGG ACGCCGAGGT CGCCGAAAAG
TTCGAAAAAG CGTGGGGCGA GCGCCCGCCC GAGGAGCCGG GGCTCAAGGT GCCGGAGATG
CTCTCGGAGG CTCACGAAGG CAACCTGCGT GGAATGTACG TCGTCGGGGA GAATCCCGCG
TTGTCCGAAC CCGACATCCA GCACGCCGAA GCGGCACTCG AGAAACTCGA GTTCCTCGTC
GTTCAGGACA TCTTCATGAC GGAGACGGCG ACTCACGCGG ACGTGATCTT GCCCGCAGCG
ACGTCGCCGG AGAAACACGG CACGTTCACT AACACCGAGC GCCGCATCCA GCGGGTGCGC
CCAACTGCGA CACCGCCCGG GACGGCGCGC CAAGACTGGG AGATCACTCA GGACCTGGCC
AACCGGCTCG GGTATAGCTG GGACTACGAC CACCCGCGGG AGATCATGGA CGAGATCAGC
GACCTGGTCC CGATCTACGG CGGTGTCAGC TACGACCGCC TCGAGTCGGG CGACGAGCAC
GGACTCCAGT GGCCCTGCTG GGACGAAGAC CACCCCGGAA CGCCATACCT CTACGATTAC
GAGGACGGAG AGTTCAACTT CGACGACGGT ATGGCTCGCT TCGTGCCCGC GGACGGTGGA
CACCCCGGCG AGCTGCCCGA CGAAGAGTAT CCGCTCACGC TCACCTCCGG GCGGGTGCTC
TACCACTGGC ACACCGGCCA GATCACCCGG CGCGTCGAGG GGCTCATGAG CCACGTCGGC
GAGAGCTTCG TCGAGATCAA CCCGTCGACG GCCGACGAAC TCGGCGTCGC CGACGGCGAG
TACGTCCAGG TCGAGTCGCG CCGCGGAGAC ATCGTCGTCA AGGCGAACGT AACCGACCGC
GTCGGCGAGG GAACGTTGTT CATCCCGATG CACTTCGCTG CCGGCGCGGT CAACAAGCTC
ACTCAGGAGA GCTTTGACCC GCACACGGGA ATTCCCGAGT ACAAGGTGTC CAGCGTCCGC
GTCGAGCCGC TCGGATCGGA GGCCAATCCG GACGTGTTGC GGGCGCCCGA CGCTGGAGCC
GATAGCGACG GCGGAACCGC AGTCAGCGAC GACTGA
 
Protein sequence
MSTDDTLPGV PDIEDSQPNT PVSATFETGT ANDPAVGTDG DEPTTVTING EPVTVPPGST 
VIDAMQAVSD ETVSVDPGED GVDDDADVPA LCYYDRDGDC SGEIGPRSEC RTCMVETDEH
GLVPSCSFPA EEGLSVQTDT PDAEETRSVN LDLVLSNHNL RCTTCNGNGR CELQDTAISE
GVDHPRYGVF DDRDEYEPLD DSSSFIQIDR NKCILCNRCV EGCNDVQVEG VLRIEGHGED
TRIGFQSDAE TMAESDCVSC GHCATVCPTG ALTEKDIGGA ATLPLPGFTQ RNSIGTVIEH
EEVETLDDTT APNRSPDPGG ESRIGAGPSV SPDDRSGVAR FMSQSRRRAM DIATEYGHKA
MLAGEHTAEN IATKVLPEGR LFDVASTVSD YRLGKIDKEE TTCGFCAVGC RFEMWGKDGD
AIGVQPVEDP AKAPANNFST CVKGKFGHEF ANSDERITEP LVRNEDGEFE TASWDEALDR
VASGLRAIQD EHGIDAVGCL ASSKGSNEEA YLVQKFARQV LGTKNIDNCA RLCHSTTVAA
LQQTLGYGAM TNRINEDVGE ADAYLITGSN TTESHPVLAT RIKQNVRDGA DLIVFDPREV
NIAEHADQYT RTRPGYDVAW INGLIRYIIE NDLHDEAFIE RNTKGFEKVK EKVQAFTPEN
VEELAGVPPA ELKSAAETLA EADTVVFGWA MGMTQSSHGT QNLLAMADLA LTLGQVGKPG
AGLSPFRGQN NVQGGGGDMG TLPGSLPGYQ DPADAEVAEK FEKAWGERPP EEPGLKVPEM
LSEAHEGNLR GMYVVGENPA LSEPDIQHAE AALEKLEFLV VQDIFMTETA THADVILPAA
TSPEKHGTFT NTERRIQRVR PTATPPGTAR QDWEITQDLA NRLGYSWDYD HPREIMDEIS
DLVPIYGGVS YDRLESGDEH GLQWPCWDED HPGTPYLYDY EDGEFNFDDG MARFVPADGG
HPGELPDEEY PLTLTSGRVL YHWHTGQITR RVEGLMSHVG ESFVEINPST ADELGVADGE
YVQVESRRGD IVVKANVTDR VGEGTLFIPM HFAAGAVNKL TQESFDPHTG IPEYKVSSVR
VEPLGSEANP DVLRAPDAGA DSDGGTAVSD D
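
The gene and protein sequences are printed in blocks of 10 characters, so the whitespace has to be removed before any analysis. The Python sketch below shows one way to do that, compute a GC fraction, and translate a coding sequence. It is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons; this sketch ignores alternative start codons, which does not matter here because the gene begins with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAGCACTG ACGACACACT TCCAGGCGTT CCCGATATCG AAGACTCGCA ACCAAATACG"
print(translate(first_line))  # MSTDDTLPGVPDIEDSQPNT
```

The 20 residues produced from the first 60 bases match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_content` gives the fraction that the record reports, rounded, as GC content.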