Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4746 |
Symbol | |
ID | 8745337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 352177 |
End bp | 355512 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646515245 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003406192 |
Protein GI | 284172810 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.879601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACTG ACGACACACT TCCAGGCGTT CCCGATATCG AAGACTCGCA ACCAAATACG CCGGTCTCCG CCACGTTCGA GACCGGCACC GCGAACGACC CCGCCGTCGG TACCGACGGC GACGAGCCGA CGACGGTGAC GATCAACGGC GAACCGGTGA CCGTCCCGCC GGGTTCGACC GTTATCGACG CGATGCAGGC CGTCAGCGAC GAGACGGTCA GCGTCGACCC CGGCGAGGAC GGCGTTGATG ACGACGCCGA TGTCCCTGCA CTCTGCTACT ACGACCGCGA CGGCGACTGC AGCGGCGAGA TCGGTCCGCG GAGCGAGTGC CGGACCTGCA TGGTCGAGAC CGACGAGCAC GGCCTCGTTC CCTCGTGTTC GTTCCCGGCC GAGGAGGGGC TCTCCGTGCA GACGGATACC CCCGATGCCG AAGAGACGCG CAGCGTCAAC CTCGATCTGG TCCTCTCGAA CCACAACCTC CGGTGTACGA CCTGCAACGG CAACGGCCGC TGTGAACTAC AGGACACCGC AATCAGCGAG GGCGTCGACC ATCCGCGCTA TGGCGTCTTT GACGATCGTG ACGAGTACGA ACCGCTCGAC GACAGTTCGT CGTTCATCCA GATCGACCGG AACAAGTGCA TCCTCTGTAA CCGGTGTGTC GAGGGATGTA ACGACGTCCA GGTCGAGGGC GTGCTCCGCA TCGAGGGCCA CGGCGAGGAT ACCCGGATCG GCTTCCAGTC CGACGCCGAG ACGATGGCCG AGTCCGACTG CGTTTCCTGC GGTCACTGCG CGACTGTCTG TCCGACAGGC GCGCTGACAG AGAAGGACAT CGGCGGCGCG GCGACGCTTC CGCTTCCAGG GTTCACCCAG CGCAACTCCA TCGGGACGGT TATCGAGCAT GAAGAAGTCG AAACGCTCGA CGACACGACG GCACCGAATC GCTCGCCCGA CCCCGGCGGA GAGAGTCGGA TCGGAGCCGG CCCCAGCGTG AGTCCAGACG ATCGAAGCGG CGTCGCACGG TTTATGTCAC AGAGCAGGCG CCGTGCGATG GACATCGCCA CCGAGTACGG TCACAAAGCC ATGTTGGCGG GCGAACACAC CGCCGAAAAC ATCGCGACGA AGGTGCTCCC CGAGGGGAGA TTGTTCGACG TCGCGTCGAC CGTGAGCGAC TACCGCCTCG GCAAGATCGA CAAGGAGGAG ACGACGTGCG GGTTCTGTGC TGTTGGCTGC CGATTCGAGA TGTGGGGCAA AGACGGCGAC GCGATCGGTG TCCAACCCGT CGAAGACCCC GCGAAGGCGC CCGCTAACAA CTTCTCGACC TGCGTGAAAG GGAAGTTCGG CCACGAGTTC GCCAACAGCG ACGAGCGGAT CACGGAACCG CTTGTTCGCA ACGAGGACGG CGAGTTCGAG ACGGCGTCCT GGGACGAGGC GCTCGATCGC GTCGCGAGCG GGCTGCGCGC GATCCAGGAC GAACACGGCA TCGACGCCGT CGGCTGTCTC GCGTCCTCGA AGGGGAGCAA CGAAGAGGCG TATCTCGTCC AGAAGTTCGC GCGACAGGTT CTCGGAACGA AAAACATCGA CAACTGTGCG CGGCTCTGTC ACTCGACGAC GGTGGCGGCG CTACAACAGA CGCTCGGGTA CGGCGCTATG ACCAACCGCA TCAACGAGGA CGTCGGCGAG GCCGACGCCT ACCTCATCAC CGGGTCGAAC ACGACGGAGT CGCACCCGGT TTTGGCGACC CGTATCAAGC AAAACGTCCG AGACGGCGCC GACCTGATCG TCTTCGATCC CCGTGAGGTC AACATCGCAG AGCACGCCGA CCAGTACACA CGGACCAGAC CTGGGTATGA CGTCGCCTGG ATCAACGGTC TCATCCGATA TATCATTGAG AACGACCTCC ACGATGAGGC GTTCATCGAA CGTAACACGA AGGGATTCGA GAAAGTCAAG GAGAAGGTAC AGGCGTTCAC ACCCGAGAAC GTCGAAGAGC TGGCTGGCGT TCCGCCTGCA GAGCTGAAGT CCGCCGCCGA GACGCTCGCC GAGGCCGATA CCGTCGTCTT CGGCTGGGCG ATGGGAATGA CCCAGTCCAG CCACGGCACG CAGAACCTCC TCGCGATGGC CGACCTCGCC CTCACGCTCG GCCAGGTCGG CAAGCCCGGG GCCGGGCTCT CACCTTTCCG GGGGCAGAAC AACGTGCAGG GCGGCGGCGG CGACATGGGA ACGCTTCCTG GAAGCCTGCC GGGCTATCAG GATCCAGCGG ACGCCGAGGT CGCCGAAAAG TTCGAAAAAG CGTGGGGCGA GCGCCCGCCC GAGGAGCCGG GGCTCAAGGT GCCGGAGATG CTCTCGGAGG CTCACGAAGG CAACCTGCGT GGAATGTACG TCGTCGGGGA GAATCCCGCG TTGTCCGAAC CCGACATCCA GCACGCCGAA GCGGCACTCG AGAAACTCGA GTTCCTCGTC GTTCAGGACA TCTTCATGAC GGAGACGGCG ACTCACGCGG ACGTGATCTT GCCCGCAGCG ACGTCGCCGG AGAAACACGG CACGTTCACT AACACCGAGC GCCGCATCCA GCGGGTGCGC CCAACTGCGA CACCGCCCGG GACGGCGCGC CAAGACTGGG AGATCACTCA GGACCTGGCC AACCGGCTCG GGTATAGCTG GGACTACGAC CACCCGCGGG AGATCATGGA CGAGATCAGC GACCTGGTCC CGATCTACGG CGGTGTCAGC TACGACCGCC TCGAGTCGGG CGACGAGCAC GGACTCCAGT GGCCCTGCTG GGACGAAGAC CACCCCGGAA CGCCATACCT CTACGATTAC GAGGACGGAG AGTTCAACTT CGACGACGGT ATGGCTCGCT TCGTGCCCGC GGACGGTGGA CACCCCGGCG AGCTGCCCGA CGAAGAGTAT CCGCTCACGC TCACCTCCGG GCGGGTGCTC TACCACTGGC ACACCGGCCA GATCACCCGG CGCGTCGAGG GGCTCATGAG CCACGTCGGC GAGAGCTTCG TCGAGATCAA CCCGTCGACG GCCGACGAAC TCGGCGTCGC CGACGGCGAG TACGTCCAGG TCGAGTCGCG CCGCGGAGAC ATCGTCGTCA AGGCGAACGT AACCGACCGC GTCGGCGAGG GAACGTTGTT CATCCCGATG CACTTCGCTG CCGGCGCGGT CAACAAGCTC ACTCAGGAGA GCTTTGACCC GCACACGGGA ATTCCCGAGT ACAAGGTGTC CAGCGTCCGC GTCGAGCCGC TCGGATCGGA GGCCAATCCG GACGTGTTGC GGGCGCCCGA CGCTGGAGCC GATAGCGACG GCGGAACCGC AGTCAGCGAC GACTGA
|
Protein sequence | MSTDDTLPGV PDIEDSQPNT PVSATFETGT ANDPAVGTDG DEPTTVTING EPVTVPPGST VIDAMQAVSD ETVSVDPGED GVDDDADVPA LCYYDRDGDC SGEIGPRSEC RTCMVETDEH GLVPSCSFPA EEGLSVQTDT PDAEETRSVN LDLVLSNHNL RCTTCNGNGR CELQDTAISE GVDHPRYGVF DDRDEYEPLD DSSSFIQIDR NKCILCNRCV EGCNDVQVEG VLRIEGHGED TRIGFQSDAE TMAESDCVSC GHCATVCPTG ALTEKDIGGA ATLPLPGFTQ RNSIGTVIEH EEVETLDDTT APNRSPDPGG ESRIGAGPSV SPDDRSGVAR FMSQSRRRAM DIATEYGHKA MLAGEHTAEN IATKVLPEGR LFDVASTVSD YRLGKIDKEE TTCGFCAVGC RFEMWGKDGD AIGVQPVEDP AKAPANNFST CVKGKFGHEF ANSDERITEP LVRNEDGEFE TASWDEALDR VASGLRAIQD EHGIDAVGCL ASSKGSNEEA YLVQKFARQV LGTKNIDNCA RLCHSTTVAA LQQTLGYGAM TNRINEDVGE ADAYLITGSN TTESHPVLAT RIKQNVRDGA DLIVFDPREV NIAEHADQYT RTRPGYDVAW INGLIRYIIE NDLHDEAFIE RNTKGFEKVK EKVQAFTPEN VEELAGVPPA ELKSAAETLA EADTVVFGWA MGMTQSSHGT QNLLAMADLA LTLGQVGKPG AGLSPFRGQN NVQGGGGDMG TLPGSLPGYQ DPADAEVAEK FEKAWGERPP EEPGLKVPEM LSEAHEGNLR GMYVVGENPA LSEPDIQHAE AALEKLEFLV VQDIFMTETA THADVILPAA TSPEKHGTFT NTERRIQRVR PTATPPGTAR QDWEITQDLA NRLGYSWDYD HPREIMDEIS DLVPIYGGVS YDRLESGDEH GLQWPCWDED HPGTPYLYDY EDGEFNFDDG MARFVPADGG HPGELPDEEY PLTLTSGRVL YHWHTGQITR RVEGLMSHVG ESFVEINPST ADELGVADGE YVQVESRRGD IVVKANVTDR VGEGTLFIPM HFAAGAVNKL TQESFDPHTG IPEYKVSSVR VEPLGSEANP DVLRAPDAGA DSDGGTAVSD D
|
| |