Gene Htur_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1038 
Symbol 
ID8741625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1073726 
End bp1077118 
Gene Length3393 bp 
Protein Length1130 aa 
Translation table11 
GC content67% 
IMG OID646511616 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_003402603 
Protein GI284164324 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCG AGGACCCGGC GAGCCACGTC GACGACGTGG AGACCGAGCG GCGGCCGACC 
GCCGTCGAGA AACTCCCCTC AGTACCGAAC GTCGCCGATC CGCGGCCGAG CACCCCGCTG
ACCGAACAGT TCGAGACCGG AACCGCCAAC GATCCGGACG TCCGCTCCGG AGACGGCAAG
GACGGAATGA CCCACCTCAC GGTCGACGGG ACGCCCGTCT CGGTGCCGCC GGGCTCGACG
ATCATCGACG CGATCGAGTC CGTCGAACCG GCCGACGAGT TGGCCGCGCT CTGTTACTAC
GACCGCGACA CCGAACAGGC CGACGAGATC GGTCCGCGCG GCGAGTGTCG GACCTGCACC
GTCCACACCG AGGAGCACGG GCTCGTGACG TCCTGCTCGC ACCCCGTCGA GGAGGGGATG
ACGGTCCGGA CCGACGAGGA CGACGCGGCC GAGGCCCGCG AGGTCAACCT CGACCTCCAG
CTGTCGGACC ACAACCTCCG CTGTACGACC TGCGGCCAGA ACGGCCGCTG CGAACTGCAG
GACACCTCCA TCGAGCAAGG CGTCGAGGAG CCCCGTTGGG GCGTCTTCGA GGACCGCGAT
CAGTACGAGC CGCTCGACGA CACCTCGCCG GCCATTCAGA TCGACCGCAA CAAGTGCATC
CTCTGTAACC GCTGCGTCGA GGCCTGCAAC GACGTGCAGG TCGAGGGCGT CCTCCGGATG
GAGGGCAACG GCCAGGACAC CCGCATCGCC TTCCAAAACG GCGAGGACAC CTTCGACGAG
TCCACCTGCG TCTCCTGTGG CCACTGCGCG ACCGTCTGTC CGACCGGCGC CCTGGTCGAG
CAGGGACTGA CCGACGCTGC GACGATTCCC CTGCCCGGCT TCACCCAGGA GAACTCGATC
GGGAAAGTCC TCGAGAGTCC GAAGGCCGAA ACGGCCGATC AGACCGAGGC GCCGAACCGG
GACCTCCCCT ACGACGTGGG CGGGCGAGGG AAGCCCGAGG AGGACCTGTC CGGCGTCGCC
CGCTTCATGT CGATCGCGAG AGCGCGCGCC GGGGACTCGA AGCGGCAGAT GAGCCATACC
CTGAAAGAGG TCGGCGACCG CGCGCTCGAG GAGTTCGAAC ACTTCTCCGA GGGGATCGCC
AGCGAGGCGA TGCCGGCCGG TCAGCTGTTC AACGTCGCGA CGACGATCGG CGACGCGCGC
CTCTCGCGGA TCGAGAAGGC CGAGACCACC TGTAACTACT GCGCGGTCGG TTGTCGGTTC
GAGCTCTACG GCAAGGACGG CGAGGTGCTC GGCGTCCGAC CCGCCGAGCC CGACTCGGCG
CCGGCGAACG ACTTCTCGAC CTGCGTGAAG GGGAAATTCG GCTACGATTA CGTCGATGCC
GACGACCGAC TCGAGAAACC GCTGATCCGG AAGGAGGACG CGCCGGACGG GCCGGTCGGC
CGTGAGGGCT TCCGCGAGGC CACGTGGAAG GAAGCGCTCG AGCGCGTCTA CGAGGGGCTC
TCGGAGGTCC GCGAGGAACA CGGCAGCGAG AGCCTCTCGG TCATCTCCTC GTCGAAGACG
ACCAACGAGG AGAACTTCCT CTGCCAGAAG TTCGCTCGGC AGGTGCTGGG GACGCCCCAC
GTCGACAACT GCGCGCGGCT CTGTCACTCC TCGACCGTGG CCGCGCTGCA GCAGACGGTC
GGTTACGGCG CGATGACCAA CCGGATCAAC GAGGACATCG CGGAGACCGA CTGCTATCTC
ATCACCGGTT CGAACACGAC CGAGTCCCAC CCCGTCCTCG CGACGCGGAT CAAGCAGAAC
GTCCGGGACG GCGCCGACCT CATCGTCATC GACCCCCGTG AGATGGGACT GGCCGAGCAC
GCCGACCAGT ACATCCGGAC GACGCCCGGC GAGGACGTGG CCTGGATGAA CGGGATGATG
CGGTACATCA TCGAGAACGA CCTCCACGAC GAGGAGTTCA TCGAGGAGCG GACGAAGCAC
TTCGAGAAGT TGAAAGAGAA GGTCGAGCCG TTCACGCCCG AGAAGGTCGA GGAACTGACG
GCCGTCCCCG CCGAGGAACT GAAGCAGGCC GCGGAGACGA TCGCCACCGC GGACACCTGC
ATCTTCGGCT GGGCGATGGG GCTGACCCAG CACAACACCG GCACGCGGAA CGTGATGTCG
ATCGCCAACC TCGCGCTGCT GACGGGCAAC CTCGGCAAGC CCGGGGCCGG CCTCTCGGCG
TTCCGCGGAC AGAACAACGT CCAGGGCGGG GGCGGCGACA TGGGACCGGC CCCGCACACG
CTCCCGGGAT ACCAGGACCT CGCCGACGAG GAGGTGCTGG ACAAGTTCGC CGACGCGTGG
GGAGAGCGCC CGCCCAACGA GATCGGGCTT CGGCTCCCGG AGATGTTCCA CGCGATTAAC
GACGACGAGC TCCGCGGCAT GTTCATCATG GGCGAGAATC CCGTCCTCTC GGAACCGGAC
GTCGACAACG CCGAGGAGGG GCTCGAGAAT ATCGACTTCC TCGCCATGCA GGACATCTTC
CTGACCGAGT CGGCCGAGTA CGCCGACGTC GTCCTCCCGG CCGCCTCCGC CGCCGAGAAG
TCCGGCACGT TCACGAACAC CGAACGGCGC ATCCAGCGGG TCCGTCCCGC GGTCGACTCG
CCGGGGAAGG CGAAACCCGA CCAGGAGATC CTCATCCAGC TCGCTCGACG GTTCGGCTAC
GACTGGGACT ACGACGGTCC GGCCGAGGTG ATGGAGGAGA TCAACGACCT CGTCCCCATC
TACGGCGGCG TCACCTACGA GCGACTCGAG GAGGAGACCA AGGGCATCCA GTGGCCCTGC
TTCGACGAGG ACCACCCCGG GACCCCCTAC CTCTACGAGG ACGAGTTCAA CTTCGAGGAC
GGGAAGGCCC GCTTCGTCCC CGCCGACTAC GCCAAGCCGC CGGACATGCC CGACGAGGAG
TACCCGATCA CGCTCTCCTC GGGGCGGGTC CTGTACCACT GGCACACCGG CACGATGACC
CGCCGGGTCG GGACGCTCAT GAACCACGTC CCCGAGAGCT TCGTGACGAT CCACCCCGAG
ATGGCCGACC AGTTGGGCAT CGACGATCAG GAGTACGTCC GCGTCCAGTC CCGGCAGGGC
GAGATCGTCG TGAAGGCCAA CGTCGAGGAC ACCTCCGATC CCGGCGTCGT CTTCATACCG
ATGCACTTCC CGCAGGGGGC GATCAACAAG CTCACCCAGC ACGAACTCGA CCCGACGTCG
TTCATCCCGC AGTACAAGGT GACGAGCGTC CGCATCACGC CGCTCGACGT TCCCCCCGAG
GAGGCGGCCA ACGTCGTCTC CCCGACGCCC GGCCAGCTCG AGGGCCAGGA CGGCGACCCC
GAGGACGTCG GCGGTCGGCG GGCTGACGAC TGA
 
Protein sequence
MSSEDPASHV DDVETERRPT AVEKLPSVPN VADPRPSTPL TEQFETGTAN DPDVRSGDGK 
DGMTHLTVDG TPVSVPPGST IIDAIESVEP ADELAALCYY DRDTEQADEI GPRGECRTCT
VHTEEHGLVT SCSHPVEEGM TVRTDEDDAA EAREVNLDLQ LSDHNLRCTT CGQNGRCELQ
DTSIEQGVEE PRWGVFEDRD QYEPLDDTSP AIQIDRNKCI LCNRCVEACN DVQVEGVLRM
EGNGQDTRIA FQNGEDTFDE STCVSCGHCA TVCPTGALVE QGLTDAATIP LPGFTQENSI
GKVLESPKAE TADQTEAPNR DLPYDVGGRG KPEEDLSGVA RFMSIARARA GDSKRQMSHT
LKEVGDRALE EFEHFSEGIA SEAMPAGQLF NVATTIGDAR LSRIEKAETT CNYCAVGCRF
ELYGKDGEVL GVRPAEPDSA PANDFSTCVK GKFGYDYVDA DDRLEKPLIR KEDAPDGPVG
REGFREATWK EALERVYEGL SEVREEHGSE SLSVISSSKT TNEENFLCQK FARQVLGTPH
VDNCARLCHS STVAALQQTV GYGAMTNRIN EDIAETDCYL ITGSNTTESH PVLATRIKQN
VRDGADLIVI DPREMGLAEH ADQYIRTTPG EDVAWMNGMM RYIIENDLHD EEFIEERTKH
FEKLKEKVEP FTPEKVEELT AVPAEELKQA AETIATADTC IFGWAMGLTQ HNTGTRNVMS
IANLALLTGN LGKPGAGLSA FRGQNNVQGG GGDMGPAPHT LPGYQDLADE EVLDKFADAW
GERPPNEIGL RLPEMFHAIN DDELRGMFIM GENPVLSEPD VDNAEEGLEN IDFLAMQDIF
LTESAEYADV VLPAASAAEK SGTFTNTERR IQRVRPAVDS PGKAKPDQEI LIQLARRFGY
DWDYDGPAEV MEEINDLVPI YGGVTYERLE EETKGIQWPC FDEDHPGTPY LYEDEFNFED
GKARFVPADY AKPPDMPDEE YPITLSSGRV LYHWHTGTMT RRVGTLMNHV PESFVTIHPE
MADQLGIDDQ EYVRVQSRQG EIVVKANVED TSDPGVVFIP MHFPQGAINK LTQHELDPTS
FIPQYKVTSV RITPLDVPPE EAANVVSPTP GQLEGQDGDP EDVGGRRADD