Gene Htur_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1991 
Symbol 
ID8742590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2059620 
End bp2061920 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content67% 
IMG OID646512573 
Productblue (type 1) copper domain protein 
Protein accessionYP_003403548 
Protein GI284165269 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGC GATCTGACGA GCGGCCGAAT CCGGTACCCG AATCAGCGAC CGACTATACC 
GCGACGTCCC GTCGGCACCT GTTGAAGGCC GCCGCGGCCG CCGGCGGCGT CGTCGCGCTG
GGCGATCTCG CGGCCGCTCA GGGGGTAGAG ACGATCGAAC TGGGCGGCGA GACCAGCGGC
TGGCAGGGCG TCGCGCCCGA CGACATCGCG GGCGAGACGA ACCCCACGCT GGAACTCGAG
GCGGGAACGA CCTACGAACT CACGTGGGAG AACCTCGACG GGCAACCGCA CAACTTCGTC
ATCGAGAGCG AGGGCGGAGA GCAACTCGAG CGAACCGACC TGCTGATGGA ACAGGGCGAG
ACGCAGACCC TCGAGTTCGA GGCGACGAGC GAGATGGCGG AGTACTACTG CGAACCGCAC
TCGGCGACGA TGCGCGGGGA GATTTCGGTC GGCGACGGTG GTGGGGGCGG GGCGGAGCAA
GACGAAGCGG CCGAAGAAGA GCCCGAGGCC TTCTTCGATC CCGGCGCGGA GATCGGCGTA
CGAACGCTCG CGGAGGGAAT GACGGCGCCG ACGGACATGG CGGTCGCCGA CGAGGACGAG
GAGCGGTACT TCGTCGCCGA CCAGACGGGC GAGCTCTGGG TCGTTACGGG AGACGGCCTG
CAGGACGAAC CGTTCCTCGA CGTCAGCGAC CGGCTGGTCG AACTCGGCAC GTTCGAGGGC
GACTACGCCG ATCCGAATCA GGACTACGAC GAGCGGGGAC TGCTCGGTGT CGAGTTCCAT
CCCGAGTTCG CGGAGAACGG CCGCTTCTTC GTCCACTACA GCGCGCCGCC GAACGACGAG
ACGCCGGAGG GCTGGAGCCA CGTCGAGGTC GTCTCCGAGT TGCAGGCTAC CGAGGACCTG
AGCGCCGGCG ACCCCGACTC GGAACGGGTT CTGATGGAGT TCCAGAAGCC CCAGTACAAC
CACGACGCCG GACCGATGGC GTTCGGCCCC GACGGCTACC TGTACGTCCC GATGGGCGAC
GGCGGCGGTG CCAACGACAA CATGGAAGGC CACGTCGAGG ACTGGTACGA CGGGAACGAG
GGCGGGAACG GACAGGACGT CAGCGAGAAC CTCCTCGGAA GTGTCCTCAG AGTCGACGTC
GATAGCGAGA TGTCGGAGAC GTCTCGAGAC GGAAGCGGCG ACGCCGCCGA CGAGGAGGGC
GAGGACCGAC CGTACGCCAT CCCGGAGGAC AACCCGCTCG TCGATTCGGA CGAGGGACTC
GACGAACACT ACGCGTGGGG CTTCCGAAAC CCCTTCGGGA TCTCCTTCGA CAGCGACGGA
CGGCTGTTCG TCTCCGACGC CGGCCAGGAC CTCTTCGAGG AGGCGAACCT CGTCGAGGCT
GGCGGCAACT ACGGTTGGAA CGTCAAGGAG GGGACCCACT GCTTCAGCAC TGAGAGTCCC
AGCCAGCCGC CGGAGGACTG CCCCGACTCG GCGCCCGACG AAGCGCCGTA TGACGGGCAG
GAACTGCAAG ACCCCATCGT CGAGTATCCC CACGTGTACC AGGAACAGGT GGTCGGCATC
ACGATCATCG GCGGCCACGT CTACGAGGCC GGCGATATCG GGGACCTCGA CGGGAAGTAC
GTCTTCGGCG ACTGGACGGC CGATCCGGCG CGACAGTCCC CGCAGGGGCG AATCCTCGCC
GCTTCGGAGC CGAGTGACGG GGCCGGAGGG ATGACCGGCG ACGGCGGTGG CAACCAGACC
GAAGGGATGA GTCCCGACGA CCAGGAGATG CCGGAGAACG CGACGCCCGA CGAAGAGGGT
ATCGAAGGCG AAGGCTTCGA GAACGAGACG AACGCGACCA ACGCGACCAA CGAGACGCCC
GACGACGGCG CGGCGGACGT CGGCGGTGGC GGCCAAGAGC AGGTCGTTCC GCGAGACGAA
CTCTGGGATA TGGAGGAACT CCAGCTCGCC GGCTCCGAAG ACGGCTCGTT TCCGTACTTC
GTCCGGCAGT TCGGTCAGGA CCTCGACGGT AACGTGTACG TGCTCGCAAA TCAGGTGGGC
GTTCCGGAGG GCGACACGGG CACGGTCTTC GAGATCGTTC CACCGGGCGA GGGCGAGTCG
CTGGAACCGT TCGAAGCGGA CGAAGCGGTC GAACCCGAGG AGCAAGAGAC GGACGAGAAC
GCGACCGAAG ACACTCAGAA CGAATCGATC GCCGAGAACG CCACGGACAA CGAGAGCGTC
GCGGACGAGA ACGTCACTGA CGGTGAGAAC GCGACCGACA ACGAGACGCT GAGCGAGAAC
GTGACCGCCG ACGGCGCCTG A
 
Protein sequence
MSERSDERPN PVPESATDYT ATSRRHLLKA AAAAGGVVAL GDLAAAQGVE TIELGGETSG 
WQGVAPDDIA GETNPTLELE AGTTYELTWE NLDGQPHNFV IESEGGEQLE RTDLLMEQGE
TQTLEFEATS EMAEYYCEPH SATMRGEISV GDGGGGGAEQ DEAAEEEPEA FFDPGAEIGV
RTLAEGMTAP TDMAVADEDE ERYFVADQTG ELWVVTGDGL QDEPFLDVSD RLVELGTFEG
DYADPNQDYD ERGLLGVEFH PEFAENGRFF VHYSAPPNDE TPEGWSHVEV VSELQATEDL
SAGDPDSERV LMEFQKPQYN HDAGPMAFGP DGYLYVPMGD GGGANDNMEG HVEDWYDGNE
GGNGQDVSEN LLGSVLRVDV DSEMSETSRD GSGDAADEEG EDRPYAIPED NPLVDSDEGL
DEHYAWGFRN PFGISFDSDG RLFVSDAGQD LFEEANLVEA GGNYGWNVKE GTHCFSTESP
SQPPEDCPDS APDEAPYDGQ ELQDPIVEYP HVYQEQVVGI TIIGGHVYEA GDIGDLDGKY
VFGDWTADPA RQSPQGRILA ASEPSDGAGG MTGDGGGNQT EGMSPDDQEM PENATPDEEG
IEGEGFENET NATNATNETP DDGAADVGGG GQEQVVPRDE LWDMEELQLA GSEDGSFPYF
VRQFGQDLDG NVYVLANQVG VPEGDTGTVF EIVPPGEGES LEPFEADEAV EPEEQETDEN
ATEDTQNESI AENATDNESV ADENVTDGEN ATDNETLSEN VTADGA