Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1991 |
Symbol | |
ID | 8742590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2059620 |
End bp | 2061920 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646512573 |
Product | blue (type 1) copper domain protein |
Protein accession | YP_003403548 |
Protein GI | 284165269 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC GATCTGACGA GCGGCCGAAT CCGGTACCCG AATCAGCGAC CGACTATACC GCGACGTCCC GTCGGCACCT GTTGAAGGCC GCCGCGGCCG CCGGCGGCGT CGTCGCGCTG GGCGATCTCG CGGCCGCTCA GGGGGTAGAG ACGATCGAAC TGGGCGGCGA GACCAGCGGC TGGCAGGGCG TCGCGCCCGA CGACATCGCG GGCGAGACGA ACCCCACGCT GGAACTCGAG GCGGGAACGA CCTACGAACT CACGTGGGAG AACCTCGACG GGCAACCGCA CAACTTCGTC ATCGAGAGCG AGGGCGGAGA GCAACTCGAG CGAACCGACC TGCTGATGGA ACAGGGCGAG ACGCAGACCC TCGAGTTCGA GGCGACGAGC GAGATGGCGG AGTACTACTG CGAACCGCAC TCGGCGACGA TGCGCGGGGA GATTTCGGTC GGCGACGGTG GTGGGGGCGG GGCGGAGCAA GACGAAGCGG CCGAAGAAGA GCCCGAGGCC TTCTTCGATC CCGGCGCGGA GATCGGCGTA CGAACGCTCG CGGAGGGAAT GACGGCGCCG ACGGACATGG CGGTCGCCGA CGAGGACGAG GAGCGGTACT TCGTCGCCGA CCAGACGGGC GAGCTCTGGG TCGTTACGGG AGACGGCCTG CAGGACGAAC CGTTCCTCGA CGTCAGCGAC CGGCTGGTCG AACTCGGCAC GTTCGAGGGC GACTACGCCG ATCCGAATCA GGACTACGAC GAGCGGGGAC TGCTCGGTGT CGAGTTCCAT CCCGAGTTCG CGGAGAACGG CCGCTTCTTC GTCCACTACA GCGCGCCGCC GAACGACGAG ACGCCGGAGG GCTGGAGCCA CGTCGAGGTC GTCTCCGAGT TGCAGGCTAC CGAGGACCTG AGCGCCGGCG ACCCCGACTC GGAACGGGTT CTGATGGAGT TCCAGAAGCC CCAGTACAAC CACGACGCCG GACCGATGGC GTTCGGCCCC GACGGCTACC TGTACGTCCC GATGGGCGAC GGCGGCGGTG CCAACGACAA CATGGAAGGC CACGTCGAGG ACTGGTACGA CGGGAACGAG GGCGGGAACG GACAGGACGT CAGCGAGAAC CTCCTCGGAA GTGTCCTCAG AGTCGACGTC GATAGCGAGA TGTCGGAGAC GTCTCGAGAC GGAAGCGGCG ACGCCGCCGA CGAGGAGGGC GAGGACCGAC CGTACGCCAT CCCGGAGGAC AACCCGCTCG TCGATTCGGA CGAGGGACTC GACGAACACT ACGCGTGGGG CTTCCGAAAC CCCTTCGGGA TCTCCTTCGA CAGCGACGGA CGGCTGTTCG TCTCCGACGC CGGCCAGGAC CTCTTCGAGG AGGCGAACCT CGTCGAGGCT GGCGGCAACT ACGGTTGGAA CGTCAAGGAG GGGACCCACT GCTTCAGCAC TGAGAGTCCC AGCCAGCCGC CGGAGGACTG CCCCGACTCG GCGCCCGACG AAGCGCCGTA TGACGGGCAG GAACTGCAAG ACCCCATCGT CGAGTATCCC CACGTGTACC AGGAACAGGT GGTCGGCATC ACGATCATCG GCGGCCACGT CTACGAGGCC GGCGATATCG GGGACCTCGA CGGGAAGTAC GTCTTCGGCG ACTGGACGGC CGATCCGGCG CGACAGTCCC CGCAGGGGCG AATCCTCGCC GCTTCGGAGC CGAGTGACGG GGCCGGAGGG ATGACCGGCG ACGGCGGTGG CAACCAGACC GAAGGGATGA GTCCCGACGA CCAGGAGATG CCGGAGAACG CGACGCCCGA CGAAGAGGGT ATCGAAGGCG AAGGCTTCGA GAACGAGACG AACGCGACCA ACGCGACCAA CGAGACGCCC GACGACGGCG CGGCGGACGT CGGCGGTGGC GGCCAAGAGC AGGTCGTTCC GCGAGACGAA CTCTGGGATA TGGAGGAACT CCAGCTCGCC GGCTCCGAAG ACGGCTCGTT TCCGTACTTC GTCCGGCAGT TCGGTCAGGA CCTCGACGGT AACGTGTACG TGCTCGCAAA TCAGGTGGGC GTTCCGGAGG GCGACACGGG CACGGTCTTC GAGATCGTTC CACCGGGCGA GGGCGAGTCG CTGGAACCGT TCGAAGCGGA CGAAGCGGTC GAACCCGAGG AGCAAGAGAC GGACGAGAAC GCGACCGAAG ACACTCAGAA CGAATCGATC GCCGAGAACG CCACGGACAA CGAGAGCGTC GCGGACGAGA ACGTCACTGA CGGTGAGAAC GCGACCGACA ACGAGACGCT GAGCGAGAAC GTGACCGCCG ACGGCGCCTG A
|
Protein sequence | MSERSDERPN PVPESATDYT ATSRRHLLKA AAAAGGVVAL GDLAAAQGVE TIELGGETSG WQGVAPDDIA GETNPTLELE AGTTYELTWE NLDGQPHNFV IESEGGEQLE RTDLLMEQGE TQTLEFEATS EMAEYYCEPH SATMRGEISV GDGGGGGAEQ DEAAEEEPEA FFDPGAEIGV RTLAEGMTAP TDMAVADEDE ERYFVADQTG ELWVVTGDGL QDEPFLDVSD RLVELGTFEG DYADPNQDYD ERGLLGVEFH PEFAENGRFF VHYSAPPNDE TPEGWSHVEV VSELQATEDL SAGDPDSERV LMEFQKPQYN HDAGPMAFGP DGYLYVPMGD GGGANDNMEG HVEDWYDGNE GGNGQDVSEN LLGSVLRVDV DSEMSETSRD GSGDAADEEG EDRPYAIPED NPLVDSDEGL DEHYAWGFRN PFGISFDSDG RLFVSDAGQD LFEEANLVEA GGNYGWNVKE GTHCFSTESP SQPPEDCPDS APDEAPYDGQ ELQDPIVEYP HVYQEQVVGI TIIGGHVYEA GDIGDLDGKY VFGDWTADPA RQSPQGRILA ASEPSDGAGG MTGDGGGNQT EGMSPDDQEM PENATPDEEG IEGEGFENET NATNATNETP DDGAADVGGG GQEQVVPRDE LWDMEELQLA GSEDGSFPYF VRQFGQDLDG NVYVLANQVG VPEGDTGTVF EIVPPGEGES LEPFEADEAV EPEEQETDEN ATEDTQNESI AENATDNESV ADENVTDGEN ATDNETLSEN VTADGA
|
| |