Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2304 |
Symbol | |
ID | 8742910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2371598 |
End bp | 2372929 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646512889 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_003403857 |
Protein GI | 284165578 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCTCTC CGACGCGCCG ACGCCTGCTG GCGGCGACCG GAGCCGCGGT TCCCGCGCTC GCGGGCTGTT TCACGGAGTC GGGATCCGAC GCGACGGAAC TCGCGACGCC CGAGTCCGTC CCCGCCGACG ACTGGGTCAA ACCCGACTGG CGGCCGGCCG ACGCCGTCCC GAGCGAGGAC GACGTCGCGG CGACGACGGT CGTCTCCGAT CTCGCGATTC CGTGGGATCT CACCGTCGCC GCCGGCGACG CCTTCGTCAC CGAACGCGAC GGCGGCGTTC GTCGATTCGA TGCGGACACG CTGGCCGAAG ACGCCGATCT GGGACCCAAC GACGGCGAGA CACTCCTCGA GAGCGCGTCC CTCCCCGATC GCGCGTCGCC CGGCGAGGGC GGGACCCTCG GCGTCGCGGC CCACCCCGAC TATCCCGACA CCCCCGACCT GTTCGTCTAC TACACGGCCG ACGACGGGGG CGTCTCGAAC CGGGTCGTCC GCTACGACCT CGAGGCCGAC GCCCTCGAGA CGATCCTCGA GGGAATTCCG GGGTCGTCGG TCCACAACGG CGGGCGGATC GCGTTCGGCC CCGACGACCA CCTCTGGGTG CTGACGGGCG ACGCGAGGGA GCCGGCGCTG TCGCAGGATC CCGGGTCCCG CGCGGGTGCC GTCTTGCGGG TGACGCCCGA CGGCGAGCCC CACCCCGAGA ATCCCGACTG GGGCGACGAC GGCGACCGAC GCACGTACAC GCTCGGCCAC CGCAACCCGC AGGGACTCGA CTTCACGCCG CAGGGAACGC CGATCCTCGC CGAACACGGG CCGGGCGCGC GGGACGAACT CTCGATCCTC CGGCCGGGCG GCAACTACGG CTGGGATATC GTCCGCGGCG GGCCGGACGA CCCCGAGTAC GGGAGCTACG ACGAGTACGA GGCGGCGACG CCGCCGGTCG TCAACACCGG CCCCAAAACG ACGTGGGCGC CCTCCGGACT GGCGTTCTAT GACGACGACG CGATCGGCCC GTGGGAGAAT ACCGTCCTCG TCTGCGGGCT CACCTCGAGC GCGCTGTCCG TCGTCGGGCT CACGCCCCGA AGCGACTCGG ACGGCGACGA CGAGGCGAGT TCTGACGACA CCGACGGCGT CCGGTACGAC GCCGACTGGC TCGACGATCG CGTTACGGCG ACGGTCCATC GGCTGTTCGC CGACGAGTGG GGTCGCCTTC GACACGTCGA GCCCGGGCCC GACGGCTCGC TGTACCTGCT CACGTCGAAC CGGGACGGTC GCGCGGACGG CCCGTTTCCC CGGACGAACG ACGACCGGAT CGTCAGGCTG GACCCGCGGT AG
|
Protein sequence | MCSPTRRRLL AATGAAVPAL AGCFTESGSD ATELATPESV PADDWVKPDW RPADAVPSED DVAATTVVSD LAIPWDLTVA AGDAFVTERD GGVRRFDADT LAEDADLGPN DGETLLESAS LPDRASPGEG GTLGVAAHPD YPDTPDLFVY YTADDGGVSN RVVRYDLEAD ALETILEGIP GSSVHNGGRI AFGPDDHLWV LTGDAREPAL SQDPGSRAGA VLRVTPDGEP HPENPDWGDD GDRRTYTLGH RNPQGLDFTP QGTPILAEHG PGARDELSIL RPGGNYGWDI VRGGPDDPEY GSYDEYEAAT PPVVNTGPKT TWAPSGLAFY DDDAIGPWEN TVLVCGLTSS ALSVVGLTPR SDSDGDDEAS SDDTDGVRYD ADWLDDRVTA TVHRLFADEW GRLRHVEPGP DGSLYLLTSN RDGRADGPFP RTNDDRIVRL DPR
|
| |