Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2157 |
Symbol | |
ID | 8742759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2227686 |
End bp | 2229926 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646512740 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003403712 |
Protein GI | 284165433 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGCG GCTCCGAACG CGTCGCGGAG CGACTCGCAT CGATGACCCG GGAGGAGAAA CTGCGTCTCG TCAGCGGCCG CAGCGATCCC GCGGGGACGG CGACGGGCTA CCTGCCCGGC GTCGAGCGAC TCGACGTTCC GCCCTTCCGG CTGGTCGACG GCCCGCTCGG GATTCGCGCC GAGGGAGAGC GGGCGACGGC GTTCCCGGCG TCGATCGCCG TCGCGGCGAC GTTCGATCCC GACCTCGCGC GCGAGAAGGG TGCGGCGATG GCCCGCGAGG CGCGGGCGCT CGAGCAGGAC GCGCTGCTCG CGCCCGGCGC GAACCTGATC CGGGTCCCTC ACTGCGGGCG GAACTTCGAG TACTACTCGG AAGAGCCGCT GCTGGCCGCC GAGACGGCGG CGGGAGCGGT CGACGGGATC CAGGACGAGG ACGTCGTCGC GACGGTCAAA CACTACGTCG CGAACAACCA GGAGACCGAC CGCGTCCGGG TCAGCAGCGA GGTCGACGAA CGTACGCTCC GGAAGCTGTA TCTGCGGCCG TTCTGGGCGG CCGTCGAGGC CGGCGCCGGC TCCGTGATGA CCGCTTACAA CCGTGTCAAC GGAACGTACA TGAGCGACCA CGACCGCCTC GTCGGTGACG TGCTCAAAGG CGAGTGGGGG TTCGACGGCT ACGTCGTCTC CGACTGGTAC GGCACCGAGA GCACCGTCGG CGCCGCGAAC GCGGGGCTGG ATCTCGAGAT GCCCGGGGTC GCGATCGACG GCGGGTTCGG TGGCGACGGC GATGGGGACA AGGAGGACGG CTCGTTCGAC GCGGCCGACC TCGAGGGCGA GGCCACCGAG ATCATGGGCG GGCTCCCCGA CGGAACGAAG GGCGACCTGT TCGGCGACCC GCTGGCCGAC GCGATCGACG CCGGCGAGGT GCCGGCCGAG CGGCTGGACG ACATGGTTCG GCGGATTCTC GGACAACTCG AGCGGATCGG CCGCCTCGAG AGTGCCGACG ACGGCGATCG AGGTGCTGAC GACGCTCGAA GTGACGGCGA CGACGATGAC GGGGCCGGAG CCATCGATAC GCCCGCCCAC CGCGACCTCG CCGAGCGGAT CGCCGTCCGG GGGACCGTCC TGCTCGAGAA CGACGGCGTT CTCCCGCTCG AGGACGAGGC CGACGTCGCC GTCGTCGGTC CGAACGTCCA CGAGGCGAAA CTCGGGGGCG GCGGCTCCTC GGAGACGACG CCGTTTCGGT CGACGAGTCC CGCGGCGGGG CTGGAGTCGC GGGCCGACGG CGCGGTGACG GTCGCCCGCG GCTGCGAGCC GATTCCGGAT CTCTCGCTGT TCGACGCGCT GCCGTTCGTC GAGAGCGAGG AGAGCGACGA GCCGGCCGCG GACGCGAGGG TCGGCATCGA CGCCGACGAG CCCGCTATCG ACGCCGCGGT CCGGGCCGCC GGCGATGCCG ACGTCGCGGT CGTCTTCGTC CGCGACCGGA CGACCGAGGG GAAGGACCGG GACTCGCTGC GGTTGCCCGG CCGACAGGAC GAACTCGTCG AGGCAGTTGC CGACGCGGCC GCGGAGACGG TCGTCGTGGT CCGGTCGAGC GGCCCCGTCG AACTCCCGTG GCGCGAGGCG GTCGACGCCG TCCTCGAGGC CTGGTATCCC GGACAGGCTG ACGGCGCGGC GGTCGCGTCG GTGCTGTACG GCGACCGCGA TCCGTCCGGT CGCCTGCCGG TCACGTTCGC CCCGGAAGGG ACGTACCCGA CGGCCGATGA GCACCGATAC CCCGGGATCA ACGACGAAGC GCACTACGAG GAGGGACTGT TCGTCGGCTA CCGCCACTTC GACCGAAAAT CGGTCGACAC AGAGCCGACC TACCCGTTCG GTCACGGACA CTCCTACGCC GACTTCGCGT ACCGCGACGC GTCGGTCGTC GACGACCGGA CGGTCCGCCT CACCGTCGAG AACGTCGCCG ATCGGGACGG GCGCGAGGTG GTGCAGGCGT ACGTCCGGCC CCCCGAGTCG GCCGCGATCG AGCGGCCGAC TCGAGAACTC GCCGGCTTCG AGTCGATCGC CGTCCCCGCC GGTGAGATGC GGACGGTCGA CATCGACCTC GCGGATCGGG CGCTCGGACG GTACGACGCT GCCGACGGCT GGGTGATCGA TTCAGACACC TATCCGATCG AACTGGCTCG CTCGGCGCGA GACGTGCGAA AAACGGTCGC TCTGGAGGTT GCGGAGGAGA CGCTTTCGTG A
|
Protein sequence | MNGGSERVAE RLASMTREEK LRLVSGRSDP AGTATGYLPG VERLDVPPFR LVDGPLGIRA EGERATAFPA SIAVAATFDP DLAREKGAAM AREARALEQD ALLAPGANLI RVPHCGRNFE YYSEEPLLAA ETAAGAVDGI QDEDVVATVK HYVANNQETD RVRVSSEVDE RTLRKLYLRP FWAAVEAGAG SVMTAYNRVN GTYMSDHDRL VGDVLKGEWG FDGYVVSDWY GTESTVGAAN AGLDLEMPGV AIDGGFGGDG DGDKEDGSFD AADLEGEATE IMGGLPDGTK GDLFGDPLAD AIDAGEVPAE RLDDMVRRIL GQLERIGRLE SADDGDRGAD DARSDGDDDD GAGAIDTPAH RDLAERIAVR GTVLLENDGV LPLEDEADVA VVGPNVHEAK LGGGGSSETT PFRSTSPAAG LESRADGAVT VARGCEPIPD LSLFDALPFV ESEESDEPAA DARVGIDADE PAIDAAVRAA GDADVAVVFV RDRTTEGKDR DSLRLPGRQD ELVEAVADAA AETVVVVRSS GPVELPWREA VDAVLEAWYP GQADGAAVAS VLYGDRDPSG RLPVTFAPEG TYPTADEHRY PGINDEAHYE EGLFVGYRHF DRKSVDTEPT YPFGHGHSYA DFAYRDASVV DDRTVRLTVE NVADRDGREV VQAYVRPPES AAIERPTREL AGFESIAVPA GEMRTVDIDL ADRALGRYDA ADGWVIDSDT YPIELARSAR DVRKTVALEV AEETLS
|
| |