Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2741 |
Symbol | |
ID | 8743355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2816059 |
End bp | 2817936 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646513327 |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_003404287 |
Protein GI | 284166008 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGATG ACGACTACCC GCCGATCGAG GCCTACGGCG TCGTCGGGAA CCTCGAGACC TGTGCGCTCG TCGCCCCGGA CGGCTCGGTC GACTGGTTCC CGTTTCCCCA CCTCGAGTCG CCGAGCATTT TCGCCGCCGT CCTCGACGCC GAGCGGGGCG GGCGGTTCCG GATCGCCCCG ACCGACTCGT TCGAGACCGA CCGGCGGTAC GTCGACGACA CGAACGTCCT CGAGACGTCG TTCCGAACCG ACGGCGGCAC GGCGACGGTG ACGGATTTCC TGCCGCCGGC CGGGCGCACC GACCATCCGA AGAAGGTGCT CTACCGCAAA CTCGCCTGCG ACGAGGGACG CGTCGACCTC GCGGTCGATC TCGAACCGCG GTTCGATTAC GCCCGCGCGG AGACGACGGT GGAATCAGAA CAGCGAGGAG CGCTCGCTGA GGGACGAGAC GAGCGGACGC TGCTCGAGAG TCCGATCGAC CTCGAGATCG AGGACGATCG AGTAACCGGT GAGCTCTCGC TCGAGACCGG CGACGAGGAG TGGTTTCTAC TCCGGTGTAC GGGCGCAGAG GACGCGAATA CGGATCCCGA AGCCGCGCTC GAGGAGAGCC TCGAGTACTG GTCCGATTGG GCACACGACT GTGGTGCCGA CGGCGACTGC GTGTTCGAGG GGGCGTGGCA CGATCGGGTC GTCCGCTCCG AACTCGTCTT GAAACTCCTC ACCCACGCCG AGTCCGGGGC GATCGCGGCC GCGCCGACCA CCTCCTTACC CGAGGACATC GGCGGCGTTC GGAACTGGGA CTACCGGTTC AACTGGCTCC GCGACGCCGG ATTCACCGTT CAGGCGCTGA TGAACCTCGG GACCGCCGAC GAGGCGACCG CTTACTTCGA GTGGTTCATG GACCTCTGTC AGGCCGACGA CCCGGCGGCG ATCCAGCCGC TGTACGGCCT CCACGGCGAG TCGGACCTCG AGGAGCGAGA ACTCGAGCAC TTCGAGGGGT ATCGCGGGTC CAGCCCGGTC CGGATCGGCA ACGAGGCCGC CGACCAGCGT CAGCACGACA CCTACGGCGA ACTCCTGCTC GCCGTCGACG AGATGCACCG GCACGGTCGC GAACTGGACC CCGACGAGTG GGACCGAATC CGCGATATCG TCGACTACGT CCGCGAGATC TGGGACGAAC CGGACGCGGG CATCTGGGAG GTTCGCGGCG GGGACGAACA CTTCGTCTAC TCGAAGGTCA TGTGCTGGGT CGCTCTCGAT CGCGGGATCG CGCTCGCGAC CGACGGTGGC TACGACGCTC CGGTCGGGGA GTGGCGGGAG ACCTGCGAGC GGATCAGGGC TGACGTCCTC GAGAACGGGT ACGACGAGGA CGTCGGCGCG TTCGTCCAGT CCTACGGGTC GAACGCGCTC GACGCGACCG GACTCCTGCT CCCGATCGTC GGCTTCCTGC CCTTCGGCGA TGACCGCATT CGAGAAACGA TCGACGCGAT CGAGGAGACG TTAGTCGAGG ACGGGGTGTT CGTCCAGCGG TACGACGGCG ACGACGGCCT CCCGGGCGAG GAGGGCGCGT TCGTCCTCTG CTCGTGCTGG TTCGTCGACG CGCTCGCGCT CTCCGGGCGC GTCGCGGAGG CCCAATCCCG GTTCGAGACG CTGCTCGAGT ACCTGAACCC GCTCGGACTG ATCGCGGAAG AGATCGATCC CGAGAGCGGC GCCCATCTCG GGAACTTCCC GCAGGCGTTC AGCCACATCG GAATCGTCAA CAGCGCCCTC TATCTCGGCT ATCTGCGGGG CCACGAGGCG CCCGGCCCGG CGCCGATGGG GATTCGACTC GGCGAGCCGG TCGGGCTCCC AAGCGAGAGT TCCGATAGGA GATACTGA
|
Protein sequence | MRDDDYPPIE AYGVVGNLET CALVAPDGSV DWFPFPHLES PSIFAAVLDA ERGGRFRIAP TDSFETDRRY VDDTNVLETS FRTDGGTATV TDFLPPAGRT DHPKKVLYRK LACDEGRVDL AVDLEPRFDY ARAETTVESE QRGALAEGRD ERTLLESPID LEIEDDRVTG ELSLETGDEE WFLLRCTGAE DANTDPEAAL EESLEYWSDW AHDCGADGDC VFEGAWHDRV VRSELVLKLL THAESGAIAA APTTSLPEDI GGVRNWDYRF NWLRDAGFTV QALMNLGTAD EATAYFEWFM DLCQADDPAA IQPLYGLHGE SDLEERELEH FEGYRGSSPV RIGNEAADQR QHDTYGELLL AVDEMHRHGR ELDPDEWDRI RDIVDYVREI WDEPDAGIWE VRGGDEHFVY SKVMCWVALD RGIALATDGG YDAPVGEWRE TCERIRADVL ENGYDEDVGA FVQSYGSNAL DATGLLLPIV GFLPFGDDRI RETIDAIEET LVEDGVFVQR YDGDDGLPGE EGAFVLCSCW FVDALALSGR VAEAQSRFET LLEYLNPLGL IAEEIDPESG AHLGNFPQAF SHIGIVNSAL YLGYLRGHEA PGPAPMGIRL GEPVGLPSES SDRRY
|
| |