Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4094 |
Symbol | |
ID | 8744722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 353742 |
End bp | 355964 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514654 |
Product | Beta-fructofuranosidase |
Protein accession | YP_003405601 |
Protein GI | 284167323 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.481915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCTG AGGACTCAGA GTCGAAAGAC TCTCCCAGCC TTGGGTTTCT GACCGCGGGT GAACTGACGG ACGAACAGTG CGCAGCGCTT CGAGCTGTAC GGGACATTGC GGGCGTGGAC TGTGTTTCAC TCGAGGATCT ACCGACGGAA CAATCGCTGT CGTCGTTCGA CGCGCTCTGG TGGCACCGCG ATCTCTCTTT CGCGGACGAA ACGTTCCCTT CCGGTGCGGT CGACGCACTG ATGGCGTATC TCGACGGGGG CGGCGGACTT CTGTTGAGTC TACACGGAGT CACGACCGTC AAGGAGCTGG GGATCGATCC GCAGCCGCCG GACCTCATCG ACGAAACGTA CAAACCAGTC CACAAGTGGG GTCAGCGCCC GGCGGGGTTC CTGATCGGTT CGCGGTTCGC CGACGCCGAC CTGTTCGAGT CATGTGACGG AAACCGCGTT CACACGCAGC CGTCCCGTAG CGAATCGACA CCTCGAGTGG CCTACGGACG CCGGATCCCG AAGCGTGGAA CGGTTCTCGC GAGCGCCGCT GTCGGTGAGA CGGACGTCCC CCGAGAGAAC AGCGTCATCG GTTGGCAAGT CGGGGCGGGA ACCGTCGTCG GTATCGGCCA ACACTTCACG TTCGATGGCG TACCGTCGGA GTACCTGACG ACACTGCGGA CGATTCTTCG CGGGACGATC AAGCACATCG CCCGCGGGGA GGCCGCGATG TACACTCGAC CGCGGAGTGG CGCGGAACTG ACACAGGTCC GGACAGAGAT CGACGACGAT CCTCACCGCC CCTCGTACCA TTTCACGCCG CCGGCAAACT GGATGAACGA TCCGAACGGA CTAGTCAAAT GGAACGGCGA GTATCACCTG TTCTACCAGT ACAACCCGGC CGGACCGTAC CACGGATCGA TCCACTGGGG GCATGCGGTG AGCGACGACC TCGTTCACTG GGAGGATAGG CCCATCGCGC TAGAGCCCGA CACAGGGGGA CCGGACCGGC ACGGGTGCTG GTCGGGCTGT ACGGTACTCG ATGACGACGT CCCCACGTTC GTCTACACCG GCGGCGACGG ACATGATCAG CTCCCCTGTC TCGCTCGCGC GGCCGATGAC GACCTCGATA CCTGGCAGAA GTCCCCACAG AATCCGATCA TCACCGACCC TCCCGAGCGG CCGCAGATCC TCGCGAACGA CGACTGGAAC GCCGAGTTTC GCGACCACGA CGTCTGGAAG GAGGACGGAA CCTGGTATCA CCTCATCGGC TCCGGGACCG AGGATGCCGG TGGAACGGCG CTGTTGTATC AGTCGGACGA CCTTCTCGAT TGGGCGTACG TCGGTCCAAT CCTAGTCGGA GACCGCGACG AAGACGGGCC GATCTGGGAG TGCCCGGAAC TACTCGACTT CGGTGACCTG CAGCTGCTGC AGGTCTCGAA CTACGACAAG GTCGCGTACT TTCTCGGAAC GTTCGACGGT CAGACGTTCG ACCGAAAGGA CTCGGGGACG CTCGATCACG GGAACTACTA CGCCGCCCAA TCGATCCCCG ACGGCGACGG GCGGTACCTC TCGTGGGGCT GGATCCGCGA GGATCGCAGT GCGTCCGCGC AGTGGGACGC GGGGTGGTCT GGTGCCATGT CTGTCCCTCG TTCGCTCTCC CTTTCGTCCG ACGGTACTCT TGTAGTGCAG CCGGCCGAGG AACTCACTCG TCTTCGGGGA GAACGCGAGA CGATCGACCG CCAGACGCTC TCCCCGGACG ATCCGTCACC GTGTGACGGC GTCTCCGGCG ACGCGCTGGA AATTCAACTC GAACTGGAAC TCGACGGTGC TGACGCGTTC GAGCTCGTCG TCGCGTGCTC CGATGACGGC GAGGAACGAA CATCGATCCG GTACACGGAC GGCAATCGTC TGATCGTCGA TCGTGAACAC TCCAGCCTCT CGGACGCCGC AAACTCCGAT CCGCAGTCAA TCGACGAGGT ACCCCAGTCC GACGACGGCA TCGTTCACCT GCACGTGCTC ATCGACGCTT CGGTCATCGA AGTGTTCGTC AACGATCGAA CCAGTGTGAG CAGCCGAATC TACCCGACTC GGGCCGATAG CACCGGCGTC TCGCTCGAGG CGGTCGGCGG AGCCGTCGAA CTCTACTCTG CAGACATGTG GTCGCTGGAG TCCGCGTTCA CAGACGGCTC GAGCGCCGAG GCGGCCTCAA ACCCCGAAAT AAACCTTGAT TGA
|
Protein sequence | MQAEDSESKD SPSLGFLTAG ELTDEQCAAL RAVRDIAGVD CVSLEDLPTE QSLSSFDALW WHRDLSFADE TFPSGAVDAL MAYLDGGGGL LLSLHGVTTV KELGIDPQPP DLIDETYKPV HKWGQRPAGF LIGSRFADAD LFESCDGNRV HTQPSRSEST PRVAYGRRIP KRGTVLASAA VGETDVPREN SVIGWQVGAG TVVGIGQHFT FDGVPSEYLT TLRTILRGTI KHIARGEAAM YTRPRSGAEL TQVRTEIDDD PHRPSYHFTP PANWMNDPNG LVKWNGEYHL FYQYNPAGPY HGSIHWGHAV SDDLVHWEDR PIALEPDTGG PDRHGCWSGC TVLDDDVPTF VYTGGDGHDQ LPCLARAADD DLDTWQKSPQ NPIITDPPER PQILANDDWN AEFRDHDVWK EDGTWYHLIG SGTEDAGGTA LLYQSDDLLD WAYVGPILVG DRDEDGPIWE CPELLDFGDL QLLQVSNYDK VAYFLGTFDG QTFDRKDSGT LDHGNYYAAQ SIPDGDGRYL SWGWIREDRS ASAQWDAGWS GAMSVPRSLS LSSDGTLVVQ PAEELTRLRG ERETIDRQTL SPDDPSPCDG VSGDALEIQL ELELDGADAF ELVVACSDDG EERTSIRYTD GNRLIVDREH SSLSDAANSD PQSIDEVPQS DDGIVHLHVL IDASVIEVFV NDRTSVSSRI YPTRADSTGV SLEAVGGAVE LYSADMWSLE SAFTDGSSAE AASNPEINLD
|
| |