Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4655 |
Symbol | |
ID | 8745258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 236487 |
End bp | 237908 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646515166 |
Product | Glycosyl hydrolase family 32 domain protein |
Protein accession | YP_003406113 |
Protein GI | 284172731 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.107095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAGA AACTACCTTT GAACCCGAAT CAGTGGCGTC CCGAATATCA TTTCTCGCCA TCCGAAGGAT GGCTGAACGA TCCCAACGGA CTGGTGTATT GCGACGGCGT GTACCACATG TTCTACCAAG CTGGCGAGCA CCGACGGCGG TGGGACCACG CCACGAGCGA AGACCTTCTC TCGTGGTCCG AACAAGGAAC AAAGATCCAG GATACCCAGT CCGTTCAGTC GTTCTCCGGC GGAGCAGTCG TCGACCGTGG CGATACTGCA GACTTCGGCG AAAATACTCT CGTGTTTACG TACACAGGCC ACCACGATGA CGGGACTGAG GATCAGAGAC TGGCGTACAG CACGGACATC GGGGATACTG TCCGTACGTA CAACGCGAAT CCGATAATCG AGAGCGCGAC CGGGGACTTC CGAGATCCCA ACGCGTTTTG GTACGAACCG GACGCGAGCT GGCGTATGGT CGTCAGTCGC GTTGAGGGAA CTAGAGACCG ACCCGCGGGA ATCGAGGTCT ACAGTTCTGA TAACCTCCGT GACTGGGCGT ACGAGAGCAC CTACCGGGAC ACCGACGGAG AAGAGTGGGA GTGTCCGAGC CTGTTCGAGT TACCGGTGGA GAGAACGGCA GAGTCGAGGT GGGTGATGAT CGTCTCGTCA ATCGAGAACT GTTCCGTCGA GTACCACATC GGACATTTCA ACGGGACGGA GTTCGTCGCT GAGGACGTGA TTCTTGCGGA CTATGGGTAT GACTTCTACG CCGCCCAGAA CTGGGAAAAT CCCCCAAGAC ATGGGGAGCT GGTCGTTTCC TGGATGAATA ACTGGGCCTA CGCTGATAAC GGACCTAATC CTGGCTGGAG AGGCGTCATG ACAATTCCAA GAACAATAGC GCTCCACAAC GTGAACGGTG ATGTCAAGGT ACACCAGTAC CCCGCAGCTG AGGTAGCCCG GATGCGGAAG AATGTGATTA CCGATCTCTC GCAGGAGACG GTCGTGCCGG ACGAAAACCC TCTGGAGCAG GTAGACATCG GGAATCGAAC GCTCGATATC GTTGCGACCG TCGACCCACG GAACGCCGAC ACAGTGGAGT TTCGTGTCTG TGAAGGGAGG ACTCAGGCAA GCAGCATCGT CTACGATACA GTAAACGAGG AGCTGCACTT CGACCGGACG AACGCGGGAG CGTTCTTCGA CAATGACGCC TATGGAACGA CGTCAATGCC CCTAAAACTG CGTGAGGACG GAACGGTCAA ACTCCGTATC CTGATCGATC GGTGTTCAGT GGAACTGTTC GCCAACGATG GTCGCCGCAC GATGACGAAT TTGGTGTATC CCGACCGTGA GAGTACCGGA GTCTCATTCT CTTCGGAGGG GGGTGCCGCC GAGGTCGAAC GCCTGATGGT TTACGGGTTA GAAACGTCGT AA
|
Protein sequence | MTQKLPLNPN QWRPEYHFSP SEGWLNDPNG LVYCDGVYHM FYQAGEHRRR WDHATSEDLL SWSEQGTKIQ DTQSVQSFSG GAVVDRGDTA DFGENTLVFT YTGHHDDGTE DQRLAYSTDI GDTVRTYNAN PIIESATGDF RDPNAFWYEP DASWRMVVSR VEGTRDRPAG IEVYSSDNLR DWAYESTYRD TDGEEWECPS LFELPVERTA ESRWVMIVSS IENCSVEYHI GHFNGTEFVA EDVILADYGY DFYAAQNWEN PPRHGELVVS WMNNWAYADN GPNPGWRGVM TIPRTIALHN VNGDVKVHQY PAAEVARMRK NVITDLSQET VVPDENPLEQ VDIGNRTLDI VATVDPRNAD TVEFRVCEGR TQASSIVYDT VNEELHFDRT NAGAFFDNDA YGTTSMPLKL REDGTVKLRI LIDRCSVELF ANDGRRTMTN LVYPDRESTG VSFSSEGGAA EVERLMVYGL ETS
|
| |