Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0895 |
Symbol | |
ID | 8741479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 914514 |
End bp | 915581 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646511473 |
Product | Cellulase |
Protein accession | YP_003402463 |
Protein GI | 284164184 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCCG TCCCGTTCGA TTTCGAATTT CTCACGGAAC TGACCGAGAC GAGTGGCGTC CCCGGCTACG AGGACCGCGT CCGTGACCTC GTCGTCGACG AGTTCGAGGA GAACGTCGAC CGGATCCGAA CCGACGCGAT GGGGAACGTC GTGGGGACGC TCGAGGGCGA GTCCGACTAT TCGGTCGCCG TCGCGGCCCA CATGGACGAG ATCGGCTTCA TGGTCCGCCA CCTCAAGGGC AGCGAGGACG GCTTCGGCTT CGTCGAACTC GACGCGCTGG GCGGCTGGGA CGCGCGCGTC CTCAAGGCCC AGCGGGTGAC GATCCACACC GACGACGGCG ACCTGCCGGG GGTCATCGGT TCGCCGCCGC CACACACCTT GAGCGACGAG GACCGCGAGA AGACTCCCGA GGTCGAGGAC GTCGTCGTGG ACGTCGGTCT CCCCTACGAG GACCTCGAGG AGCGGGTCTC GCCCGGTGAC CTCGTGACGA TGGATCAGAC GACCGAGCGC GTCGGTGAGA CGGTCACCGG CAAGGCGCTC GACGACCGGA TCTGTCTGTT CGCGATGCTC GAGGCGGCTC GCCGGATCAC CGAGCCCGAC GTGACGATCC ACTTCTGTGC GACCGTCCAG GAGGAGGTCG GCCTGCGGGG CGCCCGCGCG CTGGGCGTCG ACGTCGATCC CGATCTGGCG CTCGCGCTCG ACGTCACCGT CGCCAACGAC ATCCCCGGCT TCGAGGCCGG CGATCGCGTC ACCGAACTCG GCGACGGCGC GGCGATCAAA CTCAAGGACG GGAGCGTCAT CACGAACCCG AAGGTCCACA AGCGGCTCCA GTCGGTCGCC GACGAGGCGG AAATCGACTA CCAGCGCGAG ATCCTCCCCG CCGGGGGCAC CGACACGGCC GGCTTCCAGC TTTCCAACGG CGCCAAACCC GTCGGCGCGA TCTCGATTCC GACGCGATAC CTCCACACGC CGACCGAGGC CGCCCACGTC GACGACGTCG CGGCGATGAT CGATCTCCTC GAGGCGTTCC TCTCGAGCGA GGACGGCAAG GAGGACTACA CTCTCTGA
|
Protein sequence | MESVPFDFEF LTELTETSGV PGYEDRVRDL VVDEFEENVD RIRTDAMGNV VGTLEGESDY SVAVAAHMDE IGFMVRHLKG SEDGFGFVEL DALGGWDARV LKAQRVTIHT DDGDLPGVIG SPPPHTLSDE DREKTPEVED VVVDVGLPYE DLEERVSPGD LVTMDQTTER VGETVTGKAL DDRICLFAML EAARRITEPD VTIHFCATVQ EEVGLRGARA LGVDVDPDLA LALDVTVAND IPGFEAGDRV TELGDGAAIK LKDGSVITNP KVHKRLQSVA DEAEIDYQRE ILPAGGTDTA GFQLSNGAKP VGAISIPTRY LHTPTEAAHV DDVAAMIDLL EAFLSSEDGK EDYTL
|
| |