Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0209 |
Symbol | |
ID | 4710979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 241890 |
End bp | 242945 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639854668 |
Product | endoglucanase |
Protein accession | YP_001001805 |
Protein GI | 121997018 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4124] Beta-mannanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.405756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCACC GCCGACTGAT CCGCTGGGCC GGCGGCGCCC TGATCTGCGC CCACGGACTG CTCCCCGCCG GACTCCCGGC GGGCAGCGCC CCGCCCCCCG CCGCACCCCC GGTGGCCTTT GGCGTGGCCC TGGACGGAGC TGCCAGCCGC GAACGCCTGG CCCACCTGGA ACAGGCACTG GAGCTCGAGA TCGGCCTGGT GGCCATCTTC ATCCAGTTTC CCGAGGATCC GCGGCACGAC AACTTCCCGG CGGAGCAGCT GGACGCCATC CGTCAGGCCG GCGGGCGGCC GGTCCTGACC TGGGAGCCGA TGTACATCGC CGATGGCGAG GAGCACGCCA TCCCCGCCGA GGAGCTGACC GGCGGCGCCT ACGACGCCTA CATCCGTCGT TTTGCGCGCG GCGTCAAGGC GTTCCCCGAG CCGGTAATCA TCCGCTTTGC CCACGAGATG AACCTCGACC GCTACCACTG GGGCTCCACC GCCGAGGACT ACGGCCCATC GGCCCCGACG CGGTATCGGG CGATGTTCCG GCACGTGGTG GAGATCTTCC GCGACGAGGG GGCAGCGGAG CACGCCCGCT TCGCCTTCAA CCCCAACGCC GAGTCGGTCC CCTCGCCCGA CCGCGACCCG GACGCCGACT GGAACCGGCC GGAGGCGTAC TACCCCGGCG ACGCCTACGT CGACGTCCTG GGCATGGACG GCTACAACTG GGGCACCACC CGGACCCGCG AGGAACACGG CTGGGACAGC CGCTTCCAGT CCTTCCAGAC GATCTTCGAG CCGCTCTACC GCACCCTGCG GGATCTCGCC CCCGACAAAC CCATCTACGT CTTCGAGACC GCCACCGTCA CCGACGGGGG CGACAAGGCG GCCTGGATCG AGCAGGCCGC CGCGTCCGCC GTGGCCTGGG AGCTGGCCGG GCTGGTCTGG TTCCATAACG ACAAGGAAGA GAACTGGCGG CTGGATACCG GTGTCACCCC GGAGGACCTT GAACCGCTGC GGCGGATGAT CACCGACCCC GAGGCCCTGC TGGAGGGACG ATCCCGTGGT GACTGA
|
Protein sequence | MNHRRLIRWA GGALICAHGL LPAGLPAGSA PPPAAPPVAF GVALDGAASR ERLAHLEQAL ELEIGLVAIF IQFPEDPRHD NFPAEQLDAI RQAGGRPVLT WEPMYIADGE EHAIPAEELT GGAYDAYIRR FARGVKAFPE PVIIRFAHEM NLDRYHWGST AEDYGPSAPT RYRAMFRHVV EIFRDEGAAE HARFAFNPNA ESVPSPDRDP DADWNRPEAY YPGDAYVDVL GMDGYNWGTT RTREEHGWDS RFQSFQTIFE PLYRTLRDLA PDKPIYVFET ATVTDGGDKA AWIEQAAASA VAWELAGLVW FHNDKEENWR LDTGVTPEDL EPLRRMITDP EALLEGRSRG D
|
| |