Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0925 |
Symbol | |
ID | 4709879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 999053 |
End bp | 1000315 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855394 |
Product | peptidase M42 family protein |
Protein accession | YP_001002503 |
Protein GI | 121997716 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAAT CCAGCAAGCC GTGGACCCAG TCCATGCCCG AGGAGCAGTT CGAGCGCATG CGCGAGGTCC TCGCCGCGCC CAGCCCGGTC GGCCTCGAAG GGGCCATGAC CTACGGGGTG CTCAAGCCGT ACTTCGAATC CTTCGCGCCC GCCGAGTGGC GCGTCCACCA GTTCCAGGGG CACGCCGGCA TCGTCCTCGA CACCCATCCG GGCCGGGACG ATCTATTCAA GGTGATGGTG GTCGGCCACG CCGACAAGAT CCGCATGCAG GTGCGCAGCA TCGGCGACGA CGGCAAGGTC TGGATCGACA GCGACTCCTT CCTGCCCGGC ACCCTGATCG GCCACGAGGT CACCCTGTTC AGCGAGGCCC CGGAGAACCC CGGCGCCTAC CGGCGCATTG AGGGCGGCAC CGTCGAGGCT CTGGGCGCCA TCCACTTCGC CGACGAGGAG ACGCGCACCG GGCGTAAGGG GGTCAGGAAG GAGCAGCTCT ACCTGGAGCT TCACATCCAC GGCGAGAACA AGAAGAAGCA GGTCGAGGAC CTCGGCGTCC GCCCCGGCGA CCCGATCCTC CTCAACCGGC CCATCCGCCG GGGCTTCAGC CCGGACACCT TCTACGGGGC CTATCTGGAC AACGGCCTGG GGTGCTTCAC CACCGCCGAG GCGGCGCGCC AGATCGCCGA GGCCGGCGGC GCCCGCAACG TGCGCATGCT CTTCGCCATC GCCAGCTACG AGGAGATCGG CCGCTTCGGC AGTCGCGTGC TGGCCAGTGA GCTGCGCCCC GATGCGCTGA TTGCCGTGGA CGTGGACCAG GACTACGTCG CCGCCCCGGG GGTCTCGGAC AAGCGCTTCC AGCCCCTGAC CATGGGTGCC GGCGTCACCT ACACCGTTGG CGCGGTGGCC AGCGATCAGC TCAACGCGGT GATCCAGCGG GTGGCGACCG AGCAGGACAT CCCGGTGCAG CGCGACGTCA GCGGCCGCGA CACCGGCACC GACGGCATGG CCGGGGTGCT CGGCAACGTG GATTGCACCG CCGCCTCGCT GGGGATCCCG GTGCGCAACA TGCACACCAT CTCCGAGAGC GGCCACACCG GGGACGTCCT GGCGGCCATC CACCTGGTCA CCGGGACCCT GCAGGCCCTC GATGCCCAGG ACGACGGCAG TGGCCGACTG CGCGAGACCT TCCGCCAGGG GCATCCACGC CTGGATCAGG CGGCCGGGCT CAGCCACCCG GGCCCGAAGG CCAAGAACGG CGAGGCGAAG TAA
|
Protein sequence | MTQSSKPWTQ SMPEEQFERM REVLAAPSPV GLEGAMTYGV LKPYFESFAP AEWRVHQFQG HAGIVLDTHP GRDDLFKVMV VGHADKIRMQ VRSIGDDGKV WIDSDSFLPG TLIGHEVTLF SEAPENPGAY RRIEGGTVEA LGAIHFADEE TRTGRKGVRK EQLYLELHIH GENKKKQVED LGVRPGDPIL LNRPIRRGFS PDTFYGAYLD NGLGCFTTAE AARQIAEAGG ARNVRMLFAI ASYEEIGRFG SRVLASELRP DALIAVDVDQ DYVAAPGVSD KRFQPLTMGA GVTYTVGAVA SDQLNAVIQR VATEQDIPVQ RDVSGRDTGT DGMAGVLGNV DCTAASLGIP VRNMHTISES GHTGDVLAAI HLVTGTLQAL DAQDDGSGRL RETFRQGHPR LDQAAGLSHP GPKAKNGEAK
|
| |