Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1167 |
Symbol | |
ID | 5733060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1339259 |
End bp | 1340515 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278307 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001543943 |
Protein GI | 159897696 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000303009 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCGTC ATCTTTTTCG CGGGGTTCTG GCAATTGGGA TTTTGGTCTG GATCGTGTTT CTCTGGCAGT TTATTGATTT ACGGATCAAG CACAGTCGCG CCGCCGATGC CGCCGCCCAA GCTCTCTTGC CCCGAGCAAC CTCAACCGCC TTGCCCGTTC CTACCTTTGT TGCGCCAACC TTGCAGCCGC TGCCCAACAC GGTTGCCGAT GGTCAAGGCG GCTCAGAGCT ACCTGCTAGC GCCAATGATG TGCGCGGGGT ACACCCCAAA ACAGGCCGCT ACGTCGCTGC TTGGCTGCCA ACCTCGTTTG ATGCCGAGGC GGCCCGTGCA ACCTTTGAAG CCAACAAAGA TATTCTCGAT GAGGTCAGCC CGTTTTGGTA TGGCGTGCGA CCTGATGGCA CGTTAATCGC CGACGTTGGC TCACGCGATG CCGAATTGGT GCAAATTGCC AAAGAAAATA ATGTGCTGAT TATTCCAACT GTGCATAATA TTGAAGATTT GGAAGCAGCT TCGGTGGTGT TGGCAACGCC CGAAAGCCGC ACAAACCATA TTAATATTAT TATGGATGAG GTTCGGACCT ACGGCTACGA TGGCATCGAC ATCGATTATG AATCGCTTGC GCTTGATTAT GAAGATGAAT TTACCGCCTT TATGACCGAA TTGGGTGCTG CGTTGCATGC TGAAGATAAA TTATTAACCG TTGCAGTGCA TGCCCACACT GGTCGCCCCG ATTACCAAAA TTATGCCGAT TTGGGCAAAG TGGTTGATCG GCTGCGGATT ATGACCTACG ATTATAGCTG GCGTGGCTCG GAGCCAGGCC CAATTGCTCC GATGTTTTGG GTCAAAGCGG TGGCCGAATA TGCCAAAACC CAAGTTGACC CCAGCAAAAT TCAAATTGGC ATTTCGTTTT ATGCCTACGA TTGGCCAGGT AATGGCGGCT TTGGGGTTGC TCGCACCTAT ACTGAAGTTG AAGAAATTAA AGCCACCTAT CAGCCCCAAA TTCGCTTAGT TGAGGAAGAT GGCGGCCAGC AAATTCAGGA ATCAACTTTT AACTATGCTG GACGCACGGT TTGGTTCTCC AATTATCGTT CGCTAACCGC CAAAATGGAG ATGGTGCGCG AAAACGATTT AGCAGGCATT GCAATTTGGC GCTTGGGCAG CGAAGATCCC CAAAACTGGA CCTATATTCG CGAATCGTTG AAACAAGATC CCTTGATCGT CCAACGCTCA ATCAATCGCT ATCTTCCTGG CCATTAA
|
Protein sequence | MFRHLFRGVL AIGILVWIVF LWQFIDLRIK HSRAADAAAQ ALLPRATSTA LPVPTFVAPT LQPLPNTVAD GQGGSELPAS ANDVRGVHPK TGRYVAAWLP TSFDAEAARA TFEANKDILD EVSPFWYGVR PDGTLIADVG SRDAELVQIA KENNVLIIPT VHNIEDLEAA SVVLATPESR TNHINIIMDE VRTYGYDGID IDYESLALDY EDEFTAFMTE LGAALHAEDK LLTVAVHAHT GRPDYQNYAD LGKVVDRLRI MTYDYSWRGS EPGPIAPMFW VKAVAEYAKT QVDPSKIQIG ISFYAYDWPG NGGFGVARTY TEVEEIKATY QPQIRLVEED GGQQIQESTF NYAGRTVWFS NYRSLTAKME MVRENDLAGI AIWRLGSEDP QNWTYIRESL KQDPLIVQRS INRYLPGH
|
| |