Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0922 |
Symbol | |
ID | 5732691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1054456 |
End bp | 1057143 |
Gene Length | 2688 bp |
Protein Length | 895 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278054 |
Product | lantibiotic dehydratase domain-containing protein |
Protein accession | YP_001543698 |
Protein GI | 159897451 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATG TTTATCAATC AAGCATTGTG ATCGCCGCTG ATCAACCTGT CCAACCAACC TGGCAACCTG CTTCGCCCGT CTTGGTACGC TTGGCTGGAA CAAGCTACAA CCATTTACAT AACCTAAAGT TTGAGCGCAC GATTGGTCTG TTTCAAGTGA TTTTGCGCCA ACAACGCTGG CTCGATAACC ATACCCAGGC AATTGTTGAT GTGCTCTACA CATTAATTGG CACAACCCAG GGTTCGCTTA AAAGCAGCTT GGTGGCCCTA AAACGCGATA TTTTCAATCG CCGTTTGCCC TTGCGCAGCG ATGTCGCTAG CTTGCGCAAT CTGGAGCATC CAAGCTTGCG CGGGCAGGTG CTGCGGTTTC GCCGCACCTT GCAGCAATTG CAACGGGCTT GGCGCGAGGC TCGGTTGGTG TTTGAACAAG AATTACAAGC CAAACGGCGT TTGTTGCAAC AAAGCTATGC CCAGCCAGCC TTTCAAAAAG CGGTGCAAGT GGCTAGCTCA ACCTTGGCTA GCAGCTTGCC GCGCTATATC AACGCCGATC CCAGCCAGTT GCGCAGCCGC GAACGCAAAA CCGAAGTTGG AGCCTACCAA TATCTGATGC GCATGGCCGG CAAAACCAGC CCATTTAGCC ATTTTGGCCC TTTGGTGCTC GGCTCGGCTG ACCCAACATT GGCTCAACCG ATTGTGATTG AAGGCCAAAT GCCCCAAGCG CGGGCAGTCA GCCAACTGCG GCGCTCGGTG GTTGGCGCAA TTCGCAATGC TTTGGTGCGC ATTCCGGCAA TCAAGGCTTA TTTGCCCTTG CGCTTGAATC CGAGTAGTTT TGTTGATGGC GAGAGTTTAG TTTTTTCGCG CTTGCGCCAA CAACTTGATG ACCCCAACCA ATATTTTGCT TCATGGCAAC GCCGTGGTAA ACGTTCGCTC TTGCTCGATA GTGTGTTAGC AGTTTTTGCC AACCAACAGC AATTGAGCTA CGCCGAATTG TTGGAGCTTT TGCAAACCCA ACATCCTGAG CTAAGCTCTG CCAAACTTGC CCAAACGATC AATCAATTGC TCACCAGCGG CCTGCTCTTT TACAATTTGG ATTTGCCCTC AGATTATCGT GATCCAGTGC TGGTGTTGAT TGAGCAATTA GCCGATGTTC CCTCGGCTGA GGCTGGGCAA ATTCGTGAAT TATTACAACG TTTGGCTGAA TCGGCGCAAG CCTATGCCCA AGCCACGGTT GCTCAACGTT GGCAGCTTGA GCAAAGCATG CGCCAAATCG TCGATCAGAT CTTGGCCTTT GCGCCCGAAT TGAAGCAAAC CAGCAGCCAA CTTGGTCCGC TAATCGTTGA AGATGTAGTT TGGGATGACG TAGCGGTGCA TTTGGGGCAG CCATTATTGG CAAGCTTGGC TAGCGATTTG GCTCCAGTTT TGGAGTGTGG TTTTCAGCGC GACGGGCGTG GCCCAGCCCA AAGTTGGCTG CTCGATATTT TCACCCATGC CTTTGGCGAA GGCGGCCAAA CCGACACTAT TCTAGGCTTT GCGCCTGAAT ATAATCGGCT CTCGGCCAAT TCTGGCGAGC TGCCAATGCC GCGCATTAAC GCTATTCAAC AACGTCAACG CCGTTATGGC CAGTTTTTGC ACCAAGCGCT TGATGCTGAG CGCGAAGCCC GAGAATGTGT GCTTGATCCC CATGCATTTT GGCAACTTGC CGCCGATTTT GGTCAACATA ATCACTATGG CTCAACCAGT ATTCAATTTC AGGTGGCCGC CAGTAGCCAA ACAGCGCTTG AGCAAGGCGA TTATTTGGTG GTGTTGAATT ACACTTTGCC GGGCTTTGGG CGCTTCTTGA CGCGCTATTT GAGCTTATTG CCTGCGCCAA GCTGGCAAAC CTATGTGCAA TCGGCATGGC AACCATTGCA TAACGACGCT CAAGCCGTGC CAGCCGAAAT TGTTTCGGTG CTCGAACATA ATGCCCAAGT TCATGTGCCA TTTACGGCAA ATGTGATTGT GCCGCCCGAC GAGCCAAGCT ATCGTTCGCA AGCGATTCAG CATGGCGTTG GCGATTTGAG CCTTGCCCAC GATCCGGTTG CTGATCAATT ACGAGTTTAT CACCGCCACG CTGATGGCAG CCAACAAGAG CTTTTGCCCT TGTATATGGG CTTTTTGCAT ATGGTTTCGC TACCAATTTT GCAACGGGTG CTGGCCCAAC TCAGCCCAAG CAGCTATCAC ATGGAGCAAT TGCGACCGCG TGAGCAAACC TTGGTTTCGC GCAAGCAACC AACCAATGCC ATCAGCCATA CGCCACGCTT ACGCATTGGC CGCTTGGTCT TGCAACGTGA AAGCTGGAAT GTGCCAACTG CCGCGATTCC TAGCAGCGTC GCCAACGACG AATTTATGAG CTTTTTCAAT TGGTATAGCT GGGCGGAACA AGCAGGCCTG CCGAGCGAAG TGTTTGTACG CATCCAACGG CCACTTAGCA GCAAACATCT CAATATGTGG ACCAACCACA AACCACTGGC GCTCGATTTC GAAAATTATT TCAGCATTCA GATGTTGCGC CATGTGATTG CTGATGATCA GGTTAGCTTG ACGCTTGAAG AAATGCTGCC CAGCCCGCAG CAGCAGTGGT TTGATCTTGA TCAACAACCC TATGTGATTG AATTTCAGGT TGAGTTTAAT CGGGGAGGCA GCATATGA
|
Protein sequence | MTDVYQSSIV IAADQPVQPT WQPASPVLVR LAGTSYNHLH NLKFERTIGL FQVILRQQRW LDNHTQAIVD VLYTLIGTTQ GSLKSSLVAL KRDIFNRRLP LRSDVASLRN LEHPSLRGQV LRFRRTLQQL QRAWREARLV FEQELQAKRR LLQQSYAQPA FQKAVQVASS TLASSLPRYI NADPSQLRSR ERKTEVGAYQ YLMRMAGKTS PFSHFGPLVL GSADPTLAQP IVIEGQMPQA RAVSQLRRSV VGAIRNALVR IPAIKAYLPL RLNPSSFVDG ESLVFSRLRQ QLDDPNQYFA SWQRRGKRSL LLDSVLAVFA NQQQLSYAEL LELLQTQHPE LSSAKLAQTI NQLLTSGLLF YNLDLPSDYR DPVLVLIEQL ADVPSAEAGQ IRELLQRLAE SAQAYAQATV AQRWQLEQSM RQIVDQILAF APELKQTSSQ LGPLIVEDVV WDDVAVHLGQ PLLASLASDL APVLECGFQR DGRGPAQSWL LDIFTHAFGE GGQTDTILGF APEYNRLSAN SGELPMPRIN AIQQRQRRYG QFLHQALDAE REARECVLDP HAFWQLAADF GQHNHYGSTS IQFQVAASSQ TALEQGDYLV VLNYTLPGFG RFLTRYLSLL PAPSWQTYVQ SAWQPLHNDA QAVPAEIVSV LEHNAQVHVP FTANVIVPPD EPSYRSQAIQ HGVGDLSLAH DPVADQLRVY HRHADGSQQE LLPLYMGFLH MVSLPILQRV LAQLSPSSYH MEQLRPREQT LVSRKQPTNA ISHTPRLRIG RLVLQRESWN VPTAAIPSSV ANDEFMSFFN WYSWAEQAGL PSEVFVRIQR PLSSKHLNMW TNHKPLALDF ENYFSIQMLR HVIADDQVSL TLEEMLPSPQ QQWFDLDQQP YVIEFQVEFN RGGSI
|
| |