Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0344 |
Symbol | |
ID | 5732254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 411441 |
End bp | 413591 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277468 |
Product | peptidase domain-containing protein |
Protein accession | YP_001543124 |
Protein GI | 159896877 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.598712 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCTAC GGATTGGGTT TATCGGCCTC ATGATCGCTT TACTTGGGTT AACTGGTAGT TTACGGGCTT TCAAGCCAGT TCAAGCTGCC CCAAGTAATC CAACTACCGC TCAAATTGCT CAAGCGAATG TCCCGCAGGG GGCAGTCCGC GTTAAAGAAC AGGAAAGCAA CAATACGACT GCCACCGCCA ACGCGATCAT GTCCAGCAGT ACCCTAATTC GCGGGAATAT CTGGAATGGT GATGTTGACT TCTATAGCAT TGAATTAGCC GCTGGCGATC GCCTGACCGC TGCAACCATG AGCGCGGCTT CAGCCTCTGG CAACAACGAT ACGGTATTAA CCTTTTTTGC GCCTGATGGT ACAACCGTGA TTGAAACCGA TGATAACGAT GGCTCACTTG GCGGCAATGC CTCGGTGATT AGTTCGGCTC CGGTGACCCA AACCGCTACC TACTATCTGC GGTTAAGTAG TACTAGTACC AACCAAGTTC GCTACTACGA CTTCTATGTG CGGGTGTTGA GCGAAGCCGC CACTGCTGAA ACAGAGCCAA ATGATCTTTT GACAACCGCC CAAGTGTTGC CTACCAGCGG CACAATTTCG GGGGTCTTAT CGGCTACAAC CGACCTCGAT TACTACCAGC TTAACTTGAA TGCTGGTGAT ACGATCTTTA CCAGCCTCGA CCTCAACCCT GAGCGCGATG GCATTGTTTG GAACGGACGA CTTGGGATCG GCGCGTTTAA TGGCTTTACT TTAGTTGTTG ATGATACCAG TGTAGTCTCA CCAAATGCTG AAGCCCATTT TATGACCGTC AAGACGAGTG GTACCTACTA TGTGCTGGCC TCAACGACGG CTGCAAGTAT TCCGGCTGAA GCAACCTATT TGTTGGGTGT CACAGTGTTC CCTGCCGAGC AGCAAGCCAA TTGTACAACC TACACCAGCG CTGATGTTGC CAAGACCATC CCAACGACTA CCAGCTTTAT TACCTCAACC TTGACTGTTC CCGACAACTT TATCATCGCT GATCTTGATG TCAGTATCGA ACTGACTCAT ACCAATATGC CCGACCTCGA TGTTCAACTC CAAGGGCCAG ATGGCAATGT TGGAGGGCTA TTTACCGATA TTAGCAATAA TACCCAGACC ACGATGAATA TCGATCTTGA TGATGAAGCC GCAATTCCAA TTGGACTTTT CGCGATTGTC TCGGGCATCC GTTATATTCC AGAGTTGAAT TATCGCCTCG ATTGGTTCGA TGGTGGGCGA TCGGCTGGCA CCTGGACTCT CTTGATTCAC GATGATACTG CTGCCAACGG TGGAACATTG CAAAATTGGA GCATCACGCT GTGTGCCGCG CCACCAGCAA GCTGCCCCGA TGGTATGAGC ATGAGCACAA TCACGAGCAC TGATTTTGAA GCCAATGATG GTGGCTTTAC CCATAGCGGA ACTGCTGATT CATGGGAATA TGGCGCACCA AGCGCTGCTC CATTGATCGG TAGCTATAGT GGGGCTAATA GCTGGAAGAC CAACCTTGAC GGAACTTATT CGGTCAGTAG TATTGCCAAT CTCTTCTCGC CATCAATTAG TATTCCCAAT GTGACTGGCC CTGTCTACAT CCAATGGCAA CAACGCTACC AGATGGAAAG TGCTAATTTT GATCATTACA CGGCCTCGGT TGGTACAGGC GTTGTGACCA AAACCTTGTA TGAATTTGAT GCAGCAACCA TGACCAATGC AGTTGGTAAC CCAAGCGTAA CCTTCCAAGA AAGCACAGTT TGGAGTCGCC AGCTCTACGA TATTAGCGAG TTCAAGGGTC AATCAATCCA GGCCTTGTAT CACCTAGATA CTGATAGTTC GGTTCAATTG GGTGGTATCG CGATCGACGA TTTCCAAATT ATGGCCTGTG ACGGTGTGGC AGCAACCGCG ACCCCAACCG AAGCACCAAC CGCTACCAGC ACACCAACCA ACACGCCAAC TAACACACCA ACCAACACGC CAACTAACAC GCCAACCAAT ACACCGATTA TCACGATTAC CCCAAGCAAT ACGCCGACGG CTACGGTTAC CCCAAGTAAC ACGCCAACCG TTACCCCAAG CGTGACCGCT GGGCCAACCA TGATTCCGGT TTATATGCCA TTAGTCGGTA AAGACCAATA G
|
Protein sequence | MRLRIGFIGL MIALLGLTGS LRAFKPVQAA PSNPTTAQIA QANVPQGAVR VKEQESNNTT ATANAIMSSS TLIRGNIWNG DVDFYSIELA AGDRLTAATM SAASASGNND TVLTFFAPDG TTVIETDDND GSLGGNASVI SSAPVTQTAT YYLRLSSTST NQVRYYDFYV RVLSEAATAE TEPNDLLTTA QVLPTSGTIS GVLSATTDLD YYQLNLNAGD TIFTSLDLNP ERDGIVWNGR LGIGAFNGFT LVVDDTSVVS PNAEAHFMTV KTSGTYYVLA STTAASIPAE ATYLLGVTVF PAEQQANCTT YTSADVAKTI PTTTSFITST LTVPDNFIIA DLDVSIELTH TNMPDLDVQL QGPDGNVGGL FTDISNNTQT TMNIDLDDEA AIPIGLFAIV SGIRYIPELN YRLDWFDGGR SAGTWTLLIH DDTAANGGTL QNWSITLCAA PPASCPDGMS MSTITSTDFE ANDGGFTHSG TADSWEYGAP SAAPLIGSYS GANSWKTNLD GTYSVSSIAN LFSPSISIPN VTGPVYIQWQ QRYQMESANF DHYTASVGTG VVTKTLYEFD AATMTNAVGN PSVTFQESTV WSRQLYDISE FKGQSIQALY HLDTDSSVQL GGIAIDDFQI MACDGVAATA TPTEAPTATS TPTNTPTNTP TNTPTNTPTN TPIITITPSN TPTATVTPSN TPTVTPSVTA GPTMIPVYMP LVGKDQ
|
| |