Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0596 |
Symbol | |
ID | 5732494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 688475 |
End bp | 689305 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277723 |
Product | UspA domain-containing protein |
Protein accession | YP_001543372 |
Protein GI | 159897125 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCTT TTAATTTGAT GATTTACCTT GATGGTTCCT CGGCAGCGCG ACGCATGGTG GCCTATCTTG CGCCATTAGC CCGTAAATCG CATGTCAAAA CGACCTTTTT GGTCGATGAA GCCCATCAAG ATGAAGCCGA AATGTATTTT TTCAACGCTG AGCAATTATT GCAAAGCGAT CAAGCGCCGA CCCGCACGAT TCGCGGAGCC ACGCCAGAAC GGGCGATTGT GCTCGAAACT CGTGCTAGCC AACCAGATTT GGTGGCGTTT GGGCCGTTGC GCAAGGAAGG TTGGCGACGA TGGCTGGGTC AATCGGCGAT TGGTTCGTTG GCTCGGCGTT TAACCTGCTC GATGTTATTG ATGCAAGGTC GCCCGAATGA GCTGCGCCGC GCCTTGGTTT GCGCTGCTGG TGGCCCGGCT ACGCTACACG ATGCCCAAAT GACGGCCTCA ATTATCGAAC CCCTTGGCGG CCAAGTAACA ATTTTGCATA TTGTGTCGCA ACTATCGCTG ACCTACAAGC CCGAGGAGCG CGACCCTGAG CGGCTCGCCG ATTTGGTGAT GGAAAAGCAG GGCGAGGTGG CGCGTAATAT TGCTGCCGCC AAAACCATTT TGACCGATCG CGGCATTACC ACCACCGTGC GGATTCGCGC TGGCATGGTG CTTGAAGAAA TTCAAGAGGA ACTCAAAACG GGCGGTTACG ATCTCTTGGT GATAGGTGCG CATCGGGCGC GAACGCCACT TGATCGGGTC TTGCTCGAAG ATGTTAGCGC TGAGATCTTG TTTAATAGTC CAATTCCTGT TTTACTTGTC CAAAATACCA GCGATTTCTA A
|
Protein sequence | MKPFNLMIYL DGSSAARRMV AYLAPLARKS HVKTTFLVDE AHQDEAEMYF FNAEQLLQSD QAPTRTIRGA TPERAIVLET RASQPDLVAF GPLRKEGWRR WLGQSAIGSL ARRLTCSMLL MQGRPNELRR ALVCAAGGPA TLHDAQMTAS IIEPLGGQVT ILHIVSQLSL TYKPEERDPE RLADLVMEKQ GEVARNIAAA KTILTDRGIT TTVRIRAGMV LEEIQEELKT GGYDLLVIGA HRARTPLDRV LLEDVSAEIL FNSPIPVLLV QNTSDF
|
| |