Gene Information |
Locus tag | Haur_3533 |
Symbol | |
ID | 5735392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4447760 |
End bp | 4449139 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641280680 |
Product | putative transcriptional regulator |
Protein accession | YP_001546297 |
Protein GI | 159900050 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
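The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the single terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 4447760
end_bp = 4449139
gene_length_bp = 1380
protein_length_aa = 459


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1380
print(expected_protein_length(gene_length_bp))  # 459
```

Both results match the Gene Length and Protein Length rows, as expected for an unspliced CDS on a single strand.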
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.412037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGAAC AGTTACTTGA AAACGGCGAA AGCGAAATGC TGGCCTGCTT CCGTGAACGG TTTCGGCCTG AAGATTTGGC TGAAACTTTA GTGGCTTTTG CCAATGGTGG CGGCGGCAGC GTAGTCATTG GCATCAGTGG GCGAGTCAGG CCCAAAGTCG AGGGTGTTCA AGATATTCAA GCAGCAGAAG AGGCAGCGTT AGAGGCAGCG CTAGCTTGCA CGCCGCCGCT GGTCTTGCCC TTGCCCCAAC AAATTACCAG CAATCAACAA ACCGTCTTGT TGATCGAGGT TCCGCCTGGT TTGCCGAATG TCTACAGTTT ACATGGCAAA TATTTGCGCC GCGAAGGCCC GAGCAATGCG CCAATTCCGC CCAATGCGTT GCGCCAATTG CTGATCGAGC GTGGCGAGGT TGGCTGGGAG CGCGTGCGGC CCGAAAACGT CACAATGGGC GATTTGAGCG AGATGAAAAT TCAATCGTAT GTGGCCCGCA TTGGTCCGCC CGCCGATGTT GACCCCATGG GTTGGCTCTA TCGGCGCGGC TGCCTCGTTC GCGATGCCCA AACCAACTAT CAACCAACCA ATGCTGGCAT GTTGCTGTTC AGCATCACCA CCGAGCGCGA TTATCCCCAA TGCGAGCTGA CTTTGGTGCG CTACACGGGC AACGAGATGA GCGATGAATT CGAGCGGGTC GATATTCGCA CAACCTTGCC CGACCAAGTA CGCCGCGCCG AGCTTTGGTT GGGCGAACAT ATGCGGCGCG GCTCACGCAT GGTTGGCCTC GAACGCGAAG ATTGGACGGA ATATCCGCCA GCCGCGGTGC GCGAAGCCTT GGTTAATGCT GTGGCCCACC GCGATTATAC CGTGCGTGGC GAGGGTATTC GCATTGCCAT GTTTGTTGAT CGACTAGAAG TCTATAGTCC AGGTCGCTTG CCAGGCCACG TCACGATCGA TAATATCGCT GCTGAGCGGT TTAGCCGCAA CGAAACATTG GTGCAAGTGT TGGCCGATTT GGGCTTGATC GAACGGCTTG GCTATGGCAT CGACCGCATG CAGCGCCAAA TGGCCGACGC AGGCTTGCCA CCACCCGAAT TTCGCGAAAC CGCTGCGGGA TTTTTGGTAA CCCTGCGCAA CAAAGGCATG AACGTCGCCA GCAATTCCAC GATCGACACC GCCAAATGGG AAGCCTTAGG CTTGAACGAA CGCCAAATCG GAGCTTTGGT CTATTTAACT GAAAATCGCC GCATTACCAA CCGCGATCTG CAAGACCTCA GCCCCGATGT TTCGCCCGAA ACAATTCGGC GCGATCTCTC GGATTTGGTC GATCGGGGCT TGATTTTGAA GATCGGCCAA AAACGTGCAA CCTATTATAT TTTGCGCTAG
|
Protein sequence | MLEQLLENGE SEMLACFRER FRPEDLAETL VAFANGGGGS VVIGISGRVR PKVEGVQDIQ AAEEAALEAA LACTPPLVLP LPQQITSNQQ TVLLIEVPPG LPNVYSLHGK YLRREGPSNA PIPPNALRQL LIERGEVGWE RVRPENVTMG DLSEMKIQSY VARIGPPADV DPMGWLYRRG CLVRDAQTNY QPTNAGMLLF SITTERDYPQ CELTLVRYTG NEMSDEFERV DIRTTLPDQV RRAELWLGEH MRRGSRMVGL EREDWTEYPP AAVREALVNA VAHRDYTVRG EGIRIAMFVD RLEVYSPGRL PGHVTIDNIA AERFSRNETL VQVLADLGLI ERLGYGIDRM QRQMADAGLP PPEFRETAAG FLVTLRNKGM NVASNSTIDT AKWEALGLNE RQIGALVYLT ENRRITNRDL QDLSPDVSPE TIRRDLSDLV DRGLILKIGQ KRATYYILR
|
| |
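The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used to produce the record. It assumes the sequences are pasted in with the 10-character spacing shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
# Amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
first_30_nt = "ATGCTTGAAC AGTTACTTGA AAACGGCGAA"
print(translate(first_30_nt))  # MLEQLLENGE
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.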