Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0713 |
Symbol | |
ID | 5732628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 819199 |
End bp | 820158 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277843 |
Product | helix-turn-helix type 11 domain-containing protein |
Protein accession | YP_001543489 |
Protein GI | 159897242 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.677583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAGCC CAACAACCCG TTTGTTAACC ATCCTTGAGC TCTTACAATC GCATCAGCGA ATGCGCGGTG CTGAACTTGC CAGCCGCCTC GAAATTAGCC AACGCACCGT GCGTCGCTAT GTGGTGATGC TCCAAGATAT GGGAATTCCG GTCGAGGCTG AACGCGGCCC TGATGGTGCT TACTATCTTG GGCGCGGCTA TACCATCCCA CCCCTGATGA TTAATCCGAC TGAGGCATTG GCGCTGGTAC TGGGATTGCG CATGATTCAC GCTTTGCAAT TTCCTGTTGA TGCGACCGCA ATTGAGGGTG CGATCGCCAA ACTCGAACGG GTTATGCCAA CAATTCTGCT TGATCAGGTA CGGGCTTTTC AAGAGGCGAT TACGTTTCAT GTTGCTCCAC CGCCTGCCCT CTTGCAGCCA AGCATCGTCG CCAGCTTAAG TGTGGCGGTG CATACGCGCC AGCAGGTATG GCTGAGCTAT CAAACATACA ATGGGGATGC ATCGGTACGG GTGTTTGATC CCTATGGCAT CGTCTATTAT CAAGGGTATT GGTACAGCGT TGGCTATTGC CACCAACGCA ATGATCTGCG CACGTTTCGC ATTGATCGCA TTCAAGAGCT AAAGTTGCTT AATCAGTCAT TTGAGCGTCC CCAAGATTTC GATGCGTTAC ACCATATGCA CACCACGTTG GCCACCATGG CAGGGCCATA TGCGGTTGAG ATTGTTTTTG CAGCAACCAT GGAACAGGTG CGCCATGTGC TGCCACCCGC AGCCGGAACC TTTGAGCAAA GTGAGGTTGG GATTATTTGG CGGCGGGAAA CCTACGAATT AACCTCGATT GCCCACCGCT TGTTACAAAT CGATCTGCCA GCTACTATTC GCCAACCCTC TGAATTAAAG GCGATGATGC ATCAGCTGGC AGCGAAGGCG CTAGGAATGA CGCTTAATCA ACATCCATAG
|
Protein sequence | MYSPTTRLLT ILELLQSHQR MRGAELASRL EISQRTVRRY VVMLQDMGIP VEAERGPDGA YYLGRGYTIP PLMINPTEAL ALVLGLRMIH ALQFPVDATA IEGAIAKLER VMPTILLDQV RAFQEAITFH VAPPPALLQP SIVASLSVAV HTRQQVWLSY QTYNGDASVR VFDPYGIVYY QGYWYSVGYC HQRNDLRTFR IDRIQELKLL NQSFERPQDF DALHHMHTTL ATMAGPYAVE IVFAATMEQV RHVLPPAAGT FEQSEVGIIW RRETYELTSI AHRLLQIDLP ATIRQPSELK AMMHQLAAKA LGMTLNQHP
|
| |