Gene Information |
Locus tag | Haur_2548 |
Symbol | |
ID | 5734426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3274623 |
End bp | 3275783 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279688 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001545314 |
Protein GI | 159899067 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
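
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. Both assumptions are consistent with the values listed in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3274623, 3275783)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

The strand field does not enter this calculation: the length is the same whether the gene lies on the forward or the reverse strand.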
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000188671 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGACA AGCTACTGGA GCTTGCGGCA AAAAATCAAA ACCGCATCAA ACGCGAAGAC ATTCTCAGCG TGTTGCCAGA GCCAGAAACG AATATCGAAG CCCTCGAAAG CCTCTACGAA CAAATCGAAG AAGCGGGGAT CGAGGTTGTT GATGCAGATG AAGCGCCCAA CGTTGCCGAC CTCGATCTCG ATTTAGATCT TGATAGTGAT CACGACCTCG ACGGAGCCTT GCCTGATCTT TCGGATATTG CGCTCGACGA CCCAGTGCGC ATGTACTTGC AAGAAATTGG TCAAGTACCC TTGCTGACTG CGGCGGAAGA AGTCGATCTG GCCAAGAAGA TGGAAGTGGC TCACGCCGCC AACGATACCT TGCTGCAAGA TGATGGCATG CTTGAACATA GTGATCGAAT TTCGTTAAAA CGCCATATCG ATACTGGCCG AGCAGCCCGC CAACATTTGA TTCAGGCTAA CTTGCGCTTG GTGGTTTCGA TTGCCAAAAA ATATACCTCG TATGGCTTGA CCATGATGGA TCTGGTGCAA GAGGGCAATA TTGGCTTGAT GCGGGCAGTC GAGAAATTCG ATTACACCAA AGGCCATAAG TTCTCGACCT ACGCGACGTG GTGGATTCGC CAAGCAATTA CGCGAGCGAT CGCCGACCAA AGCCGCACCA TCCGCTTGCC TGTGCACATG GGCGAGGCAA TTAGCCAAGT TAAGCGAACG TCGCATAAGT TGCAACAAAC CATGCAGCGC GAACCAACGC CCGAAGAAAT AGCCGAAGCC ATGGGCATTA CCGCCTCGAA AGTGCGTCGC ACCTTGGAAG CCTCGATGCA CCCACTTTCG TTGGAAATGC CGGTTGGTCA AGAAGGCGAA GGCCGCATGG GCGATTTCAT CGAGGATGAT CGAGTTTCAA CCCCAGTTGA TGCTGCGGCC ATGACCATGT TGCGCGAACA AATTGAAGAA GTTTTGCAAA AATTGCCTGA GCGCGAACGC AAAATAATTC AATTGCGCTA TGGCCTTAAA GATGGCCGCT ACCGTACCCT CGAAGAAGTT GGCCTTGAAT TTGGCATTAC TCGCGAACGG ATTCGCCAAA TCGAAGCGGT GGCCTTGCGT AAACTGCGTC ACCCCCACCT TGGCAAGAAA TTACGTGGCT ATCTTGATTA A
|
Protein sequence | MIDKLLELAA KNQNRIKRED ILSVLPEPET NIEALESLYE QIEEAGIEVV DADEAPNVAD LDLDLDLDSD HDLDGALPDL SDIALDDPVR MYLQEIGQVP LLTAAEEVDL AKKMEVAHAA NDTLLQDDGM LEHSDRISLK RHIDTGRAAR QHLIQANLRL VVSIAKKYTS YGLTMMDLVQ EGNIGLMRAV EKFDYTKGHK FSTYATWWIR QAITRAIADQ SRTIRLPVHM GEAISQVKRT SHKLQQTMQR EPTPEEIAEA MGITASKVRR TLEASMHPLS LEMPVGQEGE GRMGDFIEDD RVSTPVDAAA MTMLREQIEE VLQKLPERER KIIQLRYGLK DGRYRTLEEV GLEFGITRER IRQIEAVALR KLRHPHLGKK LRGYLD
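
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch with hypothetical helper names, not part of the source database tooling. It removes the block spacing, computes GC content, and checks for a terminal stop codon; it assumes the three standard stop codons (TAA, TAG, TGA), which are the stop codons of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Only the first three display blocks of the gene sequence are used here,
# to keep the example short; paste the full field to analyse the whole gene.
fragment = clean_sequence("ATGATTGACA AGCTACTGGA GCTTGCGGCA")
print(len(fragment))
print(gc_percent(fragment))
print(ends_with_stop(fragment))
```

Running the same helpers on the complete Gene sequence field allows the listed Gene Length and GC content values to be checked against the sequence itself.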