Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0015 |
Symbol | |
ID | 5736849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 18166 |
End bp | 19368 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277136 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001542795 |
Protein GI | 159896548 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0128011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCTTG ACGATGAAAT CCTCGACTCT GAGCTTGCGG GTGATGAAAC TGACGATTTT CCCGCTTTAC GCACACTCAG CGATCTTTTA GCCACTGGCA AAAAACGGGG CTATGTTACG CCTGGCGAAA TTCAGCAAAT TATCGATTCG CAGCGTGACG AGGATAAACA ACTCGAAGAA ATTCGCGATA CCTTGCAAAA GGCCAATATT CGAGTCGAGG AAACTGGCGA CGACGACGAT GAAATTACGG TCAGCGAAAA CCCTGAAGCC GATTTTTCCA ATGAGGGCAT CAGTGTCAAC GACACCGTGC GCTTATATTT GCGCGAAATC GGCTTAGTGC CTTTGCTCAA AGGCGAGCAA GAGGTTGAGC TAGCCCGTGC TATGGAAGAT GGCGATAAAG CCCAACAAGA GTTGATCGAG AACGATCATA CGCTTTCGGA TGTTGAACGC CTAGCCTTGC GCCGCCGCAT CGACCGTGGC GAGCAAGCTC GTGAACACTT GACCACCGCC AACTTACGTT TGGTGGTCAG TGTGGCCAAA AAATACATTG GCCGTGGGCT TTCGTTACTC GATTTGATTC AAGAAGGCAA TGTTGGCTTG ATTCGCGCGG TCGAGAAATT CAACTACACC AAAGGCTTTA AGTTCTCTAC CTACGCCACT TGGTGGATTC GCCAAGCAAT CACGCGGGCA ATCGCCGACC AAGCCCGTAC CATTCGGATT CCGGTGCACA TGGTCGAAAC GATCAACCGT ATGATGCGCA CAGCACGGCG TTTAACCCAA GAGAATGGCC GCGAACCTAG CGATGAAGAA CTGGCCCAGG CACTCGATCT GACGGTCGAG AAAGTGCGAT CAATTCGCAA AACCTCGATG GAGCCAGTGT CGCTCGAAAC CCCAGTTGGT CAAGAAGAAG ATAGCCAACT TGGCGACTTC TTGCCCGATG AAAAACTTGA AGCGCCTTCC GATGCAGCTT CGCATCAGAT GTTGCGCGAA CAAGTTGCTC AGGTACTCGA TCAACTGACC GAGCGTGAAA AGCGCGTGTT GAAGCTACGC TTTGGGCTAG AAGATGGAAC CCAACGCACA CTCGAAGAAG TTGGCAAAGA ATTTGGAGTA ACTCGCGAAC GAATTCGCCA GATCGAGGTG AAGGCGCTGC GCAAACTGCG TCACCCGCGC TTTGGCAAAA AACTCCGCGA TTATCTGGAA TAA
|
Protein sequence | MALDDEILDS ELAGDETDDF PALRTLSDLL ATGKKRGYVT PGEIQQIIDS QRDEDKQLEE IRDTLQKANI RVEETGDDDD EITVSENPEA DFSNEGISVN DTVRLYLREI GLVPLLKGEQ EVELARAMED GDKAQQELIE NDHTLSDVER LALRRRIDRG EQAREHLTTA NLRLVVSVAK KYIGRGLSLL DLIQEGNVGL IRAVEKFNYT KGFKFSTYAT WWIRQAITRA IADQARTIRI PVHMVETINR MMRTARRLTQ ENGREPSDEE LAQALDLTVE KVRSIRKTSM EPVSLETPVG QEEDSQLGDF LPDEKLEAPS DAASHQMLRE QVAQVLDQLT EREKRVLKLR FGLEDGTQRT LEEVGKEFGV TRERIRQIEV KALRKLRHPR FGKKLRDYLE
|
| |