Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0367 |
Symbol | |
ID | 5732218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 439342 |
End bp | 440517 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277490 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001543146 |
Protein GI | 159896899 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000117749 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTACCA GACGTAACGT AGTAGATTGC ATAGAGAATC CTGCGGGTTC AATAGATAGC TTTCGTGCGC CGCCAGCAGC GAAAGCCATA GTGGGCGTAG CCCAGCGAGC GTGGACGATG ACAAAGTATG AAACGATGAA TGTAGCAAAA AACCAACCGG AACTCGAAGA TGCCCCCGAG GGCTGGGAGC TTGAGCCGGG TTTTAATGAT GAAGACGTAG AATTAGCAAT TGTAAACCTA CGCCGCAAAA AAACGCTCTA TTCCGAGGCT GTCGAAGATT CAGTCCAGAT GTATTTGCGT GAAATCGGCG AGGTTGAGTT GCTCACCGCC GAGGAAGAAA TTACGCTGGC CAAGCAAATT GAAACTGGCC GCAAAGCGGA ACATCAGTTG CAAACCGAGC AGTATGCAAC CTGGGATGAA CGTACCGCCC TCGAACGCCG CGTCGATTTT GGTAATGAAG GCCGTCGCAA GCTGACCCAA GCCAATTTGC GCTTGGTCGT GAGCGTCGCC AAAAAATATA TTGGCAATCA CATGTCGTTC ATGGATTTGA TCCAAGAAGG CAATATTGGT TTGATGCGGG CAGTTGAGAA GTTCGATTAT CGCAAGGGTA ATCGCTTTTC AACCTATGCT ACGTGGTGGA TTCGCCAAGC AGTTACCCGT GCGATCGCTG AGCAAAGCCG CACCATCCGC TTGCCAGTCC ACTTGAGCGA ATCAATCGCC CAAATGCGCC GCGTCGCTTA CCAGCTTGAG CAAGCTTTGC AACGTGAAGC AACGCCCGAT GAATTGGCTG ATGCCTTGGG TGTAAGTTTG CGCAAGGTCA AGCGTATGCT GAATGCTTCG GTACAGCCAG TGTCGCTTGA GCAACCAATT GGCAAAGAAG GCCAAGGCCG CGTTGGCGAA TTTTTGGCCG ACGATACCGA TGAAGCACCG CTTGAGCAGG CCACCCGCAT GATGTTGCAA GATGAATTGG CCGATGCGCT CTCGCAATTG CCCGACCGCG AACGCCAAAT CTTGTTGTTG CGCTATGGTT TGGCCGATGG CAAACGCCGC ACGCTCGAAG AAGTTGGCGC TGAATTCGGG ATTACTCGCG AACGCACCCG CCAAATCGAG GCTGAAGCGA TGCGTCACTT GCGCCAACCC AATGTTGGTC AACACTTGCG AGCTTACCTT GATTAA
|
Protein sequence | MFTRRNVVDC IENPAGSIDS FRAPPAAKAI VGVAQRAWTM TKYETMNVAK NQPELEDAPE GWELEPGFND EDVELAIVNL RRKKTLYSEA VEDSVQMYLR EIGEVELLTA EEEITLAKQI ETGRKAEHQL QTEQYATWDE RTALERRVDF GNEGRRKLTQ ANLRLVVSVA KKYIGNHMSF MDLIQEGNIG LMRAVEKFDY RKGNRFSTYA TWWIRQAVTR AIAEQSRTIR LPVHLSESIA QMRRVAYQLE QALQREATPD ELADALGVSL RKVKRMLNAS VQPVSLEQPI GKEGQGRVGE FLADDTDEAP LEQATRMMLQ DELADALSQL PDRERQILLL RYGLADGKRR TLEEVGAEFG ITRERTRQIE AEAMRHLRQP NVGQHLRAYL D
|
| |