Gene Information |
Locus tag | Haur_4672 |
Symbol | |
ID | 5736519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5967897 |
End bp | 5969024 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281836 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001547431 |
Protein GI | 159901184 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
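The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive (which matches the stated gene length) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 5967897
END_BP = 5969024


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1128 375
```

The strand does not affect these lengths; it only determines which strand of the replicon carries the coding sequence.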
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGGGCAG CCCGTCAGCA TCGTTATCTA ACGCACGACC AAGTGCTGCT CCACCTTCCT CCCTCCGACA AAAACACAGC GCTGTTCGAC GAATTGTGTG TCCGCTGCCA AGCAGAAGAT ATTCCAATTT TGGAAGAACC GCCAACCGCT GCAGTGCATC TCCGTTCACG CGATGTCGAT GATGACGATC TCGAACGGTC TCTGATTCGG GAAGGGGTAA GCCTCGATGA TCCGGTGCGC ATGTATTTGC GCGAAATTGG CCAAGTGCAT CTCTTAACCG CCCACGAAGA AACCATGTTG GCCAAACAAT TGGAACGCGG CGAACGCGCA ATGATTCGGC TCGCCCGCCA CGAATATCTC CTCGAAGAAA AAGCCCGCCT GCAACAGCAA GTTCTCGAAA GCGAAGCAGC CCGTCAACAT CTCATCCAAG CCAATTTACG CCTCGTGGTG TCGATTGCAA AAAAATATGT TGGCCGCGGC ATGTCGTTTT TAGATTTAAT TCAAGAGGGC AATATTGGCC TGATGCGGGC AACCGAGAAG TTCGATTATC ACAAGGGCTT TAAGTTCTCA ACCTATGCAA CATGGTGGAT TCGCCAAGCA ATTACGCGCT CGATCTCTGA TCATAGTCGC ACTATTCGCC TGCCAGTTCA CGTGGGCGAG ACGATCAATC GGGTCAAGCG CACGGCCCAC AAGTTGCAAC AAGCCTTAGA GCGTGAGCCA AGTGCCGAGG AGATTGGCGA AGCCCTTGGG TTGCCCGCCG ATAAAGTGCG GCGCGTGCTC GAAGTTGCCC GTCAGCCAGT TTCGCTGGAA ACGCCCGTTG GCAACGAAGG CGATAGTGTG TTAGGCGATT TTATCGAAGA TGATCGCTCA TCCGAGCCAC TTGATCATGC GACCGAGCAT TTATTGCGCG AACAACTTGA TGAAGTGCTG TGCAAGCTCG ATGAGCGTGA ACGGCGGATT ATTGAGCTAC GCTATGGCTT GGTCGATGGC AAATATCGTA CCCTTGAAGA AGTTGGCCGC GAATTTGGCA TTACCCGCGA GCGAATTCGC CAAATTGAGG CCAAAGTACT GCGAAAACTG CGCCACCCAG CGTTTGGTGG GCGCAAATTG CGAGCCTATC TTGAATAA
Protein sequence | MGAARQHRYL THDQVLLHLP PSDKNTALFD ELCVRCQAED IPILEEPPTA AVHLRSRDVD DDDLERSLIR EGVSLDDPVR MYLREIGQVH LLTAHEETML AKQLERGERA MIRLARHEYL LEEKARLQQQ VLESEAARQH LIQANLRLVV SIAKKYVGRG MSFLDLIQEG NIGLMRATEK FDYHKGFKFS TYATWWIRQA ITRSISDHSR TIRLPVHVGE TINRVKRTAH KLQQALEREP SAEEIGEALG LPADKVRRVL EVARQPVSLE TPVGNEGDSV LGDFIEDDRS SEPLDHATEH LLREQLDEVL CKLDERERRI IELRYGLVDG KYRTLEEVGR EFGITRERIR QIEAKVLRKL RHPAFGGRKL RAYLE
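The relationship between the gene sequence and the protein sequence above can be checked with a short sketch. It assumes the gene sequence is listed in the coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. Alternative start codons, which table 11 permits, are not handled here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-sense DNA string, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGGGGGCAG CCCGTCAGCA TCGTTATCTA"))  # MGAARQHRYL
```

Passing the full gene sequence to `translate` and `gc_percent` should give values comparable to the protein sequence and GC content fields of the record.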