Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0955 |
Symbol | |
ID | 5732841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1094708 |
End bp | 1095559 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278087 |
Product | ECF subfamily RNA polymerase sigma-24 factor |
Protein accession | YP_001543731 |
Protein GI | 159897484 |
COG category | [K] Transcription |
COG ID | [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02957] RNA polymerase sigma-70 factor, TIGR02957 family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0246581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCATG TTGAGCTATT CGAAAGCTAT CGCGGCTACG TGTTTGGCAT TGCCTATCGC ATGCTTGGCA GCGTGATGGA TGCCGAAGAT TGTTTGCAAG AGACCTTTTT GCGCTGGCAA AGCCATGCCG AGCAAGTTGA AAACCCACGG GCTTGGCTGG CAACCGTGGC AACGCGCTGG TGCCTTGATC AGCTTAACAC GGCACGCCAT CAACGTGAGC AATACATCGG CGAATGGTTG CCAGAACCAT TAATTATCGC CGCGCCTAGC AGTAACGTAG AACAACTCGA ATCGTTATCA ACCGCATTTT TGGTGCTGCT TGAACGGCTT TCGCCAGCTG AACGCGCAGT TTTCTTGCTA CACAAAGTTT TTGGCTACGA ATATAGCGAA ATTGCCGAGA TTGTGGGCAA AACTCCAGCG GCCTGTCGCC AACTTGGCCA TCGGGCCGCC GACCATGTGG CCCAAGCTCG TCCACGCTTC AAGGTTGAGC CAAGCGTGCA GCAACGGCTG AGCGAACAAT TTTTACAAAG TTGTGCAACT GGCGATTTAA ATGGCTTGAT GAGTTTGCTT ACCGAGGATG TTGTATTGCG CAGCGATGGC GGCGGCGTAG TCCAAGCAGC GCGGAACCCA ATTTATGGGC CGACGGCAGT TGGGCGTTTC TTGCTCGGCG TATTGCCCAA ATTGCCTGCC AATAGCATAT TTGAGCCACG CCTGATCAAT GGTCAAGCTG GGTTTGTAGC ACTGGTCGAT GGGCAGGCCA GCGGCACGTT AATCTTAGAT ATGCTTGATC AACAGATTGC TGGGATTTAT ATTATGCTCA ACCCGCAAAA ATTACAGCAT CTTAAGCCTT GA
|
Protein sequence | MNHVELFESY RGYVFGIAYR MLGSVMDAED CLQETFLRWQ SHAEQVENPR AWLATVATRW CLDQLNTARH QREQYIGEWL PEPLIIAAPS SNVEQLESLS TAFLVLLERL SPAERAVFLL HKVFGYEYSE IAEIVGKTPA ACRQLGHRAA DHVAQARPRF KVEPSVQQRL SEQFLQSCAT GDLNGLMSLL TEDVVLRSDG GGVVQAARNP IYGPTAVGRF LLGVLPKLPA NSIFEPRLIN GQAGFVALVD GQASGTLILD MLDQQIAGIY IMLNPQKLQH LKP
|
| |