Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4940 |
Symbol | |
ID | 5736776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6268003 |
End bp | 6268998 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641282107 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001547698 |
Protein GI | 159901451 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.193797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTAGACA TTGCAATGCC CAAACTCGAA TGTGTCGCAG CGGCGGAGAA CTATGGGCGG TTCAAAATTG AGCCTCTGGA TCCCGGGTAC GGGCACACAT TAGGGAACGC GCTGCGACGG GTGTTGTTGT CGTCCATCCC TGGGGCAGCG GTCATCAAGA TTAAAATTGA TGGGATTTTC CACGAGTTTT CGCCAATTCA AGGCGTGCGT GAAGATGCTA CCGAGATTGT GTTAAATGTG AAAGGCATTC GTTTGCGCTC CTACGCCGAA CGCCCAGTCA AGATGATTTT ATCCAAGACT GGCCCAGGCG TGGTGCGGGC AAGCGATATT GAATGCCCTT CAAATGTGGA GATCGTTAAC CCCGATCACT ACATCGCAAC CCTCGATAGC TCCGATGCTC GCTTGGACAT GGAGTTGACC GTGGAACGTC ACCGCGGCTA TCTGCCAGCT GAAAATCGTG ACCCGGTGCC GATTGGCGAA ATCCCGGTGG ATGCGATCTT CACGCCGGTG CACAAGGTTA ACTACGTTGT GGAACACACC CGCATTGGCG GGATGACCGA CTTCGACCGT TTGTTGCTTG AAATTTGGAC TGATGGCACG GTTAAGCCAG GCGATGCTCT CAGCTACGCC GCGCAAGTGC TGGTTCAATA TTCATCGATC ATCGCCGATT TCAATCGCTT TGACGATGAA CAAGAGCAAG TTGGCGATGC CAATGGTCTG GTTATTCCCA GCGAGATCTA CGATATGCCT ATCGAAGATC TTGACCTTTC AACCCGCACC TACAACTGTC TCAAGCGTGC CGACATCACC AAGGTTGGGC AAGTGCTGGA GATGGACGAG AAGCAACTGC TGGCTGTGCG GAACTTGGGG CAAAAATCCA TGGACGAAAT TCGCGAGAAA TTAATCGAGC GCAATTTACT GCCAACCTTG CCTTTCAACT CCGCTATTCT GAACACCAAT GTCGCTGCCC GTCTGAACGA CGGTAGCGCT GAATAG
|
Protein sequence | MLDIAMPKLE CVAAAENYGR FKIEPLDPGY GHTLGNALRR VLLSSIPGAA VIKIKIDGIF HEFSPIQGVR EDATEIVLNV KGIRLRSYAE RPVKMILSKT GPGVVRASDI ECPSNVEIVN PDHYIATLDS SDARLDMELT VERHRGYLPA ENRDPVPIGE IPVDAIFTPV HKVNYVVEHT RIGGMTDFDR LLLEIWTDGT VKPGDALSYA AQVLVQYSSI IADFNRFDDE QEQVGDANGL VIPSEIYDMP IEDLDLSTRT YNCLKRADIT KVGQVLEMDE KQLLAVRNLG QKSMDEIREK LIERNLLPTL PFNSAILNTN VAARLNDGSA E
|
| |