Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0833 |
Symbol | |
ID | 4711402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 914449 |
End bp | 915447 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639855292 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001002411 |
Protein GI | 121997624 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGGGT ACCTGAAGGA CTTTCTCAAG CCGCGCTCGG TCGAGATCGA GCCGCTCTCG GACAACCGCG CCAAGGTGGT CCTGGAGCCG CTGGAGCGGG GCTTTGGCCA CACGCTGGGC AACGCCCTGC GGCGGCTGCT GCTCTCGTCC ATGCCCGGGA CTGCGGTCAC CGAGGTGGAG ATCGAGGGCG TACAGCACGA GTACACGGCG GTGGAGGGCA TCCACGAGGA CACTGTGGAC ATCCTCCTCA ACCTCAAGGA CCTCGCCGTG CGCCTCAACG AGCGTGACTC GGTGACGCTC AGCGTCGAGA AGCAGGGGCC GGGTTCGGTG ACGGCGGCGG ACATCGCCAC CGATCACGAC GTGGAGATCC AGAATCCGGA TCTGCACATC GCCACGATCA CCCACGAGCA GCCGTTCAAG GCGTCGCTGA AGATCGAGCG CGGCCGCGGC TACCTGCCGG TCACCGCGCG CGAGGAGGAA GACACCCGCA CCATCGGTCA CCTGGCCCTG GATGCGAGCT TCAGCCCGGT CCGCCGCGTC TCCTACGCGG TCGAGAGCGC CCGTGTCGAG CAGCGCACTG ACCTGGACAA GCTCGTCCTG GACGTCGAGA CCAACGGCGT GGTGACGCCC GAGGAGGCGG TGAAATTCGC GGCCAGCCTG CTGCGTGACC AGCTGTCGGT GTTCGTCGAC CTCGAGGGTG GGCTGCTGGA GGGCGGCGAC GAGCAGGAAG AGCCGGAGAT CGACCCGGTG CTGCTGCGTC CCATCGACGA CCTCGAGCTC ACGGTCCGTT CGGCCAACTG CCTCAAGGCT GAGAGCATCC ACTTCGTGGG TGATCTGGTT CAGCGCACTG AGGTCGAGCT GCTCAAGACG CCGAACCTCG GCAAGAAGTC GCTCAACGAG ATCAAGGACA CCCTGGCCGA GCACGGGCTG TCGCTGGGCA TGCAACTCGA TAACTGGCCG CCGCCGTCCC TCGGGGACCG CGCCCGTATC GCCGGCTAA
|
Protein sequence | MKGYLKDFLK PRSVEIEPLS DNRAKVVLEP LERGFGHTLG NALRRLLLSS MPGTAVTEVE IEGVQHEYTA VEGIHEDTVD ILLNLKDLAV RLNERDSVTL SVEKQGPGSV TAADIATDHD VEIQNPDLHI ATITHEQPFK ASLKIERGRG YLPVTAREEE DTRTIGHLAL DASFSPVRRV SYAVESARVE QRTDLDKLVL DVETNGVVTP EEAVKFAASL LRDQLSVFVD LEGGLLEGGD EQEEPEIDPV LLRPIDDLEL TVRSANCLKA ESIHFVGDLV QRTEVELLKT PNLGKKSLNE IKDTLAEHGL SLGMQLDNWP PPSLGDRARI AG
|
| |