Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1427 |
Symbol | |
ID | 4710325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1540818 |
End bp | 1541828 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639855894 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001002996 |
Protein GI | 121998209 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02394] RNA polymerase sigma factor RpoS [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.818946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCTCG ATACGGCGGT TGTCAGCGTG CACGCGTACG CTGATGTGGA CGCAGAGCGC GATCCCCCGA TGATCGATGA CCCCGGCGAG CTGATCCAGC CCTACGGCAT CGATGAGGGC AGCGAGCCCG AGCCGGCTCG CCGGCCCCGT AGGAGCCGAT CCAGCACGGA GTGTCGCACC GCGCTGGACG CCACGCAGCT CTACCTGAAT GAGATCGGCC ACGCCTCCCT GCTCACGGCC GAGGAGGAGG TCGCGCTGGC GAGACGCGTT CAGCAGGGCG ATGCGGCAGC CCGGGCGCGC ATGATCGAGA GCAACCTCCG GCTGGTGGTC AAGATCGCCC GGCGCTACAT GAACCGCGGC CTCGCCTTCC TGGACCTCAT CGAAGAGGGC AACCTGGGGT TGATCCGCGC CGTGGAGAAG TTTGATCCCG AGCGCGGGTT CCGTTTCTCG ACCTATGCGA CCTGGTGGAT CCGGCAGACC ATCGAGCGCG GCATCATGAA TCAGACACGC ACCATCCGGC TGCCGATCCA CGTCATCAAA GAGATCAACC AGTACCTGCG GACCCAGCGC CGCCTGACCC AGACACTGGA TCACGAACCC ACGGTCGATG AGATCGCCGA TGCTATGGGG CGTTCGCCGG AAGACGTCCG ACGCATGCGC GGTCTCAACG AGGGGACCAC CTCGGTGGAT GTGCCGATCG GCAAGGATTC CGACCGGGTG CTACTCGACG CCATTCCCGA CGAGACGCAG GGCATGCCGG AGAATGTCCT CGAGGACGAC GACGTCGTGC GGCATCTGCA AGACTGGCTC GGCTTCCTCA CTGACAAGCA GCGGGCCGTG CTGGAGCGGC GTTTCGGGCT CAACGGCCAC GAGCGCTACA CGCTCGAGCA GGTGGGCACG CAGGTGGGGG TGACCCGCGA GCGGGTACGA CAGATCCAGA TCGATGCACT GCGGCGGCTG CGCGAGCTCA TGGAGCGCGA TGGCTACTCC CAGGAGGCGG TCTTCGGCTA G
|
Protein sequence | MTLDTAVVSV HAYADVDAER DPPMIDDPGE LIQPYGIDEG SEPEPARRPR RSRSSTECRT ALDATQLYLN EIGHASLLTA EEEVALARRV QQGDAAARAR MIESNLRLVV KIARRYMNRG LAFLDLIEEG NLGLIRAVEK FDPERGFRFS TYATWWIRQT IERGIMNQTR TIRLPIHVIK EINQYLRTQR RLTQTLDHEP TVDEIADAMG RSPEDVRRMR GLNEGTTSVD VPIGKDSDRV LLDAIPDETQ GMPENVLEDD DVVRHLQDWL GFLTDKQRAV LERRFGLNGH ERYTLEQVGT QVGVTRERVR QIQIDALRRL RELMERDGYS QEAVFG
|
| |