Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1101 |
Symbol | |
ID | 4709935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1192926 |
End bp | 1194734 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639855572 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001002679 |
Protein GI | 121997892 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.248126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAGG ATCAACAGTC GCAAATCAAG CAGCTCATCG CCAAGGGCAA AGAGCAGGGC TTCCTAACCT ACGCAGAGGT TAACGACCAC CTCCCGGACG ATATCGTCGA CCCGGATCAA ATCGACGATA TCATCGGGAT GATCAACGAC ATGGGGATCA ACGTCCACGA GACGGCCCCG GACTCGGACG AACTGCTGCT CGCCGAAACC ACTGTCGCCA CAGACGAGGA CGAGGCCGAG GAGGCCGCCG CTGCCCTCGC CGCAGTGGAC GCGGAGTTCG GCCGCACCAC GGACCCGGTG CGCATGTACA TGCGCGAGAT GGGCAGTGTC GAGCTGCTCA CGCGCGAGGG CGAGATCCAG CTGGCCAAGC GCATCGAGGA CGGGCTCGAC CGCGCCCTGG CCGCGCTGTC CTCTTACCCG GAGGCAGCCC GCCAGCTGAT CATGCTCTAC GATCGCGCCC AGGAGGGCGA GGCCCGGCTA ACCGATATCG TCGCCGGATT CCGGGATCGC GAAGACGACT CCGAGCAACC GCCGGAGGCC CCTGCCGCCC CGGAGGCCGC TGCGGACGAG GACGAGGAGG CGACCAGCGC CGAGACCGGC CCCGACCCGG AGGCCGTGGC CGAGCTCTTC GAGCGTCTGC GCGCCGCCTA CAACGAGATG CAGGAGGTCC TGGCCACTGA GGGCTCGGCG TCCCCTCGCA TTGCCGAGCT GCGCGGCGAA CTGGAAGAGG TCTTCCTCAG CATCAAGTTC ACCCCCAAGG TGGTCGACGC GGTGGCCGAC CGGCTGCGCG GCACGGTGGA TACGATCCGC GCCCGTGAGC GCGAGATCAT GAACCTGTGC ACCCGCGAGG GGCGCATGGC GCGCAAGGAG TTCGTCAAGA GCTTCCAGGA CCGCGAGACC GACCCCACCT GGCTGGACGA CCTGCTGGCC GAAGGCGAGG AGCGTGCCGA GCGGCTGCAG CCCCACGCCG AGGCGATCCG CAAGGCGCAG TCAGAACTCA AGGAGATCGC CGACAAGAAC GGCCTCTCGG TGGCGGAGAT CAAGGAGATC AACCGCCGCA TGTCCATCGG CGAGGCCAAG GCCCGCCGCG CCAAGAAGGA GATGGTCGAG GCCAACCTGC GCCTGGTGAT CTCCATCGCC AAGAAGTACA CCAACCGCGG GCTGCAGTTC CTCGACCTGA TCCAGGAAGG CAACATCGGC CTGATGAAGG CGGTGGACAA GTTCGAGTAC CGGCGCGGCT ACAAGTTCTC GACCTACGCC ACCTGGTGGA TCCGGCAGGC GATCACCCGC TCCATCGCCG ACCAGGCGCG GACCATCCGC ATCCCGGTGC ACATGATCGA AACGATCAAC AAGCTGAACC GGGTGTCGCG GCAGATGCTC CAGGAGATGG GCCGCGAGGC CACCCCCGAA GAGCTGGCCG AGCGCATGGA GATGCCCGAG GACAAGGTGC GCAAGGTCCT CAAGATCGCC AAGGAGCCGA TCTCCATGGA GACGCCGATC GGCGACGACG AGGACAGCCA CTTGGGCGAC TTCATCGAGG ACACCAGCGT CACCTCGCCG GTGGACTCGG CCACCTCGGA GGGACTGCGC GAGTCGGTCC GCGAGGTGCT CTCGGGGCTG ACCCCGCGCG AGGCCAAGGT CCTGCGCATG CGCTTCGGCA TCGACATGAA CACCGACCAC ACCCTGGAGG AGGTCGGCAA GCAGTTCGAC GTCACCCGTG AGCGGATCCG GCAGATCGAG GCCAAGGCGC TGCGCAAGCT GCGCCACCCC ACCCGCTCCG ACGGGCTGCG CAGCTTCCTC GAAGAGTAA
|
Protein sequence | MSQDQQSQIK QLIAKGKEQG FLTYAEVNDH LPDDIVDPDQ IDDIIGMIND MGINVHETAP DSDELLLAET TVATDEDEAE EAAAALAAVD AEFGRTTDPV RMYMREMGSV ELLTREGEIQ LAKRIEDGLD RALAALSSYP EAARQLIMLY DRAQEGEARL TDIVAGFRDR EDDSEQPPEA PAAPEAAADE DEEATSAETG PDPEAVAELF ERLRAAYNEM QEVLATEGSA SPRIAELRGE LEEVFLSIKF TPKVVDAVAD RLRGTVDTIR AREREIMNLC TREGRMARKE FVKSFQDRET DPTWLDDLLA EGEERAERLQ PHAEAIRKAQ SELKEIADKN GLSVAEIKEI NRRMSIGEAK ARRAKKEMVE ANLRLVISIA KKYTNRGLQF LDLIQEGNIG LMKAVDKFEY RRGYKFSTYA TWWIRQAITR SIADQARTIR IPVHMIETIN KLNRVSRQML QEMGREATPE ELAERMEMPE DKVRKVLKIA KEPISMETPI GDDEDSHLGD FIEDTSVTSP VDSATSEGLR ESVREVLSGL TPREAKVLRM RFGIDMNTDH TLEEVGKQFD VTRERIRQIE AKALRKLRHP TRSDGLRSFL EE
|
| |