Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2269 |
Symbol | |
ID | 6375963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2452311 |
End bp | 2453297 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642684755 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001960654 |
Protein GI | 189501184 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.32742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0326788 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATACC AAATGCAGAT GCCTGAGAAA ATCGATGTCG ATGAAGGTAC ACATAATGAT AAAAACGGAA TATTTATAGC GCAGCCGCTT GAGCGTGGCT ATGGTGTGAC ATTGGGCAAT GCAATGCGCA GAGTGCTTCT TGCTTCTCTG CCGGGTACGG CTATTACCGG AATTAAAATT GATGGTGTTT TCCATGAGTT CTCCGCAATA GACGGGGTGC GTGAGGATGT ACCGGATATT ATTCTGAACC TTAAAAAAGT ACGGTTCAGA TCTTCAACCA AGAGGAGTTG TAAAACGACT GTGACAGTGG ACGGTCCTGC AGATGTGACC GCAGGAGATA TTGTCGCACA GGAAGGCGAA TTTGAAGTAC TGAACACCGA TCTGCATATA GCTACGGTAA ATGAAGGGTC GAGGCTGAGC ATGGATGTGT ATATCGGGCG GGGCAGAGGG TATGTTCCGG CTGAGGACAA CCGCGGTGAC GGGATGCCTT TAGGATTTAT CGCAATCGAT TCGATCTTTA CTCCTATTAA AAACGTTAAA TTCTCAGTCG AGAATACACG TGTGGGACAG CGTACCGATT ACGAGAAAAT GATACTCGAT GTAGAGACAG ACGGTTCTGT ATCACCTGAT GATTCGATCA GCCTTGCAGG CAAGATCATC AACGAGCATG TTTCACTGTT TGCGAACTTT TCGCCGACGG ATGAAGAGTT TACCGAAGAG GAATACAAAC AGCAGGACGA TGAGTTTGAA AACATGCGTA AAATGCTCCT GACACGAATT GAAGATCTGG ATCTTTCCGT TCGTTCTCAT AACTGCTTGA GGCTTGCTGA AATAGATACA CTTGGTGATC TTGTTTCCCG CAAAGAAGAT GAGCTTCTGA CCTACAAGAA TTTTGGTAAA AAGTCCTTGA CTGAGCTGAA AGAGCAGTTG GAGAAGTTTG AGTTGAAGTT TGGTATGGAT ATTACCAAAT ATCAGATGAA AGGCTAA
|
Protein sequence | MIYQMQMPEK IDVDEGTHND KNGIFIAQPL ERGYGVTLGN AMRRVLLASL PGTAITGIKI DGVFHEFSAI DGVREDVPDI ILNLKKVRFR SSTKRSCKTT VTVDGPADVT AGDIVAQEGE FEVLNTDLHI ATVNEGSRLS MDVYIGRGRG YVPAEDNRGD GMPLGFIAID SIFTPIKNVK FSVENTRVGQ RTDYEKMILD VETDGSVSPD DSISLAGKII NEHVSLFANF SPTDEEFTEE EYKQQDDEFE NMRKMLLTRI EDLDLSVRSH NCLRLAEIDT LGDLVSRKED ELLTYKNFGK KSLTELKEQL EKFELKFGMD ITKYQMKG
|
| |