Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_2202 |
Symbol | |
ID | 6355996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 2442455 |
End bp | 2443441 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642669793 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001944205 |
Protein GI | 189347676 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00000991185 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATACC AGATGCAGAT GCCTGCTAAA ATAGAGGTCG ACGAAGCTAC TCATACTGAC AGGTTCGGTC GTTTTGTCGC GCAGCCTCTC GAGAGGGGTT ATGGTGTAAC CCTCGGTAAT GTGATGAGAA GAGCGCTTCT GGCTTCACTT CCGGGAACTG CAATAACAGG TTTAAAAATA GATGGCGTTT TTCATGAGTT TTCGACCATA AACGGTGTCA GGGAAGATGT TCCCGAGATC GTTCTGAATC TTAAAAAGGT TCGGTTCCGT TCGAACTGCA AGAGAAACTG CAAGACATCA TTGGTGCTGA ATGGGCAGAA AGAGTTCACC GCAGGCGATA TTATCGCTCA GGAAGGTGAG TTCGAGGTAC TCAATAAAGA CCTTCATATT GCCACCGTGA ATGAAGGGGC TACACTGAAA ATCGATATCT TCGTCGGAAG AGGACGCGGA TACCTTCCAT CCGAAGAAAA CCGTCCTGAT GGCATGCCGA TAGGGTTTAT CGCTATCGAT GCTATTTTCA CTCCTATAAG GAACGTGAAA TTCACAGTTG AAAATACCCG TGTGGGACAG CGTACCGACT ACGAGAAAAT GATTCTCGAT GTCGAAACAG ACGGTTCGAT CACTCCGGAT GACTCTATCA GTCTTGCCGG AAAGATCATC AATGAACATG TCACCTTTTT TGCTGATTTC TCTCCGACCG AAGAAGAGTT TACCGAAGAG GAGTTCAAGC AGCAGGATGA TGAGTTCGAG AATATGCGCA AGCTTTTCAA TACAAAGATT GAGGATCTCG ATCTTTCGGT ACGTTCTCAC AACTGTCTCC GTCTTGCAGA AATCGATACT ATCGGAGATC TCGTATCGAG AAAAGAAGAT GAGCTGCTTA ACTATAAAAA TTTCGGCAAG AAGTCTCTGA CCGAGCTGAA AGAGCAGCTT GAGAAGTTCG ATCTGAAATT CGGTATGGAC ATTACCAAGT ATCAGATGAA AGGGTAA
|
Protein sequence | MIYQMQMPAK IEVDEATHTD RFGRFVAQPL ERGYGVTLGN VMRRALLASL PGTAITGLKI DGVFHEFSTI NGVREDVPEI VLNLKKVRFR SNCKRNCKTS LVLNGQKEFT AGDIIAQEGE FEVLNKDLHI ATVNEGATLK IDIFVGRGRG YLPSEENRPD GMPIGFIAID AIFTPIRNVK FTVENTRVGQ RTDYEKMILD VETDGSITPD DSISLAGKII NEHVTFFADF SPTEEEFTEE EFKQQDDEFE NMRKLFNTKI EDLDLSVRSH NCLRLAEIDT IGDLVSRKED ELLNYKNFGK KSLTELKEQL EKFDLKFGMD ITKYQMKG
|
| |