Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_0207 |
Symbol | |
ID | 3745670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 201622 |
End bp | 202608 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637768245 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_374138 |
Protein GI | 78186095 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00645161 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATACC AGATGCAGAT GCCTGCTAAA ATAGAGGTTG ACGAGGCTAC CCATACTGAC CAGTACGGCC GTTTCATTGC CCAGCCGCTC GAGCGCGGTT ATGGAGTAAC GCTCGGCAAC ATGATGAGAC GTGTGCTTCT TGCCTCCCTT CCCGGGACTG CGATCACCGG AATCAAGGTG GATGGTGTAT TCCATGAGTT TTCTTCGATA GAAGGGGTCC GCGAAGATGT TCCGGAAATC GTGCTCAACC TGAAGAAGGT CCGATTCAAG TCAACCTGCA AGAGAAGCTG CAAGACCACG CTCAGCATCG TCGGCCCGAA AGACTTCACC GCCGGTGACA TCGTTGCGCA GGAAGGTGAG TTCGAGGTGC TGAACAAGGA TCTTTACATC GCTACCGTGA ACGAAGGATC CACGCTGAAC ATCGATGTCT ACATCGGACG CGGACGCGGC TACACTCCTG CTGAAGAGAG CCGTCCTGAC AGCATGCCGA TCGGTTACAT CGCAATTGAT GCGATCTATA CCCCGATCCG CAATGTGAAG TTCGCCGTTG AGAACACCCG TGTGGGACAG CGGACCGATT ACGAGAAAAT GGTGCTTGAT GTCGAGACTG ACGGCTCCAT CACTCCCGAT GACTCCATCA GCCTTGCTGG CCGGATCATC AATGAGCATG TGACCTTCTT CGCCAACTTC TCGCCGACCG AGGAGGAGTT CACCGAGGAG GAGTACAAGC AGCAGGATGA CGAGTTCGAG ACCATGCGCC GTCTCTTGAA CACCAAGATC GAGGATCTCG ATCTTTCTGT TCGTTCGCAC AACTGCCTCC GCCTTGCAGA GATCGACACC ATCGGTGACC TGGTTTCACG GAAGGAGGAC GAGCTTCTCA ACTACAAGAA CTTCGGCAAG AAGTCGCTGA CTGAACTGAA GGAACAGCTG GAGAAGTTCG AACTCAAGTT CGGCATGGAC ATCACCCGCT ACCAGATGAA GGGATAA
|
Protein sequence | MIYQMQMPAK IEVDEATHTD QYGRFIAQPL ERGYGVTLGN MMRRVLLASL PGTAITGIKV DGVFHEFSSI EGVREDVPEI VLNLKKVRFK STCKRSCKTT LSIVGPKDFT AGDIVAQEGE FEVLNKDLYI ATVNEGSTLN IDVYIGRGRG YTPAEESRPD SMPIGYIAID AIYTPIRNVK FAVENTRVGQ RTDYEKMVLD VETDGSITPD DSISLAGRII NEHVTFFANF SPTEEEFTEE EYKQQDDEFE TMRRLLNTKI EDLDLSVRSH NCLRLAEIDT IGDLVSRKED ELLNYKNFGK KSLTELKEQL EKFELKFGMD ITRYQMKG
|
| |