Gene Information |
Locus tag | Cwoe_5871 |
Symbol | |
ID | 8736347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 6283730 |
End bp | 6284890 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 646506497 |
Product | putative RNA polymerase, sigma 70 family subunit |
Protein accession | YP_003397646 |
Protein GI | 284047306 |
COG category | [R] General function prediction only |
COG ID | [COG2071] Predicted glutamine amidotransferases |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
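The coordinate and length fields above are related by simple arithmetic, and the following minimal sketch shows one way to cross-check them. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(6283730, 6284890)  # Start bp / End bp from the record
    print(length, "bp")                      # 1161 bp
    print(protein_length(length), "aa")      # 386 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.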
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.41765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCTCG ACCTCCTCAT CGCCACCCAC CGCCGCCCGA TCGAGCGCCA TCTGCGCCGC TACGTCGGCG ATCCCGGGCT TGCCGAGGAC CTCGCCCAGG AGGTCTTCCT GCGGGCCTGG CTGCACGCTC CCCGCGACGT GAGCGAGGAG CGCCAGCGCG CCTGGCTCTT CCGCGTCGCG CGCAATGCCG CGATCGACGC GCTGCGCGCC CGTCGTCCGC ACGACGACAC GTCACTGCTC GACGCCGTGG GCGGCGTCGC GGCGGCGGCT GCCGAGGACC ACGACGAGCG GCTCGCGATC GAGGCCGCGC TCGCGCAGCT GCCGGCGCGC GACCGCGCGC TCGTGACGCT CCAGTTCGCC GGCTTCGGCC CGACCGACGC CGCGCGGCTG CTGCAGACGA CGCCGGAGGC CGCGCGCAAG CGGCTGACGC GGGCGCGCGA GCGCTTTCGC GTCGTCTACA CCGACCAACG CCCGGTCGGC GAGCCGCCGC TGGTGCTGCT CGTCGCGCGC GACACGGACA CCGCGCTGTA CGAGCGGTGG CTCGGCGCCG CGGAGCTGCG CGTGCGCGTC GTCGCGCCCG CGGACGCGGC GCGCCAGCTC GCGACCGCGC ACGCGCTCGT GCTGTCCGGC AGCGAGCGCG ACATCCACCC CGCGATGTAC GGGCAGCCGG TGCGCAGCGC GACGGATCCC CGTCTCGACG TCGACCGCGC CGACGCCGCG GTGCTGCGCG ACGCGCTCGC GACGCGGATG CCGGTGCTCG GGATCTGCCG CGGCCACCAG CTGCTCAACA TCGTGCGCGG CGGCACGCTG CACCAGGACC TCAGCGAGCA GTCCAGCGCC GCGGACGGCC ACGCGCAGGG GACGCACCGG ATCGGCACGC GCGGCAGCAC GCTCGCGCGG CGGATCCTCG GCGCGCAGGG CGCGGTGCCG AGCGTCCACC ACCAGGCCGT CGCGCGGATC GGGCGCGGTC TCAACGTCGG CGCGATCTCG GCCGACGGCG TCGTCGAGAC GATCGAGGAT CCGCGGCTGC CGTTCGCGGT CGGCACGCAG TGGCACGCCG AGCTGCCTGA GGCCGGCGAG ACCGGCCGGC GGCTGCGCGA CGCACTCGCC CAGGCAGCGT TCCGCCATGC GGGCGCCGCG CCGCTGCCCG TCGCGGCCTG A
|
Protein sequence | MDLDLLIATH RRPIERHLRR YVGDPGLAED LAQEVFLRAW LHAPRDVSEE RQRAWLFRVA RNAAIDALRA RRPHDDTSLL DAVGGVAAAA AEDHDERLAI EAALAQLPAR DRALVTLQFA GFGPTDAARL LQTTPEAARK RLTRARERFR VVYTDQRPVG EPPLVLLVAR DTDTALYERW LGAAELRVRV VAPADAARQL ATAHALVLSG SERDIHPAMY GQPVRSATDP RLDVDRADAA VLRDALATRM PVLGICRGHQ LLNIVRGGTL HQDLSEQSSA ADGHAQGTHR IGTRGSTLAR RILGAQGAVP SVHHQAVARI GRGLNVGAIS ADGVVETIED PRLPFAVGTQ WHAELPEAGE TGRRLRDALA QAAFRHAGAA PLPVAA
|
| |
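The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below is a hedged illustration of how the GC content and the translation could be recomputed from the gene sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs in its set of alternative start codons, which this sketch ignores (a GTG or TTG start would be rendered as V or L rather than M).

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three display blocks of the gene sequence from the record.
    prefix = "ATGGATCTCG ACCTCCTCAT CGCCACCCAC"
    print(translate(prefix))  # MDLDLLIATH
```

Passing the full gene sequence to `gc_content` and `translate` allows the results to be compared with the GC content field and the protein sequence listed in the record.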