Gene Information |
Locus tag | Cwoe_4597 |
Symbol | |
ID | 8735063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4899720 |
End bp | 4901324 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 646505226 |
Product | RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003396385 |
Protein GI | 284046045 |
COG category | [K] Transcription |
COG ID | [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
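The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 4899720
END_BP = 4901324


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS whose length is a multiple of 3."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows of the record.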
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCCT GGCTCTCGGA CACCCTGCTG CGGACGCAGA CCGACGAACG GCTCGTCACG CTGACGCGCG CCGGTCACGA GCGCGCGTTC GCGGCGATCG TCGAGCGCTA CCGCCGGCCG CTCGTGGCGT TCGCGCGGCG GATCGCGCCG GACAGCCGTG CCGAGGACGT CGTCCAGCAG GCGCTCACGA GCGCCTGGGC GGCGCTCGCC GCCGGGGCCG AGGTGGCTCA CCTGCGCGGC TGGCTGCACC AGATCGTTCG CCACGAGGCG ATTCGGATCG CCAAGCAGGA AGGGCGGGCG ATCGTCGACC CGCTCGCGGA GGAGGAGACG CACGCGCAGG CGGGCGGCCG TGACGTCGCC GCGGCCGCGG AGGAGAACGA GCGCGTGCGC GAGGCGCTCA GCGGCATCGC GGGGCTGCCC GACCAGCAGC GCGAGGCGCT TGTGCAGACG ACGCTCGCCG GTCGCAGCCG CGGCGAGGTC GCCGCGGCGC TCGGCCTCAG CGAGGGCGCG GTCCGTCAGC TCGTCCACCG AGCGCGCACG ACGCTGCGCG CCGCCGCGAC CGCGCTGACG CCGCTGCCGC TCGCGACGTG GGCCGCTGGG ATGGCCGGCG GCGGCGGTCC CGGCATCGCC GAGCTGGCGG CCGGCGCCGG TGGCGCGACG CTTGCGGGAA CCTTCGTGAA GGCGGGCGCC GTCGTCGTCG CGACCGGCGC GATCGCGACC GGCGTGACCG TCACGAGAGA CCGCCGCCCG GTCGATCCCC AAGCCGCGCA GGCGCGCGGC GGCGAGACGG CGGCGCGCCC GGCGGCCGCG GGCGGTGCCG GCAGCGGCGG CGTGCTCGTC CCCGCGGCGA TCGCCCCGTT CGGCGGCAGC GGCAGCGGCG GAGGCTTCGC GAGCGACGCC GCCGAGGACC GCCGCGGCCG CGGGCGCGGC GGCGACGACG ATCGCGACGA CGACCGGCGC GGGCGCGACG ACGATGACGA CCGGCGCGGT GGGGACGACG ACGATCGGCG CGGTGGGGAC GACGACCGCG ACCGCGACGA CGACCGGCGC GGCCGTGGTG ACGACGACCA CCGCGGCCGG GGCGATGACG ACGACCGCAG GCATGGTGAC GACGATCGCC GCGGTCGGAG CGGCGACGAC CGGCGTGGCG GCGAGCGCGA GGGCGATGAC CGGCGCCACG GTGGTGAGGA CGACGACGAT CGCGGTGGGC CGGCGCCGCG GGCCGATCGT GCAGGCGACG ACCGCTCCGG GCGCGGCCGC GGCTCCGGCG GAGACGATCG CGACGACGAC GCCGGCTCGC CCGGCTCCGG CGGCAGCTCG TCGGGCAGCG GCAAGCGGAG CGCCCCCGCA CGCGAGGCGG AGGACCGCGA GTCCGGCGGC AGCGGCTCCT CCGGCGCGGG CGGCTCCACC GGTGGAAGCG GCTCGTCTGG CGGAGGCCCG TCCGGCGGGG GAGGTTCCTC GGGCGGCAGC GGCTCCTCCG GCGGCAGCGG CTCGTCCGGC GGAAGCGGCT CCACGTCCGG CAGCGGCTCC TCCGGCGGAG GCGGCTCGTC CGGTGGCGAA GACGACTCGG GCGGCGGCGA CGACTCCGGC GGCGACGACG ACTGA
|
Protein sequence | MKPWLSDTLL RTQTDERLVT LTRAGHERAF AAIVERYRRP LVAFARRIAP DSRAEDVVQQ ALTSAWAALA AGAEVAHLRG WLHQIVRHEA IRIAKQEGRA IVDPLAEEET HAQAGGRDVA AAAEENERVR EALSGIAGLP DQQREALVQT TLAGRSRGEV AAALGLSEGA VRQLVHRART TLRAAATALT PLPLATWAAG MAGGGGPGIA ELAAGAGGAT LAGTFVKAGA VVVATGAIAT GVTVTRDRRP VDPQAAQARG GETAARPAAA GGAGSGGVLV PAAIAPFGGS GSGGGFASDA AEDRRGRGRG GDDDRDDDRR GRDDDDDRRG GDDDDRRGGD DDRDRDDDRR GRGDDDHRGR GDDDDRRHGD DDRRGRSGDD RRGGEREGDD RRHGGEDDDD RGGPAPRADR AGDDRSGRGR GSGGDDRDDD AGSPGSGGSS SGSGKRSAPA REAEDRESGG SGSSGAGGST GGSGSSGGGP SGGGGSSGGS GSSGGSGSSG GSGSTSGSGS SGGGGSSGGE DDSGGGDDSG GDDD
|
| |