Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4578 |
Symbol | |
ID | 8335932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5209655 |
End bp | 5210914 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644957679 |
Product | putative RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003115281 |
Protein GI | 256393717 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.638987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.470788 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCGG CTCAGGACAT CGAAGCCGTC TTCCGTGCCG AGTACGGCCG GGCGGTGGCT GTCCTGGTGC GCGCTTTCGG CGACATCGAC CTCGCCGAGG AAGCCGTCCA GGACGCCTTC GCCACCGCCT TGCAACGCTG GCCGGCAGAG GGTCCGCCAC CGTCTCCGGC GGGCTGGATC ATCACCACGG CCCGCAACCG CGCCATCGAC CGTCTTCGCC GTGAAGCGCG CCGCGAGGAC CACTATGCCC AAGCGGCGCT GCTGCACGCC GCCACCGATC CCGGCGCCGA TCGCCACCCC GCCACCGAAT CAGAACAGGA GGACCCGGTG CGCGACGACC GCCTCCGCCT GATCTTCACC TGCTGCCACC CGGCTCTGGC CGCGCCGACC CGCGTCGCCC TGACTCTGCG CCTGCTCGGC GGCCTGACCA CCGCCGAGAT CGCCCGCGCG TTCCTGGTCA GCGAGCAGAC CATGTCCCAG CGCCTGGTGC GCGCGAAGGG CAAGATCCGC GCCGCGCGCA TCCCGTACCG CATCCCCTCA GAGGCCGAAC TGCCGGACCG GCTGCGCGCC GTGCTCGCCG TCGTCTACCT GATCTTCACC GAAGGACACT CGGCCACCTC CGGGGAGAAC CTGGTCCGCG CCGACCTGTG TGCCGAGGCG ATCCGCCTGG CGCGCCTGCT GGTCGAGCTG ATGCCCGACG AACCCGAGGC CGCCGGTCTG CTCGCGCTGC TCCTGCTCAC CGAGGCACGC CGCCCGGCCC GCGTCGCCCC CGACGGCGCG CTCATCCTGC TCGGCGACCA GGACCGCGCG CGGTGGGACA GGGCCCTGAT CGAGGAGGGG CAAGACCTCC TGCGCTCGTG CCTGCGGCGC AACCAACCCG GGCCGTATCA GCTCCAAGCC GCCATCAACG CCGTCCACAG CGACGCGGCT CGCGCCGAGG ACACCGACTG GCTCCAGATC CTGACTCTCT ACGACCGGCT GCTGGCCGTC GAGCCGACCC CCGTCGTCGC CCTCAACCGA GCCGTCGCCC TCGCCGAAGC CCACGGCATC GCCCCAGCGC TGGACGTCAT CGAAGCACTC GCCACGCCCC TGGCCGAATA CGGTCCCTAC CACGCCGTGC GCGCCGACCT CCTGCGCCGC GCCGGCCGCC GGAGCGAAGC CGCCGCCGCG TACGAACAAG CCGCGGCGCG CGCCGGGAAC GCCAGCGAGC GCGCGTTCCT GCTGGCGCGG CGGAGCGCTC TGACCGACTC GCCGGATTGA
|
Protein sequence | MTSAQDIEAV FRAEYGRAVA VLVRAFGDID LAEEAVQDAF ATALQRWPAE GPPPSPAGWI ITTARNRAID RLRREARRED HYAQAALLHA ATDPGADRHP ATESEQEDPV RDDRLRLIFT CCHPALAAPT RVALTLRLLG GLTTAEIARA FLVSEQTMSQ RLVRAKGKIR AARIPYRIPS EAELPDRLRA VLAVVYLIFT EGHSATSGEN LVRADLCAEA IRLARLLVEL MPDEPEAAGL LALLLLTEAR RPARVAPDGA LILLGDQDRA RWDRALIEEG QDLLRSCLRR NQPGPYQLQA AINAVHSDAA RAEDTDWLQI LTLYDRLLAV EPTPVVALNR AVALAEAHGI APALDVIEAL ATPLAEYGPY HAVRADLLRR AGRRSEAAAA YEQAAARAGN ASERAFLLAR RSALTDSPD
|
| |