Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4098 |
Symbol | |
ID | 8335452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4625229 |
End bp | 4628366 |
Gene Length | 3138 bp |
Protein Length | 1045 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644957201 |
Product | protein serine phosphatase with GAF(s) sensor(s) |
Protein accession | YP_003114803 |
Protein GI | 256393239 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.108116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGTCGG GGATTCTGGC GTCGTGGCGG CGGTGTCGGG CCGGTGGGTT GGGGCCCGAG GATGTCGATC TGCCGTATGA GCCGGACACC GGTACTGAGG AGTCGCTGCT GCGGGCTTCG GCGCCGGTGT TGGAGCGGCT GCACGCGCTG TTGATCGATA CGCCGGTGTG CGTGGTGTTG AGCGATGCGG CGGCTCGGAT TCTGGTGCGG CGGGCGGGTG AGCCGGGGTT GAACCGGCAC TTGGACGCCG TGCAGCTGGC GGAGGGCTTC AGCTATCACG AGGCTGACGC CGGGACCAAT GGCATCGGGA CGGCTTTGGC CGAGGGGCGG CCGGCTGTGG TGCTCGCGGG GGAGCACTTC GCCGATCGGT TCTTGGCGTT CGTGTGTGCC GGGGTGCCGA TTCGCGATCC CTTCAGTGGA CGGATTCGGG GCGTCGTGGA CCTCACGTCG TGGCGTCGGG ATGCCAGTCC GCTGATGGCG GCGCTGGCGG CGGAGGCCGC GGAGAACATC GAGCTGCGGC TGCTGGAGCA GTACTCGGCG AAGGAACGGG CGTTGTTGGC CCAGGCACGG CGGGCCGGTG GGATCACGGT CGGCGGCGGC TGCGGCGAGG GCAGCACCAG GAGCGGCGAC GTCGGCGAAG CGCGGCGGAC CCGGGGTGAG CTGGTGCTCG CCACCCGGCA GCGGCACGGC GCCGACCCCG GCCACGGCGA CCGCCGCGAC CGACAGCTCG TACGGGGCAA GGCGGCCGAA CTGGTCGCGG CGGCCAGACG CGACGTGGTG ACGATCGCGT TGCCCGGCGG CCGCCACGCC AAGTTGACGG CCCGCACCGG TCGCACCGCG GCCGGCACGG AGGTGGTCAC GGTGGAAGCG GAGATGACGA GTGCCCCGGA GATGACGACG GGTGCTCTGG ATACGGCAAA GGAAATGTCT TCGGATGCGT CGACGGATAT GCCGATGGAT ATGCCCATGG GTGTGTCGGT GGGTGTATCG GTGGGTGTGT CGACCGGCGA GCTGGAGACG GCCGCTGATG CGCTGGGGAT GACAGCCGAG GAACTGGAGA CGGCCGGCGG TTGCGAGACC GCTGGTGCTG CGGCGAGGTT GCCGGCTGCG AGGACGCCGG ATGGGATGGG CGGCGCAAGC GGTTCGGGTG GCTCGGAGCA CCATCAGGCG CAGATGATCG CACTGCGGAT TGAACCGGTG GAGATGGCGA TGTCCGGTGT GCAGGCGGGG AGGGAAGCCA GATCAGTCGG GGAATGCGTT GCGACACAGA TCAATAACAG CCTCCCGCAC CCGACCGCCC CAGAACCAGA AGCCGAGCCC GGCCGCGACC CTCAACCAAC GCCCCCTACA TCAACCCCAC CCCCAGCCTC CGGCCACCAA ATCCCCCGCC CCGCCGGCGC CACCACCCCC CACCCCTACC CTGGCGCGCC GAATCCCGAG CTGGTCATGG TCGGGGAGCC GACCGTCGGG CGCATCGCGG TCGCCGCGCG CCAGCGCCTG GCGCTCCTCC TGGATGCCAG CGGCCGGATC GGCACCACGC TCGACATGGA GCTCACCGGT GGTGAGCTGG CCGAGATCGC GCTCCCGGAC TTCGCGCAGC ATGTCGCGGT GGACCTGGCG AGCTGGGTTC TGGACGGCGA GGAGTGCCTT CCGCCCGGCA GCGAAGTGAA GTTGCGGCGG GTTGCGGTGC GCACCGTCCG GCGCGGCAAG GTGCATCGTC CGGCGGGCGC GCATGTCGCC TACAACGCCG CCACCGCGCA GGCGCGCAGC ATGGTTCAGC GGCATCCGGT GCTGGACGCG CAGCTCACCG CCGCCACCGC CGGGAAGCGC TGGGTCGCCG AGGATCCCGA CAGCTCGGTG ATCGCCGCGC CGATCATGGT GCAGGGCATC GTGCTCGGCG TGGCGTCCTT CTACCGCCTA GGGAATGCCG ATCCTTTCGA CGAGGACGAC CTCCAGCTGG CCGGTGACCT CGCCTCGCGC GCGGCCGTGT GCCTGGACAA CGCGCGCCGC TTCGCCCGGG AGCGTGCGAT GGCGCTGGCG TTGCAACGCA GCCTTCTGCC ACGCGCCTTC CCGATGCAGT GCGCGGTGGA AGTCGCGCAT CGGTATCAGC CCGCGCAGGA GGGTGTCGGC GGCGACTGGT ACGACGTCAT CCCGTTATCC GGTGGGCGGG TGGCGCTCGT CGTCGGCGAC GTCGTCGGGC ACGGCATCCA CGCCGCGGCG ACGATGGGAC GGCTGCGGAC CGCCGTGCGC AACTTCTGCG CGCTGGACCT GCCGGCCGAG GATCTGCTCA GCCAGCTTGA CGCGCTCGTG GAGTCGATGG ACGCCGACGA GGCCGAGGAC CAGCGCGGTG TCGGCATCAT CGGCGCGACC TGCCTGTACG TGGTCTACGA CCCGGTGACC GGGCTGTGCT CGGTCGCGGC AGCCGGGCAC CCCTCGCCGG CCGTGGTGGC CCGCGACGGC TCGGTGGAGT ACCTGGACCT GCCGACCGGC CCGCCGCTGG GCCTCGGAGG ATCGGCGTAC GAAGCGGTCG AGCTGCCGAT CGACGAGGGC AGCATCCTGG TCCTCTACAC CGACGGCCTG GTCGAGAGCC GCGAGCAGGA CATCGGCGAC GGGCTGGAGC GGCTGAGCGC GGCGCTCGCC GGACCCGGAC GCGATCCGGA GGAGCTGTGC GCCTCGGCGA TCGGCGGTCT GCTGCCGGAG CGCCCCGCCG ACGACGTCGC CCTGCTCGCC GCGCGCGCCC GGCGCACGAC GCCGGATCGG GTCGCCACCT GGGACGTGCC GATGACGCCG GAGTCGGTGG CGTTCCTGCG CGCCGAGGTC TCCCGCCAGC TGCGCGCCTG GCGCCTGACC GAGCTGGTCT TCACCACCGA GCTCATCGTC AGCGAACTGG TGACGAACGC GATCCGGTAC GCCACCGGCC CGGTCGAGCT GCGCCTGCTG CGTGACAGGG CCCTGATCTG CGAGGTCGCG GACGGCAGCA GCGTTTCCCC GCGGTTGCGC CGCGCGCAGA CCTTCGACGA GGGCGGACGC GGCCTGTTCC TGGTCGCGCA GCTCTCACAG CGGTGGGGGA CCCGGTACAC CGCGCGCGGC AAGGTGATCT GGTCCGAGCA GCCGCTGCCG GCGAACGGCG ACTACTAG
|
Protein sequence | MRSGILASWR RCRAGGLGPE DVDLPYEPDT GTEESLLRAS APVLERLHAL LIDTPVCVVL SDAAARILVR RAGEPGLNRH LDAVQLAEGF SYHEADAGTN GIGTALAEGR PAVVLAGEHF ADRFLAFVCA GVPIRDPFSG RIRGVVDLTS WRRDASPLMA ALAAEAAENI ELRLLEQYSA KERALLAQAR RAGGITVGGG CGEGSTRSGD VGEARRTRGE LVLATRQRHG ADPGHGDRRD RQLVRGKAAE LVAAARRDVV TIALPGGRHA KLTARTGRTA AGTEVVTVEA EMTSAPEMTT GALDTAKEMS SDASTDMPMD MPMGVSVGVS VGVSTGELET AADALGMTAE ELETAGGCET AGAAARLPAA RTPDGMGGAS GSGGSEHHQA QMIALRIEPV EMAMSGVQAG REARSVGECV ATQINNSLPH PTAPEPEAEP GRDPQPTPPT STPPPASGHQ IPRPAGATTP HPYPGAPNPE LVMVGEPTVG RIAVAARQRL ALLLDASGRI GTTLDMELTG GELAEIALPD FAQHVAVDLA SWVLDGEECL PPGSEVKLRR VAVRTVRRGK VHRPAGAHVA YNAATAQARS MVQRHPVLDA QLTAATAGKR WVAEDPDSSV IAAPIMVQGI VLGVASFYRL GNADPFDEDD LQLAGDLASR AAVCLDNARR FARERAMALA LQRSLLPRAF PMQCAVEVAH RYQPAQEGVG GDWYDVIPLS GGRVALVVGD VVGHGIHAAA TMGRLRTAVR NFCALDLPAE DLLSQLDALV ESMDADEAED QRGVGIIGAT CLYVVYDPVT GLCSVAAAGH PSPAVVARDG SVEYLDLPTG PPLGLGGSAY EAVELPIDEG SILVLYTDGL VESREQDIGD GLERLSAALA GPGRDPEELC ASAIGGLLPE RPADDVALLA ARARRTTPDR VATWDVPMTP ESVAFLRAEV SRQLRAWRLT ELVFTTELIV SELVTNAIRY ATGPVELRLL RDRALICEVA DGSSVSPRLR RAQTFDEGGR GLFLVAQLSQ RWGTRYTARG KVIWSEQPLP ANGDY
|
| |