Gene Caci_4098 details

Gene Information

Locus tag: Caci_4098
Symbol:
ID: 8335452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4625229
End bp: 4628366
Gene Length: 3138 bp
Protein Length: 1045 aa
Translation table: 11
GC content: 72%
IMG OID: 644957201
Product: protein serine phosphatase with GAF(s) sensor(s)
Protein accession: YP_003114803
Protein GI: 256393239
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID:
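
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The Python sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4625229, 4628366)
print(length)                     # 3138
print(protein_length_aa(length))  # 1045
```

Both values agree with the Gene Length and Protein Length fields reported for Caci_4098.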


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.108116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGTCGG GGATTCTGGC GTCGTGGCGG CGGTGTCGGG CCGGTGGGTT GGGGCCCGAG 
GATGTCGATC TGCCGTATGA GCCGGACACC GGTACTGAGG AGTCGCTGCT GCGGGCTTCG
GCGCCGGTGT TGGAGCGGCT GCACGCGCTG TTGATCGATA CGCCGGTGTG CGTGGTGTTG
AGCGATGCGG CGGCTCGGAT TCTGGTGCGG CGGGCGGGTG AGCCGGGGTT GAACCGGCAC
TTGGACGCCG TGCAGCTGGC GGAGGGCTTC AGCTATCACG AGGCTGACGC CGGGACCAAT
GGCATCGGGA CGGCTTTGGC CGAGGGGCGG CCGGCTGTGG TGCTCGCGGG GGAGCACTTC
GCCGATCGGT TCTTGGCGTT CGTGTGTGCC GGGGTGCCGA TTCGCGATCC CTTCAGTGGA
CGGATTCGGG GCGTCGTGGA CCTCACGTCG TGGCGTCGGG ATGCCAGTCC GCTGATGGCG
GCGCTGGCGG CGGAGGCCGC GGAGAACATC GAGCTGCGGC TGCTGGAGCA GTACTCGGCG
AAGGAACGGG CGTTGTTGGC CCAGGCACGG CGGGCCGGTG GGATCACGGT CGGCGGCGGC
TGCGGCGAGG GCAGCACCAG GAGCGGCGAC GTCGGCGAAG CGCGGCGGAC CCGGGGTGAG
CTGGTGCTCG CCACCCGGCA GCGGCACGGC GCCGACCCCG GCCACGGCGA CCGCCGCGAC
CGACAGCTCG TACGGGGCAA GGCGGCCGAA CTGGTCGCGG CGGCCAGACG CGACGTGGTG
ACGATCGCGT TGCCCGGCGG CCGCCACGCC AAGTTGACGG CCCGCACCGG TCGCACCGCG
GCCGGCACGG AGGTGGTCAC GGTGGAAGCG GAGATGACGA GTGCCCCGGA GATGACGACG
GGTGCTCTGG ATACGGCAAA GGAAATGTCT TCGGATGCGT CGACGGATAT GCCGATGGAT
ATGCCCATGG GTGTGTCGGT GGGTGTATCG GTGGGTGTGT CGACCGGCGA GCTGGAGACG
GCCGCTGATG CGCTGGGGAT GACAGCCGAG GAACTGGAGA CGGCCGGCGG TTGCGAGACC
GCTGGTGCTG CGGCGAGGTT GCCGGCTGCG AGGACGCCGG ATGGGATGGG CGGCGCAAGC
GGTTCGGGTG GCTCGGAGCA CCATCAGGCG CAGATGATCG CACTGCGGAT TGAACCGGTG
GAGATGGCGA TGTCCGGTGT GCAGGCGGGG AGGGAAGCCA GATCAGTCGG GGAATGCGTT
GCGACACAGA TCAATAACAG CCTCCCGCAC CCGACCGCCC CAGAACCAGA AGCCGAGCCC
GGCCGCGACC CTCAACCAAC GCCCCCTACA TCAACCCCAC CCCCAGCCTC CGGCCACCAA
ATCCCCCGCC CCGCCGGCGC CACCACCCCC CACCCCTACC CTGGCGCGCC GAATCCCGAG
CTGGTCATGG TCGGGGAGCC GACCGTCGGG CGCATCGCGG TCGCCGCGCG CCAGCGCCTG
GCGCTCCTCC TGGATGCCAG CGGCCGGATC GGCACCACGC TCGACATGGA GCTCACCGGT
GGTGAGCTGG CCGAGATCGC GCTCCCGGAC TTCGCGCAGC ATGTCGCGGT GGACCTGGCG
AGCTGGGTTC TGGACGGCGA GGAGTGCCTT CCGCCCGGCA GCGAAGTGAA GTTGCGGCGG
GTTGCGGTGC GCACCGTCCG GCGCGGCAAG GTGCATCGTC CGGCGGGCGC GCATGTCGCC
TACAACGCCG CCACCGCGCA GGCGCGCAGC ATGGTTCAGC GGCATCCGGT GCTGGACGCG
CAGCTCACCG CCGCCACCGC CGGGAAGCGC TGGGTCGCCG AGGATCCCGA CAGCTCGGTG
ATCGCCGCGC CGATCATGGT GCAGGGCATC GTGCTCGGCG TGGCGTCCTT CTACCGCCTA
GGGAATGCCG ATCCTTTCGA CGAGGACGAC CTCCAGCTGG CCGGTGACCT CGCCTCGCGC
GCGGCCGTGT GCCTGGACAA CGCGCGCCGC TTCGCCCGGG AGCGTGCGAT GGCGCTGGCG
TTGCAACGCA GCCTTCTGCC ACGCGCCTTC CCGATGCAGT GCGCGGTGGA AGTCGCGCAT
CGGTATCAGC CCGCGCAGGA GGGTGTCGGC GGCGACTGGT ACGACGTCAT CCCGTTATCC
GGTGGGCGGG TGGCGCTCGT CGTCGGCGAC GTCGTCGGGC ACGGCATCCA CGCCGCGGCG
ACGATGGGAC GGCTGCGGAC CGCCGTGCGC AACTTCTGCG CGCTGGACCT GCCGGCCGAG
GATCTGCTCA GCCAGCTTGA CGCGCTCGTG GAGTCGATGG ACGCCGACGA GGCCGAGGAC
CAGCGCGGTG TCGGCATCAT CGGCGCGACC TGCCTGTACG TGGTCTACGA CCCGGTGACC
GGGCTGTGCT CGGTCGCGGC AGCCGGGCAC CCCTCGCCGG CCGTGGTGGC CCGCGACGGC
TCGGTGGAGT ACCTGGACCT GCCGACCGGC CCGCCGCTGG GCCTCGGAGG ATCGGCGTAC
GAAGCGGTCG AGCTGCCGAT CGACGAGGGC AGCATCCTGG TCCTCTACAC CGACGGCCTG
GTCGAGAGCC GCGAGCAGGA CATCGGCGAC GGGCTGGAGC GGCTGAGCGC GGCGCTCGCC
GGACCCGGAC GCGATCCGGA GGAGCTGTGC GCCTCGGCGA TCGGCGGTCT GCTGCCGGAG
CGCCCCGCCG ACGACGTCGC CCTGCTCGCC GCGCGCGCCC GGCGCACGAC GCCGGATCGG
GTCGCCACCT GGGACGTGCC GATGACGCCG GAGTCGGTGG CGTTCCTGCG CGCCGAGGTC
TCCCGCCAGC TGCGCGCCTG GCGCCTGACC GAGCTGGTCT TCACCACCGA GCTCATCGTC
AGCGAACTGG TGACGAACGC GATCCGGTAC GCCACCGGCC CGGTCGAGCT GCGCCTGCTG
CGTGACAGGG CCCTGATCTG CGAGGTCGCG GACGGCAGCA GCGTTTCCCC GCGGTTGCGC
CGCGCGCAGA CCTTCGACGA GGGCGGACGC GGCCTGTTCC TGGTCGCGCA GCTCTCACAG
CGGTGGGGGA CCCGGTACAC CGCGCGCGGC AAGGTGATCT GGTCCGAGCA GCCGCTGCCG
GCGAACGGCG ACTACTAG
 
Protein sequence
MRSGILASWR RCRAGGLGPE DVDLPYEPDT GTEESLLRAS APVLERLHAL LIDTPVCVVL 
SDAAARILVR RAGEPGLNRH LDAVQLAEGF SYHEADAGTN GIGTALAEGR PAVVLAGEHF
ADRFLAFVCA GVPIRDPFSG RIRGVVDLTS WRRDASPLMA ALAAEAAENI ELRLLEQYSA
KERALLAQAR RAGGITVGGG CGEGSTRSGD VGEARRTRGE LVLATRQRHG ADPGHGDRRD
RQLVRGKAAE LVAAARRDVV TIALPGGRHA KLTARTGRTA AGTEVVTVEA EMTSAPEMTT
GALDTAKEMS SDASTDMPMD MPMGVSVGVS VGVSTGELET AADALGMTAE ELETAGGCET
AGAAARLPAA RTPDGMGGAS GSGGSEHHQA QMIALRIEPV EMAMSGVQAG REARSVGECV
ATQINNSLPH PTAPEPEAEP GRDPQPTPPT STPPPASGHQ IPRPAGATTP HPYPGAPNPE
LVMVGEPTVG RIAVAARQRL ALLLDASGRI GTTLDMELTG GELAEIALPD FAQHVAVDLA
SWVLDGEECL PPGSEVKLRR VAVRTVRRGK VHRPAGAHVA YNAATAQARS MVQRHPVLDA
QLTAATAGKR WVAEDPDSSV IAAPIMVQGI VLGVASFYRL GNADPFDEDD LQLAGDLASR
AAVCLDNARR FARERAMALA LQRSLLPRAF PMQCAVEVAH RYQPAQEGVG GDWYDVIPLS
GGRVALVVGD VVGHGIHAAA TMGRLRTAVR NFCALDLPAE DLLSQLDALV ESMDADEAED
QRGVGIIGAT CLYVVYDPVT GLCSVAAAGH PSPAVVARDG SVEYLDLPTG PPLGLGGSAY
EAVELPIDEG SILVLYTDGL VESREQDIGD GLERLSAALA GPGRDPEELC ASAIGGLLPE
RPADDVALLA ARARRTTPDR VATWDVPMTP ESVAFLRAEV SRQLRAWRLT ELVFTTELIV
SELVTNAIRY ATGPVELRLL RDRALICEVA DGSSVSPRLR RAQTFDEGGR GLFLVAQLSQ
RWGTRYTARG KVIWSEQPLP ANGDY
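
The gene and protein sequences above are printed in space-separated blocks of ten, and the gene begins with GTG while the protein begins with M. The Python sketch below is illustrative only and is not part of the database record. It strips the block formatting, computes GC content, and translates using the codon assignments of translation table 11, where an alternative initiation codon such as GTG is read as methionine at the first position. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence in the coding frame.

```python
from itertools import product

# Codon assignments for translation table 11 (same amino acids as the standard
# code), listed in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGCGGTCGG GGATTCTGGC GTCGTGGCGG CGGTGTCGGG CCGGTGGGTT GGGGCCCGAG"
print(translate(first_line))  # MRSGILASWRRCRAGGLGPE
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.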