Gene Caci_4163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4163 
Symbol 
ID8335517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4708860 
End bp4711979 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content71% 
IMG OID644957266 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003114868 
Protein GI256393304 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0226672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGACG ACACGTCCCA CGAAAGCGCG AGCACCGCGG ATCCTCCGCC CAACCGCCGG 
AGCCTGGAGC AAGCCCAGCC GCCCCGGCGC CATCCCGTCG TGGGCGTCGC CACGGCCCTG
CTGCAGGCCG ACGGCCGCAT CGTCCACTGG AGCGCCGCCG CCGAGGCGAT GCTCGGCTAC
TCCGCCGCCG AGGCCGAAGG CGCCCTCGCC ATCGACCTGC TCGGCTCCGA ACGGCTCCGC
GGCGACATCC TGACCATCTA CGACGCCATC TTGCAGGGCG AGGACTGGAC CGGCGTCTTC
CCGGTCCGCC AGCGCGACGG CAACGTGGCC CAGCTGGAGA TCCACACCTA CCGCATCGAC
GCCGGCAGCC CGCCGCCGAT GGTCCTGGCC ACCGCGGTGG ACGTGCGCGC CGTCCGCGAG
GTCGAAGCCG ACCTGGCCGT CCTGGACAGC TTCTTCAGCC AGTCCCCGGT CGGCATGGCG
GTCTACGACA CCGAGACCCG CTTCGTCCAG CTCAACGCCG CCCTCGCGGC CGCGCACGGG
ATCTCCGTCG CCGAACACCT CGGCCGCCGC GTCCGCGACG TGCTGCCCGG CGAGGAGGGC
ATCCGGGTCG AAGCGCAGGT CCGGCAGGTG CTGGCCACCG GCGTGCCGAT CGCCGACGCC
CGCTGGGCCG GACCGACCAA CGGGGACGCC GAACACGGCG ACGCCGTCCA CGACCACACC
TGGTCGGCCT GGTACTCCCG GCTCCAGGAC GCCTCCGGCC GGGTCTTCGG CGTCAGCTCG
ACCGTCATCG ACGTCACCGA ACGCCACGAG GCCGAGGAAC AAGCCGCCCG CGCCCGCCGT
CGCCTGTCCC TGCTGGCCGA GGCCAGCGCC GCGATCGGTG CGACCCTGGA CGTCCGCCAA
GCAGCGCGCG AACTGGTCAA GGCGATGGTC CCGGAGATCG CAGACGTCTG CGGTGTCCAC
GTCCTGGAGC ATCGCTCCCA GCCCGAATCC GTGGCGGCGC AAGCGGATCC GGAGGCGTAC
GTAGCTCGGA GGGTGGCTTT CGACGCCGTC AGCGAGGACT TCCCCTACGA CGACGTCCCG
ATCGGCCAAC TGTTACGGCT CGACCCGAAA TCCCCGTACT CCGAAGCGCT GCGCAAGCGC
CAGACCGTCG TCGTCGCCCC GTCCGAGATG CCCTCGATCA TCGCCAACCC GACCGGCCGG
CTGAGCACGT ACTACGCGCG CCGGGCCCAG ACCGTGCGCG TCTCCCCGCT GGTGGCACGC
GGCGCGGTCC TGGGCTTCGT GTCCTACGCG CGCGGTGCTG TCCGCGAACC CTTCGACGAC
CAGGACATCA CCCTCGGCGA GGACCTGATC GCCCGCGCCG CCACCGCCTT CGACAACGCC
CTGCTGTTTC AACGCGAGCG CGAGACAGCA CTGGCCCGGC AGGAGACGCT GCGCCAAGCC
AACGCCGCCC AAGGACGCCT GGCGCTGCTC AACGACGCCA GCGTCCGCAT CGGTACCACG
CTGGACCTGC AACGCACCGC CGAGGAACTC ATCGAGGTGG TGCTGCCGCG CTTCGCGGAC
TTCGCAACCG TGGACCTGCT CGTCTCGGTC ATGCGCGGCG ACGAACCCGC CTCACCGCTG
CCCGGTGAGC CGGTGGTCGT GCAAGCCGTC GCCGTCGCAG AGTCCTTCCC CTCCGGACTG
ACAGAGGTCG CTGACGCAGT CGGAAAGACC TCCACCCGCG ACGCCGCCAA GGGCTACGCA
CGCAGCCTGC GCAGCGGACG TCCCATGGTG ATCCCCATCG TCGATGAGGA ATCGCTGGCC
TCCCTCGCCT CCTCCCCGGA GCGCGTCGCC GGAGGTCTGG CCGCAGGTAT CCACTCCTAC
CTGATGGTTC CGCTGCTCGC GCGCGGCGTC GTGCTCGGCG GCGCGGAGTT CATCCGCATG
CAGGACCGCG AGCCGTTCGG TCGCGCGGAC GTCGCGCTGG CAGAGGAACT GGCGGCGCGC
GCCGCGCTGT GTATGGACAA CGCGCGCCTG TATCGCAGGG AACGTAGAAC CGCGCTGACG
CTGCAACGCA GCCTTCTGCC ACAGAACGTG CACCACACGA TCGGCATGGA GATCGCGCAC
CGCTACCTGC CCAGCAGCCG GGTCAGCGAG GTCGGCGGCG ACTGGTTCGA CGTCGTCCCG
CTCTCGTGCG GCCGGGTGGC GCTGCTGGTC GGCGACGTGA TGGGACACGG CATCCGGGCC
GCGGCGACGA TGGGGCAGTT GCGCACGGTG GCCCGGACGC TGGCGACCTT GGACATGGAG
CCCGAGCAGG TGCTGACCCG CCTGGACGCG ACTGCCGCCA ACAGCGGCGA CGACCAGTTC
GCGACGTGCG TGTGCGCGGT GTACGACCCG GTGGAGCGCT CCGGCGTGAT CGCCTCGGCC
GGTCACCTGC CGCCGGTGAT CGTGGCGCCG GACGGGACGA CCACCGTGCT CGACGTCCCG
CCGGGACCGC CGCTGGGGGT CGGGGGCGTG CCGTTCGAGA GCGTGGAGTT CGTCCTGCCC
GAGCGCAGCG TGATGGCGAT GTACACCGAT GGTCTGGTCG AGCGCCGCGG CCGCGATCTC
GACGAGGGGA TCAGCCTGCT GCGACAAGCG CTGACACAGC GCGACCGGCC CTTGGAAGAG
GCCTGCGACG CCGTGCTGGC GGCACTGGTC CCCGGCGGCG CCGAGGACGA CGTGGCGCTG
ATCATGGCCA AGACCGTCTC GCTGGCCGGG GACCGGGTCG CCACCCTGGC GCTGTCCGGC
GACCGCCGCA TGGCCGGCCA GGCGCGCAGC TTCACCCGCG GCAAACTCCG CGACTGGGGA
CTGGCATCCC TCACCGACCT GGCCGAACTC CTGGTCAGCG AACTGGTGAC CAATGCCCTG
ACCCACACCG GCCACCCCCG CCAACTCCGC CTGTTCTGCG ACCGCACCCT CACCGTCGAG
GTCGCCGACT CCGACCCCCG AGCCCCCACC GCCCGCGGCT TCACCGACTA CGAGGAAAGC
GGCCGCGGCA TCCAACTGGT CGACGAACTG TCCCGCCGCT GGGGCAGTCG CATCACCCGG
CACGGCAAGG TGGTCTGGTT CGAGCTGGAG ATCCCGTCCG GCGCGCCGAC GGAGCATTAG
 
Protein sequence
MVDDTSHESA STADPPPNRR SLEQAQPPRR HPVVGVATAL LQADGRIVHW SAAAEAMLGY 
SAAEAEGALA IDLLGSERLR GDILTIYDAI LQGEDWTGVF PVRQRDGNVA QLEIHTYRID
AGSPPPMVLA TAVDVRAVRE VEADLAVLDS FFSQSPVGMA VYDTETRFVQ LNAALAAAHG
ISVAEHLGRR VRDVLPGEEG IRVEAQVRQV LATGVPIADA RWAGPTNGDA EHGDAVHDHT
WSAWYSRLQD ASGRVFGVSS TVIDVTERHE AEEQAARARR RLSLLAEASA AIGATLDVRQ
AARELVKAMV PEIADVCGVH VLEHRSQPES VAAQADPEAY VARRVAFDAV SEDFPYDDVP
IGQLLRLDPK SPYSEALRKR QTVVVAPSEM PSIIANPTGR LSTYYARRAQ TVRVSPLVAR
GAVLGFVSYA RGAVREPFDD QDITLGEDLI ARAATAFDNA LLFQRERETA LARQETLRQA
NAAQGRLALL NDASVRIGTT LDLQRTAEEL IEVVLPRFAD FATVDLLVSV MRGDEPASPL
PGEPVVVQAV AVAESFPSGL TEVADAVGKT STRDAAKGYA RSLRSGRPMV IPIVDEESLA
SLASSPERVA GGLAAGIHSY LMVPLLARGV VLGGAEFIRM QDREPFGRAD VALAEELAAR
AALCMDNARL YRRERRTALT LQRSLLPQNV HHTIGMEIAH RYLPSSRVSE VGGDWFDVVP
LSCGRVALLV GDVMGHGIRA AATMGQLRTV ARTLATLDME PEQVLTRLDA TAANSGDDQF
ATCVCAVYDP VERSGVIASA GHLPPVIVAP DGTTTVLDVP PGPPLGVGGV PFESVEFVLP
ERSVMAMYTD GLVERRGRDL DEGISLLRQA LTQRDRPLEE ACDAVLAALV PGGAEDDVAL
IMAKTVSLAG DRVATLALSG DRRMAGQARS FTRGKLRDWG LASLTDLAEL LVSELVTNAL
THTGHPRQLR LFCDRTLTVE VADSDPRAPT ARGFTDYEES GRGIQLVDEL SRRWGSRITR
HGKVVWFELE IPSGAPTEH