Gene Caci_2332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2332 
Symbol 
ID8333681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2643093 
End bp2645609 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content73% 
IMG OID644955485 
ProductRNA binding S1 domain protein 
Protein accessionYP_003113091 
Protein GI256391527 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.430037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGCGA CAGCAGCGTC CATCCACTTG CGCATCGCCG AGGAGCTCGG CGTCGTCGAC 
CGGCAGGTCC GGGCGGCCGT CGAGCTCCTG GACGGCGGGG CGACCGTGCC TTTCGTCGCG
CGGTATCGCA AGGAGGTGAC CGAGGGGCTG GATGACGCGC AGCTGCGCAC GCTGGAGGAG
CGGCTGCGCT ACCTGCGCGA GATGGAGGAG CGCCGGGAGG CGATCCTGGA GTCCATCCGC
AGCCAGGGCA AGCTCGATGA GGCGCTGGAG GCGCAGATCC TCGCCGCGGA TTCCAAGTCC
CGGCTGGAGG ACATCTACCT GCCCTTCAAG CCCAAGCGGC GGACCAAGGC GATGATCGCG
CGCGAGGCCG GGCTGGAGCC GCTGGCCGAC GCGCTGCTCG CGGACCCGAG CCTGGATCCG
GCTGCCACCG CCGCGGGATA CGTGGACGCC GACAAGGGCG TCGCGGATGC CCCGGCCGCG
CTGGACGGCG CGCGCTCGAT CCTGGTCGAG CGGTTCGGCG AGGACGCCGA CCTGATCGGC
GCGCTGCGCG AGCGGATGTG GTCCTCCGGC CGGCTGACCT CCAAGGTCCG CGAGGGCAAG
GAGGAGGCGG GCGCGAAGTT CTCCGACTAC TTCTCCTTCG CAGAGCCCTT CACCGCGCTG
CCCTCGCACC GCATCCTGGC GCTGCTGCGC GGGGAGAAGG AGGAGGTCCT GGACCTCACC
TTCGATCCCG AGCCCGAGGA CGCCGAGCCC GGCGCGCCGG TGACGCAGAG CGGGTACGAG
GTCCGCATCG CCAAGAACTT CGCCATCCCC TCGCCGACCG CGGCCGGCGC CGGCGTGAAG
TGGCTGAACG ACACCGTGCG CTGGGCCTGG CGCACGCGCA TCCTGGTGCG CCTGGCCGTG
GACGTCCGCA TGCAGCTGTG GACGCTCGCC GAGGACGAGG CGGTCCGGGT CTTCGCCGCG
AACCTGCGCG ACCTGCTGTT GGCCGCCCCC GCCGGTACCC GCGCCACGAT GGGCCTGGAC
CCCGGTTACC GCACCGGCGT GAAGGTCGCG GTCGTCGACG CCACCGGCAA GGTCGTGGCC
ACCGACGTGA TCTACCCGCA CGTTCCGCAG AACAAGTGGA ACGAGGCGCT GGCCAAGCTC
GCCTCGCTGG TCAAGGTCCA CAAGGTCGAG CTGATCGCGT TCGGCAACGG CACGGCCTCC
CGCGAGACCG ACAAGCTGGC CCAGGAACTG GTCGCCAACC TGCCGGACCT GAAGCTGACG
AAGATCATGG TCTCCGAGGC CGGCGCCTCG GTGTACTCGG CCTCGGCCTT CGCCTCCCAG
GAACTGCCGA ACATGGACGT CTCGCTGCGC GGCGCGGTCT CGATCGCCCG CCGCCTGCAG
GACCCGCTGG CCGAGCTGGT GAAGATCGAC CCGAAGTCCA TCGGCGTCGG GCAGTACCAG
CACGACCTCG CCGAGGGGAA GCTCTCGCGC TCCCTGGACG CCGTGGTCGA GGACTGCGTG
AACGCCGTCG GCGTGGACGT GAACACCGCC TCGGCGCCGC TGCTGACCCG GGTGTCGGGC
ATCGGCTCGT CCCTGGCGGA CAACATCGTG GCCCACCGCG ACGCCAACGG CCCGTTCGCC
ACCCGCAGCG CCATCAAGGG CGTCCCGCGC CTGGGCGCCA AGGCCTTCGA GCAGTGCGCG
GGCTTCCTGC GCATCCCCGG CGGCGACGAC CCGCTGGACG CCTCCTCGGT CCACCCCGAG
GCCTACCCGG TGGTGCGCCG CATCCTGGCG AGCACCGGCA GCGGTATCAA GGAGCTGATC
GGCGACACCA AGACCCTGCG GGCCCTGCGC CCGGCGGAGT TCGCCGACGA CACCTTCGGC
GTCCCGACCG TCACCGACAT CCTGGCCGAG CTGGAGAAGC CCGGCCGCGA CCCGCGCCCC
GCCTTCAAGA CCGCGACCTT CAAGGAGGGC GTGGAGAAGA TCGGCGACCT GCAGCCCGGC
ATGATCCTGG AGGGCGTGGT CACCAACGTC GCGGCCTTCG GCGCCTTCGT CGACGTCGGC
GTCCACCAGG ACGGCCTGGT CCACATCTCG GCGCTGTCGA AGACGTTCGT CAAGGACCCG
CGCGACGTGG TGAAGCCCGG CGACGTGGTC CGCGTGAAGG TGCTGGACGT GGACGCGGTG
CGCAAACGGA TCGCGCTGAC CCTGCGGCTG GACGACGAGG CCGGTGCCGG CGGCTCGGGC
GGCTCCGGCG GTCCCGGACG CCAGCGGCAG TCGGGCGAGA ACAGCGGTGG CGGTCGCGGG
CGCGATGGGC GCGGCGGCGG TGGCGGCGGT GAGCGGCGCG GCGGTCAGGG CGGCCAGGGC
AGTGGCGGCG GCAACGCCGG CAACGGCGGC GGCGACCGGC GTGGTGGCGG CGGCGGTGGA
AACAGCGGTG GCGGCAACAG CGGCGGTGGC GGCGGCCGGG GCGGCGACCG ACGCGGTGGC
ACCAGCGAGC CGCAGGGCGC GCTGGCCGAT GCGCTGCGGC GCGCCGGGCT CGCCTAG
 
Protein sequence
MAATAASIHL RIAEELGVVD RQVRAAVELL DGGATVPFVA RYRKEVTEGL DDAQLRTLEE 
RLRYLREMEE RREAILESIR SQGKLDEALE AQILAADSKS RLEDIYLPFK PKRRTKAMIA
REAGLEPLAD ALLADPSLDP AATAAGYVDA DKGVADAPAA LDGARSILVE RFGEDADLIG
ALRERMWSSG RLTSKVREGK EEAGAKFSDY FSFAEPFTAL PSHRILALLR GEKEEVLDLT
FDPEPEDAEP GAPVTQSGYE VRIAKNFAIP SPTAAGAGVK WLNDTVRWAW RTRILVRLAV
DVRMQLWTLA EDEAVRVFAA NLRDLLLAAP AGTRATMGLD PGYRTGVKVA VVDATGKVVA
TDVIYPHVPQ NKWNEALAKL ASLVKVHKVE LIAFGNGTAS RETDKLAQEL VANLPDLKLT
KIMVSEAGAS VYSASAFASQ ELPNMDVSLR GAVSIARRLQ DPLAELVKID PKSIGVGQYQ
HDLAEGKLSR SLDAVVEDCV NAVGVDVNTA SAPLLTRVSG IGSSLADNIV AHRDANGPFA
TRSAIKGVPR LGAKAFEQCA GFLRIPGGDD PLDASSVHPE AYPVVRRILA STGSGIKELI
GDTKTLRALR PAEFADDTFG VPTVTDILAE LEKPGRDPRP AFKTATFKEG VEKIGDLQPG
MILEGVVTNV AAFGAFVDVG VHQDGLVHIS ALSKTFVKDP RDVVKPGDVV RVKVLDVDAV
RKRIALTLRL DDEAGAGGSG GSGGPGRQRQ SGENSGGGRG RDGRGGGGGG ERRGGQGGQG
SGGGNAGNGG GDRRGGGGGG NSGGGNSGGG GGRGGDRRGG TSEPQGALAD ALRRAGLA