Gene Caci_5761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5761 
Symbol 
ID8337122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6659007 
End bp6660098 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content71% 
IMG OID644958865 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003116460 
Protein GI256394896 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.230872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.079116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGG CCGATTCCGA CGGCGCTGTC CTGATGCCGC GCATCGATCA GACCGGAGCC 
GCTGTGGGCG ACAGCAGCCT CACCGACGTC GCCGTGCGCG CGGGGGTCTC GACGGCGACG
GTCTCGCGGG CGCTGCGCGG GCTGCCGTCG GTGACCGAGG AGACGCGGGC GCGGATCAAG
GCGGTCGCCG ACGAACTGGG CTATGTGGTC TCCCCCAGCG CGTCGCGGCT GGCCACCGGG
CGCACGCACA CCGTCGGCGT CATCGTGCCC TCCATCGACC GCTGGTTCTC CGGCCAGGTC
ATCAAAGGAG TCGAGCAGGT CCTCCGCGCC GCCGGCTACG ACCTCCTGTT CTACAACCTC
GGCGACGACG AAGGCCGCGC CCGCTTCTTC GAGGCGATGC CCCTGCGCCG CCGCGTGGAC
GCGGTCCTGG TACTTTCCGT GCCCCTGCAG GACCCCGAAG TCGCCAAACT GCGCTCCCTG
CACCTGCCGA TCGGCCTGGT GGGCGCCTCA GCCGACACCT TCTCCAGCGT CCGCATAGAC
GACCTCGCCG GCGCCGCCAC CGCCGTCCGC CACCTGATCG GCCTCGGCCA CCGCGACATC
GCCCTGATCT CCGGCGGCAC CGACGTCCCC CCGCACTTCA CCACCCCCAC CGACCGCCGC
CGCGGCTACC TGGACGCCCT GGCCGCATCA GGCATCGGCT ACGACCCCGC CCTCGAAGCC
GCCGGCGATT TCACCATCAC CGGCGGCGAA CGCGCCATGA GCCACCTCCT GGGCCGCCCC
CACCACCCCA CCGCCGTCTT CGCCCTCTGC GATGAAATGG CCTTCGGCGC CATGCGCGTC
CTGCGCACAT CAGGCCTCCG CATCCCCCGC GACATCTCCG TCATCGGCTT CGACGACCAC
GAAATGTCCG ACCTCCTCGA CCTGACCACC ATCAGGCAGC CGGTGGTGGA ACAGGGCGCG
ACAATCGCCC GCCTCCTCCT GGACCGCCTA TCAGCAGAGG CACACACCCG CCCGCACCAG
AACGAGGTCT CGCTGCCGAC ACAGTTGGTA GTGCGCGGCA GCACGGCGCC GAGGCGCGCC
CGACGCGCCT GA
 
Protein sequence
MSTADSDGAV LMPRIDQTGA AVGDSSLTDV AVRAGVSTAT VSRALRGLPS VTEETRARIK 
AVADELGYVV SPSASRLATG RTHTVGVIVP SIDRWFSGQV IKGVEQVLRA AGYDLLFYNL
GDDEGRARFF EAMPLRRRVD AVLVLSVPLQ DPEVAKLRSL HLPIGLVGAS ADTFSSVRID
DLAGAATAVR HLIGLGHRDI ALISGGTDVP PHFTTPTDRR RGYLDALAAS GIGYDPALEA
AGDFTITGGE RAMSHLLGRP HHPTAVFALC DEMAFGAMRV LRTSGLRIPR DISVIGFDDH
EMSDLLDLTT IRQPVVEQGA TIARLLLDRL SAEAHTRPHQ NEVSLPTQLV VRGSTAPRRA
RRA