Gene Caci_4142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4142 
Symbol 
ID8335496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4682230 
End bp4683186 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content72% 
IMG OID644957245 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003114847 
Protein GI256393283 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00637596 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.887483 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACGG CCCGCCTGCG CCGCATCGCC GTCCTCGTCC TGGAAGGTGC CAAGCCGCTG 
GATGTCGGCA TTCCCGCGCA GGTGTTCACC ACGCGCGCGA GCATGCCGTA CGAGGTCCGG
GTGTGCGGTG CCGCGCCCGG GCTGGTGACC GGCGGCGACG GGTTGGCGTA CCACGTCGCG
CATGGTCTGG AGGCGCTGGC GTGGGCGGAC ATCGCCTTCA TCCCCGGCTA CCGCGCTCCC
GACCGCGACG ATCCGCCGCC GGCCGTCGTG GCGGCACTGA TCGCCGCGCA CGAAGGGGGC
ACGCGGCTCG CCGCGATCTC CACCGGGGCG TTCGCTCTGG CCGCGACCGG GCTGCTCGAC
GGCAAGCGCG CCACGACCCA CTGGCACTAC ACGCGCACAC TCGCGCAGAA GCATCCGCAG
ATCCGCGTCG ATGAGAACGT CCTGTTCGTC GACGAAGGCA GTGTCCTGAC ATCGGCCGGC
GCCGCGTCGG GCATCGACCT GTGTCTGCAC ATCCTGCGCG GCGACCTCGG GGTGTCGGCG
GCGAACCACG CGGCGCGCCG GCTCGTCGCC GCGCCGTATC GCAGCGGCGG GCAGGCGCAG
TACGTGCCGC GCAGCGTGCC CGAACCGCTC GGCGAACGCT TCGCAGCCAC GCGCGAATGG
GCTCTGCGTC GACTCGGCGA TCCGCTGAGT CTGGAATCCC TCGCCGAACA CGCGGCGGTC
TCCCCGCGTA CGTTCTCCCG GCGTTTCATG GAGGACACCG GCTACACGCC GATGCAGTGG
GTCACGCGTG CCCGCGTCGA CCTGGCCCGC GAGCTGCTGG AGCGGTCGCA GCGCAGTATC
GAGCAGATCG CGAACGACGT CGGGCTCGGG ACCGGCACGA ACCTGCGGGC GCATTTCCAG
CGGATCCTCG GCACGACGCC GAGCGAGTAC CGGCGGACCT TCACGCGCGG CGAGTAA
 
Protein sequence
MPTARLRRIA VLVLEGAKPL DVGIPAQVFT TRASMPYEVR VCGAAPGLVT GGDGLAYHVA 
HGLEALAWAD IAFIPGYRAP DRDDPPPAVV AALIAAHEGG TRLAAISTGA FALAATGLLD
GKRATTHWHY TRTLAQKHPQ IRVDENVLFV DEGSVLTSAG AASGIDLCLH ILRGDLGVSA
ANHAARRLVA APYRSGGQAQ YVPRSVPEPL GERFAATREW ALRRLGDPLS LESLAEHAAV
SPRTFSRRFM EDTGYTPMQW VTRARVDLAR ELLERSQRSI EQIANDVGLG TGTNLRAHFQ
RILGTTPSEY RRTFTRGE