Gene Caci_4345 details

Gene Information

Locus tag: Caci_4345
Symbol:
ID: 8335699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4933747
End bp: 4934994
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 73%
IMG OID: 644957448
Product: putative RNA polymerase, sigma-24 subunit, ECF subfamily
Protein accession: YP_003115050
Protein GI: 256393486
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
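
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon (the function names are illustrative, not part of any database API):

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
start_bp = 4933747
end_bp = 4934994


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive 1-based interval."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1248 415
```

Both values agree with the Gene Length and Protein Length fields of the record.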


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.756748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTTCCG ACCAACGCCT GGAGGATCTG CTCGCCGAGC TCAGGCCCGC GGTCCTCGGC 
GCCCTGGTCC GACGGCACGG GCAGTTCGAC GGCTGCGAGG ATGCGGTGCA GGAGGCGCTC
GTCGCCGCGG CCACGCAGTG GCCAGCCGAG GGCGTTCCGG ACAACCCGCG CGCCTGGCTG
CTCACCGTCG CGGGGCGCCG CCTCACCGAC TATTGGCGCA GCGACCACGC GCGCCGCACG
CGCGAGGCCA CGGTCGCGGC CATGGCGGCG CCGGAATCCG CGTACGCTCC GGCGCCGGAC
GACGAGGAGC GGATCTCAGC CGACGACGAC ACCCTGATGC TGCTGTTCCT GTGCTGCCAT
CCGGTGCTCA GCCCGTCCTC GCAGGTGGCG CTGACGCTGC GGGCGGTCGG CGGACTTACC
ACCGAGGAGA TCGCGCAGGC GTTCCTGGTC CCGCAGACCT CGATGACGCG CCGCATCTCC
CGCGCCAAGC AGCAGGTCAA GGACGCAGGG CTGACGTTCC GCCTGCCGCC GCAGGCCGAG
CGTGCCGAGC GGACGCGCGC GGTCCTGCAC GTGCTGTACC TCATCTTCAA TGAGGGCTAC
ACCGCCACCG CCGGTCCGGA TCTGCTGCGT CCGGACCTCA CCGCCGAGGC GATCCGCCTG
ACCCGGCAGG TCCACCGCGT CCTGCCGGAG AACGGCGAGG TCGAGGGTCT GCTGGCGCTG
ATGCTCCTGA CCGAGGCGCG CAGCCCGGCG CGGACGCTGG CCGACGGGAC TCTGGTCCCG
ATGGCCGATC AGGACCGGTC GCTGTGGAAC GGCGATCTGG CCGAGGAGGG GTTGGCGTTG
GTGGTCGAGG CGCTGGCTCG GCCGGGCGTC GGTCCCTACC GCTTGCAGGC TGCGATCGCT
GCGGTGCACG TGGAGACGCC CGCCGACGGC ACGACGGACT GGCCGCAGAT CCTGGCGCTG
TACGACCTGC TGGAGCAGAT GGCGCCGAAC GCGGTGGTCC GCCTGAACCG GGCGGTGGCG
ATGGCGATGG TCGAGGGGGC GCGCGAGGGA CTGCGGCTGC TGGAGCCGCT GGAACAGGAC
CGATGGATGG CGGGCAACCA TCGGCTGAGC GCGGTGCGTG CCTACCTGCT GGAGATGGAC
GGCGATCGCG CCGGCGCGCG CGAGGCGTAC CGGACGGCGG CGCGACAGGC GGCGAGCGGG
CCGGAGCAGC GGTATCTACG GGAGCAGGCG GAGCGGTTGG GCGCCTGA
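
The GC content field in the record can be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted in the grouped layout used here (blocks of ten bases separated by spaces); `gc_content` is a hypothetical helper name, and only the first row is used as sample input:

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence, copied verbatim from the record.
first_row = "GTGCGTTCCG ACCAACGCCT GGAGGATCTG CTCGCCGAGC TCAGGCCCGC GGTCCTCGGC"
print(f"{gc_content(first_row):.1%}")
```

Passing the full 1248 bp sequence instead of the first row gives the gene-wide value to compare against the GC content field.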
 
Protein sequence
MRSDQRLEDL LAELRPAVLG ALVRRHGQFD GCEDAVQEAL VAAATQWPAE GVPDNPRAWL 
LTVAGRRLTD YWRSDHARRT REATVAAMAA PESAYAPAPD DEERISADDD TLMLLFLCCH
PVLSPSSQVA LTLRAVGGLT TEEIAQAFLV PQTSMTRRIS RAKQQVKDAG LTFRLPPQAE
RAERTRAVLH VLYLIFNEGY TATAGPDLLR PDLTAEAIRL TRQVHRVLPE NGEVEGLLAL
MLLTEARSPA RTLADGTLVP MADQDRSLWN GDLAEEGLAL VVEALARPGV GPYRLQAAIA
AVHVETPADG TTDWPQILAL YDLLEQMAPN AVVRLNRAVA MAMVEGAREG LRLLEPLEQD
RWMAGNHRLS AVRAYLLEMD GDRAGAREAY RTAARQAASG PEQRYLREQA ERLGA
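
The protein sequence corresponds to the gene sequence translated with translation table 11, in which GTG is an accepted start codon and is read as methionine when it initiates a CDS. A minimal sketch of that translation using only the standard library; the helper name `translate_cds` is illustrative, and the sample input is the first 21 bases of the gene:

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino acid
# assignments of table 11 are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first in-frame stop codon."""
    dna = "".join(seq.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # an initiating codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence in this record.
print(translate_cds("GTGCGTTCCG ACCAACGCCT G"))  # MRSDQRL
```

The output matches the first seven residues of the protein sequence in the record.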