Gene Caci_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4226 
Symbol 
ID8335580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4793357 
End bp4794589 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content70% 
IMG OID644957329 
Productputative RNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003114931 
Protein GI256393367 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.094038 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG AACCGGCGAG CGAAGTCATC GCCGCCCTCT ACGCCGACAG CTGGAGCCGG 
ATCGTGGCGA CCATGATCCG CTTCACCGGC GGCGACTGGA ACCTGGCCGA GGAGTGCGCA
CAAGACGCCT TCACCCAAGC CCTGCGCCGC TGGCCCGACG AAGGCATCCC CCGCCAACCG
CTCGCCTGGC TGACCACCGC CGCCCGCAAC CGCGCCATCG ACACCTTCCG GCGCGCCTCC
ACCGAGGCGG CGAAGCTGCG CCAGTGGGCA GCCCTGGAAG CCAAGGCGCC GCCGCCCTAC
ACCCGCGACA GCGAAATCCC GGACGAGCGC CTGGAGCTCA TGTTCACCTG CTGCCACCCG
GCACTGAACC TCGACGCGCA AGTAGCGCTG ACCTTGCGCT CACTCGCGGG CTTGAGCACC
GCCGACATCG CACGCGCCTT CCTGGTCAGC GAACGCACCA TGGCCCAGCG CATCTTCCGC
GCCAAGCAGA AGGTGGCGCA CGCCGTCATC CCGTTCCGCG TCCCGCCGGC CCACCTGCTG
CCCGATCGCC TGCCCGCAGT CCTGCACGTG CTGTATCTGC TCTACAACGA GAGCTACAGC
GACCAGCAGC ACAAGGCGCG CCTGAGCACG GAGGCGATCC GACTGGCGCG AGTCCTGGCC
ACGCTGATGC CGGACGAGCC CGAGGCACAG GGCTTGCTGG CCCTGATGCT GCTGCAGGAG
GCGCGCCTCG CGACACGTCT GGACGCCGAA GGCGAACTGG TCACGCTGGA ACACCAGGAC
CGCTCCCTAT GGGACCGCAA GCGCATCGAC GAAGCCACGG CAATCTTGGA GAGCGCCCTA
CGCCGCCGCC GAGCCGGACC CTTCCAGATC CAGGCAGCCA TCGCCGCCTG CCACGCAACC
GCAGCCTCCG TCGAGAAGAC CGACTGGCCG CAAATCGTCG GCCTCTACGA CCAACTCCGC
CGCGTCTCGC CAAGCCCCCT GGTCGACCTG AACCGCACCG TCGCCCTAGC AATGGCACAG
GGCCCTGAGA CCGCCCTGCC CGACCTCGAC ACCCTCACCG CCTCCGGACG CCTCGACGGC
TACCACCTCC TGCACGCGGC CCGCGCCGAC CTGCTCGCCA AGGTAGGACG CGACGCGGAA
GCGAGGGCAA GCCTGGAAAC CGCGTTGGAG CTGGCACCGA CCGACGCCGA GCGGCGGCTG
TTGCAGCAGC GGTTGCATCC CGGGCGTACG TAA
 
Protein sequence
MTDEPASEVI AALYADSWSR IVATMIRFTG GDWNLAEECA QDAFTQALRR WPDEGIPRQP 
LAWLTTAARN RAIDTFRRAS TEAAKLRQWA ALEAKAPPPY TRDSEIPDER LELMFTCCHP
ALNLDAQVAL TLRSLAGLST ADIARAFLVS ERTMAQRIFR AKQKVAHAVI PFRVPPAHLL
PDRLPAVLHV LYLLYNESYS DQQHKARLST EAIRLARVLA TLMPDEPEAQ GLLALMLLQE
ARLATRLDAE GELVTLEHQD RSLWDRKRID EATAILESAL RRRRAGPFQI QAAIAACHAT
AASVEKTDWP QIVGLYDQLR RVSPSPLVDL NRTVALAMAQ GPETALPDLD TLTASGRLDG
YHLLHAARAD LLAKVGRDAE ARASLETALE LAPTDAERRL LQQRLHPGRT