Gene Caci_3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3137 
Symbol 
ID8334490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3443858 
End bp3445108 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content74% 
IMG OID644956284 
Productputative RNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003113887 
Protein GI256392323 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAAC CGAGCGAGGC CGTCCACGAC GGCGTCCACG CCGCTGTCAC CCGCGCCCAC 
CGCGAGGAGT GGGCCCGGGT GGTCGCCACC CTGACCCGGC GCTTCGGCGA CCTCGACGTC
GCCGAGGAGG CGGCCGCCGA GGCGTTCGCG GCGGCCGTCG AGCACTGGCG CGCCGAGGGC
GTGCCGCCCA ACCCCGGCGG CTGGCTGACC ACCACCGCGC AGCGCAAGGC CATCGACCGG
GTACGCCGCG AGGCCAAGCG CGGCGACAAG CAGAAGGAGG CTCAGGTGTT CTTCCACGAC
GACCCGCCCG AGCCGGTCGG CGCCATCGAG GACGAACGGC TCCGGCTGGT CTTCACCTGC
TGCCACCCGG CACTGGCGAT GGAGAGCCGC GTCGCCCTGA CCCTGCGCAT GCTCGGCGGC
CTGACCGTCG CCGAGATCGC CCGCGCCTTC CTGGTGCAGG AGACCGCGAT GGGGCAGCGC
ATCACCCGCG CGAAGGGCAA GATCAAGGCC GCGCGCATCC CCTACCGCGT CCGGTCGGCC
GAAGACCTGC CGGCCCGCGT CACCGGCGTG CTCTCGGTCC TGTTCCTGGT CTTCAACGAG
GGCTACCTGG CCACCGGCGC CGACACCGAC CCCGTCCGCC AGGACCTCAC CGCCGAGGCG
ATCCGCTTGA CCCGCCTGAT CCGCACCCTG CTGCCCCGCG ACGGCGAGAC CGCCGGACTG
CTGGCGCTGA TGCTCCTCAC CGAGGCCCGC CGCCCCGCCA GAGTCTCGGC GACCGGCGAA
CTGGTCTCCC TCGACGAGCA GGACCGCGGA GCCTGGGACA GGGCCCTGAT CGCCGAGGGG
CACCAGCTGG TCCGCGAACG CCTGGCCAGC GGCATCGCGC CGGGCCGCTA CCAGATCCTC
GCGGCGATCA ACGCCGTCCA CACCTCGGCC CGCGACGCGC GCGACACGGA CTGGTCGCAG
GTCGTCGCCC TCTACGACCA GCTGGTCCGG ATCGACCCCT CACCGATCGT CGCCCTGAAC
CGCGCCATCG CCGTCGCCGA ACTCGACGGC CCGGAGGTTG CCCTGGCGAC GATCGACCGC
CTCGGCGACC GGCTCGACGG CTACCACGCC TACCACGCGG CGCGCGCCGA CCTGCTGCGG
CGCACTGGCC GCAGCACGGA GTCGCGGACG GCGTACGACA GGGCGATCGA GCTGGCGGGG
AACTCCGGGG AGACGGCCTA TCTGACGCGG CGGCGGGATC AGTTGGGGTA G
 
Protein sequence
MGKPSEAVHD GVHAAVTRAH REEWARVVAT LTRRFGDLDV AEEAAAEAFA AAVEHWRAEG 
VPPNPGGWLT TTAQRKAIDR VRREAKRGDK QKEAQVFFHD DPPEPVGAIE DERLRLVFTC
CHPALAMESR VALTLRMLGG LTVAEIARAF LVQETAMGQR ITRAKGKIKA ARIPYRVRSA
EDLPARVTGV LSVLFLVFNE GYLATGADTD PVRQDLTAEA IRLTRLIRTL LPRDGETAGL
LALMLLTEAR RPARVSATGE LVSLDEQDRG AWDRALIAEG HQLVRERLAS GIAPGRYQIL
AAINAVHTSA RDARDTDWSQ VVALYDQLVR IDPSPIVALN RAIAVAELDG PEVALATIDR
LGDRLDGYHA YHAARADLLR RTGRSTESRT AYDRAIELAG NSGETAYLTR RRDQLG