Gene Caci_0883 details


Gene Information

Locus tag: Caci_0883
Symbol:
ID: 8332214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1025731
End bp: 1029171
Gene Length: 3441 bp
Protein Length: 1146 aa
Translation table: 11
GC content: 71%
IMG OID: 644954034
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003111657
Protein GI: 256390093
COG category:
COG ID:
TIGRFAM ID:
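
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1025731
END_BP = 1029171
GENE_LENGTH_BP = 3441
PROTEIN_LENGTH_AA = 1146


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length equals End bp minus Start bp plus one, and the protein length equals the codon count minus the stop codon.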


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.581781
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTTTG CCCGCAAACA CCGCATCGCG GTGGTCGCGG TCACGGCGCT GGGACTGGGG 
ATCGGCACTT CCGCCGCCGC CCTGGCCGCT CCGGCGCCCG CCAGGCCGTC GGCGGCTCAG
CAGTTGGCCG CTTTGTCGAC CGGGGCGAAG CACCCCGTGA TCGTGCTGCT GAAGAACCAG
CACCCCGAAC TGTCGGTCAA GACCGCGAAG GCCCAGCGCA AGGCGGCGAC CACCGCCGAC
CAGACTCCGC TGGTGAACAG CGCGCAGGCG ACCGGCGCTC AGGACATCAA GAAGTTCTCG
GTGATCAACG GCTTCTCGGC CAAGATGACC GACGCCGAGG CCGCGAACCT GCGGCAGAAC
CCGGGCGTCG AGGCGGTCGT CACCGACCAG CAGCACGTCG TGAACACGCT GACCGACGCG
CAGAAGTTGG CCATCGCCGA CTCCGCGGGC GGTACGGCCG CGGGCGCCAA GCCCGCCGCG
ACCGGCGCCG ACGGCCAGAC CCCGGCGGAC AAGGTCATCC CGGGCACCTG CCCGACCGAC
CCCAGCAAGC CGCTGCTGGA GCCGGAGGCG TTGCAGACCA CGAACACCGC CTTCACGAAC
AAGAGCCAGC CGCAGGCCCA GAACATCGTC GACGGCAAGG GCGTGAAGGT CGCGTGGATC
GCCGACGGGC TGGACGTCAA CAATCCGGAC TTCATCCGGG CCGACGGCTC CCACGTCTTC
AGCGACTACC AGGACTTCTC CGGGACCGAC CCGAACGGCG ACGAGAGCGG TGACGAGGCC
TTCGGCGACG CCAGCTCGAT CGCGGCGCAG GGTCTGCACA GCTACGACCT GTCCAAGTAC
GTCATGCCCG GCCACCCGCT CCCGGCCGGC TGCAACATCA CGGTCCGCGG CGTGGCTCCG
GGCGCCTCGC TGGTCGGCCT GAACGTCTTC GGCGCGGCCA ACCTGGTCTT CGACTCCACC
GTCGTGCAGG CCGTCGACTA CGCGGTGAAC GTGGACAACG TCGACGTCAT CAACGAGTCG
CTGGGCAGCA ACGCGCAGCC CACCGAGGGC CTGGACATCA CCAGCCTGGC CGACGACGCC
GCGGTCGCCG CCGGCGTCAC GGTCGTCACC TCCACCGGCG ACGGCGGCGT GACCAACACC
GAGGGCCAGC CGGCCGTCGA CCCGAACGTG ATCGGCGTCG GCGCCACCAC GACCTTCCGC
GACCAGGCGC AGACCGGCAC CGGCGGCGCG CGGAACCTGG CGAGCAGCTG GGCGTCGAAC
AACACCGCCG CGCTGTCCTC CTCCGGCACC AACGACCGGG ACCGGGTCCC GGACCTGGTC
GCGCCGGGCC AGGGCGGCTG GGCGCTGTGC AGCCCCGAGG CGCGGTTCAG CGCCTGTGTG
GACTACAACG GCAATCCGGC CTCCGTGGAG GACTTCGGCG GCACCAGCAT GGCCTCGCCG
CTGGTCGCCG GCGGCGCCGC GCTGGTCATC GAGGCCTATG AGAACACGCA CGGCGGGGCC
CGGCCGGCGC CGGCGCTGGT GAAGCAGATC CTCACCTCCT CGGCCAGCGA CCTCGGCCTG
CCGGCCGACC AGCAGGGCTC CGGTGAGCTG AACACCTACC GCGCGGTCCG GATGGCCATG
TCCGTCAAGG ACGGCAACGG CTCCCCGGCG GCGCAGGGCG ACGGCCTGAT GGCCACCACC
GGCACCGGCG ACACGCAGAT CTCGCTGATC GGTACCGGCG GCTCGAAGCA GAGCGCGTCG
GTCACGCTGA CCAACACCAG CCCCACCATT CAGACGGTCT CGGCGAATGT CCGCGAGCTG
GACACCACGG TCGCCGACAT CAAGGGCACC AAGGCGGTGG ACTTCACCGA CCCGAACTCG
CCGTGGTTCT ACGAGGGCTA CACCCTCGGC ACGCCCGGCC TGCAGCGGCA CTGGTTCAGC
ACCACGTTCA CCGTGCCGGC CGGCGCCGAC CACCTGACCG GCATGGCGAC CTGCGCCTGC
ACCGGGACCA GCACGCTGCT GCGCCTGGTG CTGGTCGGTC CGAACGGCGA GTACGAGAAC
TGGAACAGCC CGCAGGGCAC GACCAACTAC GCGACCGTGG ACCAGGCGAA CCCGCCGGCT
GGCAAGTGGA CGGCGTACTT CTACGCCAAC GCCAACGCCA CCGGCTTCAA GGGCAACATC
AGCTATGACT TCCTGGCCAC CAAGTACAAG GACGTCGGCT CGGTGAGCCC GGCGAGCGCG
GTGCTCAAGC CGGGGCAGTC GCAGAAGTTC ACCGTCAAGC AGACGCTGGC GCGCAACCCC
GGCGACGTCT CCGCGGCGCT GGCCTTCTCC ACGCCGTTCC ACCAGGTCAC CACGATGCCG
GTGACCAAGC GGACGCTGAT CTCCACGAAT AACGACGGCG GTTCCTTCAC CGGGACGCTG
ACCGGCGGCA ACGGCCGCGC GAGCACGCCG TCGCAGACCG AGTCGTACTA CTTCGACGTG
CCGCGCGGCA AGAAGAACCT GGCGATGGAC CTGACCTTCG CCGGCAGCCA CGCGGTCTCG
GCCTTCCTGG AGTCCCCGGA CCACCAGGTG GTGTCGCTGA GCACGAACAT CGCGGTGGAC
GCGCAGGGCA ACGAGAACCT GCTGCCCTCG CTGACCGGCT ACGTCGACGC CCCGGCCGCC
GGCCGCTGGG TGTTGTTCAT GGACGACATC AACCCCGGCG TGCTCTCCGG CGACCTGGCT
GACACCTACA ACGGTCAGCT GCGGTACAAC GCGGTCGACG CCAGTGCCAT GGGCCTGCCC
TCGGGCAAGC TCGCAGCCGG CAAGGCGGTG ACGGCGAAGG TGACGATCAA GAACACCGGC
GCGGCTCCGC TGACGGTGTT CGCCGACCCG CGCCTGAACA GCAGCGCCGA CTACGACCTG
CCGGCGCAGC AGCCGCTGGG CGCGACCGTG GCCCTGCCGT TCGCCACCAC CGGCGCGCAG
CCCTCCTTCC AGATCCCGAC GCACACCACC GAGCTCCGGG CGTCGCAGTC CTCGACGATC
CCGGCGGACT TCTCGACCAG CGGCCCGTCC GGCATGCCGG AGGTCTACGG CGTGTCCAAG
GGCCTGACCG CGGGCGCCAC CGTCGACTCC CCGTGGCTCA CCCCGGGCAT CTGGGGCCAG
GACCCGACCC CGCTGGGCCC GACGAACGCC GCGGTCACCG GCAGCGCGAC CGAGGCCGAG
AGCGTGACCA CGCTGGCCTT CGACCGCACC GCCGCGGCCT CCACCGGCGA CCTGTGGCTG
ACCGGCGTCG ACCCCAGCGC CCCGGCACTC GTGCCGGTGA CGATCATGCC GGGCCAGACC
GGGACCCTGA CCGTCACCTT CACACCGACC GGCGCGAGCG GCAGCAAGGT GAGCGGCGTG
GTGTACGTGG ACACGTACAA CGCGGCATTC GGCACCGCCG ACGAGCTGAC GGGCCTGCCC
TACAGCTACA CGGTGAAGTA G
 
Protein sequence
MQFARKHRIA VVAVTALGLG IGTSAAALAA PAPARPSAAQ QLAALSTGAK HPVIVLLKNQ 
HPELSVKTAK AQRKAATTAD QTPLVNSAQA TGAQDIKKFS VINGFSAKMT DAEAANLRQN
PGVEAVVTDQ QHVVNTLTDA QKLAIADSAG GTAAGAKPAA TGADGQTPAD KVIPGTCPTD
PSKPLLEPEA LQTTNTAFTN KSQPQAQNIV DGKGVKVAWI ADGLDVNNPD FIRADGSHVF
SDYQDFSGTD PNGDESGDEA FGDASSIAAQ GLHSYDLSKY VMPGHPLPAG CNITVRGVAP
GASLVGLNVF GAANLVFDST VVQAVDYAVN VDNVDVINES LGSNAQPTEG LDITSLADDA
AVAAGVTVVT STGDGGVTNT EGQPAVDPNV IGVGATTTFR DQAQTGTGGA RNLASSWASN
NTAALSSSGT NDRDRVPDLV APGQGGWALC SPEARFSACV DYNGNPASVE DFGGTSMASP
LVAGGAALVI EAYENTHGGA RPAPALVKQI LTSSASDLGL PADQQGSGEL NTYRAVRMAM
SVKDGNGSPA AQGDGLMATT GTGDTQISLI GTGGSKQSAS VTLTNTSPTI QTVSANVREL
DTTVADIKGT KAVDFTDPNS PWFYEGYTLG TPGLQRHWFS TTFTVPAGAD HLTGMATCAC
TGTSTLLRLV LVGPNGEYEN WNSPQGTTNY ATVDQANPPA GKWTAYFYAN ANATGFKGNI
SYDFLATKYK DVGSVSPASA VLKPGQSQKF TVKQTLARNP GDVSAALAFS TPFHQVTTMP
VTKRTLISTN NDGGSFTGTL TGGNGRASTP SQTESYYFDV PRGKKNLAMD LTFAGSHAVS
AFLESPDHQV VSLSTNIAVD AQGNENLLPS LTGYVDAPAA GRWVLFMDDI NPGVLSGDLA
DTYNGQLRYN AVDASAMGLP SGKLAAGKAV TAKVTIKNTG AAPLTVFADP RLNSSADYDL
PAQQPLGATV ALPFATTGAQ PSFQIPTHTT ELRASQSSTI PADFSTSGPS GMPEVYGVSK
GLTAGATVDS PWLTPGIWGQ DPTPLGPTNA AVTGSATEAE SVTTLAFDRT AAASTGDLWL
TGVDPSAPAL VPVTIMPGQT GTLTVTFTPT GASGSKVSGV VYVDTYNAAF GTADELTGLP
YSYTVK
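
The sequences above are printed in space-separated blocks of ten residues, so they need the whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes the gene sequence is given in the coding orientation, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ only in permitted start codons). All function names are illustrative, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove all whitespace (block separators, newlines) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last lines of the gene sequence, copied from this page.
    first_line = "ATGCAGTTTG CCCGCAAACA CCGCATCGCG GTGGTCGCGG TCACGGCGCT GGGACTGGGG"
    last_line = "TACAGCTACA CGGTGAAGTA G"
    print(translate(clean(first_line)))  # MQFARKHRIAVVAVTALGLG
    print(translate(clean(last_line)))   # YSYTVK
    print(round(gc_fraction(clean(first_line)), 2))
```

Pasting the full gene sequence into `clean` and comparing `translate` against the cleaned protein sequence would check the whole record; the two sample lines reproduce the start and end of the listed protein.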