Gene Caci_8798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8798 
Symbol 
ID8340191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp10200395 
End bp10202227 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content71% 
IMG OID644961888 
ProductPeptidase S53 propeptide 
Protein accessionYP_003119452 
Protein GI256397888 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.424248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.572713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATCTGG CTGGGCAGGA TGCGCAGGGG CTGGCTGACT ACGCGCGGTC GGTCGCCGAT 
CCCAAGAGTG GGGCGTATCA GCACTTTCTG ACGCCGGCTC AGGTTCAGGC GCGGTTCGGT
GCCAGTGCTG AGCAGGTGGC GGCGGTCAAG GCGTGGCTCA GTGGCGCCGG GTTGACCGTC
ACCGGGCAGA CCGAGGACTA CATAGCCGTG CAGGGCAGCA CTGCTGCCGT CGCGGCTGCT
CTGAACACCT CGTTCCACGA GTACGCCACC TCCGATGGCT CGCTGCGGGC GCCGTCGCGG
GACGTCAGCG TGCCGGCCGG GGTGCGGTCG GCGATCATCG GCATCACCGG GTTGTCGCAG
GAGCGGACGG CCAACAAGCC GAACAACTCC GACGCCGCCG ACGCCCGCGA CAAGCAGAGC
TCGAAGGTCG CCGCCGACGG CGTTCCGTAC CTGGGTACCT ATCCCTGCTC GGACTACAGC
GGGCAGCAGG TCGCCACGAG CCTGCCGGCG CTGAACGGCA AGTCCGTGCC GTGGGCGGTC
TGCGGCTACA CGCCGAAGCA GGTGCGCGGC GCCTACGGCG TCAGCGACAG CGGCCTGAGC
GGCAAGGGTG TCACCGTCGC GGTGTTGGAC GCCTACGGGC TGCCGACCAT GCAGGCCGAC
GCGAACAAGT ACGCGTCCCT GCACGGCGAC AAGCCGTTCC GCCCGGGCCA GTACTCGGAG
ATCGTCACCC CCGGCCAGTG GACGGACGCC GACGCCTGCG GCGGTGCGGA CGGCTGGGCC
GGCGAGGAGG CGCTGGACGT CGAGTCGGTG CACGGCATGG CGCCCGACGC GAAGGTCCTC
TACATCGGCG CGAACTCCTG CTTCGACACC GCCGGCGACG GCGTCGCGGC CAACGGCGGG
CTGCTGGACT CCCTGCAGCT GGTCGTGAAC CACCACCTGG CGGACATGGT GTCCAACTCG
TGGGGCGAGC TGATGCACTT CCTGGACCCG TCGGGCAACC CGGTCGACCT CGACCCGGCG
CTGATCCAGA TCTACGAGCA GACCTTCCAG AAGGGCGCCG CGGAGGGCAT CGGCTTCTAC
TTCTCCTCCG GTGACTGCAG CGACGACAAC ACGGGCAGCG GCTGCGGAGC CGACAACGGC
TCCTCGCGCT CGCAGGCCGA GTACCCGACC TCCTCCCCGT GGGTGACCTC AGTCGGCGGC
ACCAGCGTGG CCATCGGCGC GGACAACCGG CTGCAGTTCC AGACCTCGTG GCAGACCGCC
AGCTCCTCGC TGGCGACCGG CAACTCGGCC TGGACCCCCT CCTCCTACCT CTACGGCGGT
GGCGGCGGCA CCAGCGACGT GTTCGCGCAG CCGTGGTACC AGAGCTGGAC GGTCCCGTCC
TCGCTGTCCC GCACGCTGCT GGACGGCACG GCGACCTCGC CCAAGCGCGT GGTCCCGGAC
GTCGCCGCCT ACGGCGACCC GAGCACCGGC TTCCTGCAGG GCTACACCCA GGAGCTGCCC
GACGGCTCGA CCGGCTACGC CGAGTCCCGC ATCGGCGGCA CCAGCCTGGC CGCGCCGACC
TTCGTCGGCA TCCAGGCCGA CGCGCAGCAG GCCCAGCACC GGGCGATCGG CTTCGCGAAC
CCGGAGATCT ACCTGCGCGC GACCTTCGGC CTGTTCACCG ACGTGACCGA CCACCCGCGC
ACGAACACCC CGCTGGCCGT GGTCCGCGGC CTGCCGACCG CCCCGTCGCT GCGCCTGCTC
GGTGACGGCG TCGACCTGCA CGCGACCCAG GGCTACGACA ACGCCACCGG CGTCGGCAGC
CCGAACGCCC GCTATCTCGC TTCCTTCAGG TGA
 
Protein sequence
MYLAGQDAQG LADYARSVAD PKSGAYQHFL TPAQVQARFG ASAEQVAAVK AWLSGAGLTV 
TGQTEDYIAV QGSTAAVAAA LNTSFHEYAT SDGSLRAPSR DVSVPAGVRS AIIGITGLSQ
ERTANKPNNS DAADARDKQS SKVAADGVPY LGTYPCSDYS GQQVATSLPA LNGKSVPWAV
CGYTPKQVRG AYGVSDSGLS GKGVTVAVLD AYGLPTMQAD ANKYASLHGD KPFRPGQYSE
IVTPGQWTDA DACGGADGWA GEEALDVESV HGMAPDAKVL YIGANSCFDT AGDGVAANGG
LLDSLQLVVN HHLADMVSNS WGELMHFLDP SGNPVDLDPA LIQIYEQTFQ KGAAEGIGFY
FSSGDCSDDN TGSGCGADNG SSRSQAEYPT SSPWVTSVGG TSVAIGADNR LQFQTSWQTA
SSSLATGNSA WTPSSYLYGG GGGTSDVFAQ PWYQSWTVPS SLSRTLLDGT ATSPKRVVPD
VAAYGDPSTG FLQGYTQELP DGSTGYAESR IGGTSLAAPT FVGIQADAQQ AQHRAIGFAN
PEIYLRATFG LFTDVTDHPR TNTPLAVVRG LPTAPSLRLL GDGVDLHATQ GYDNATGVGS
PNARYLASFR