Gene Caci_3631 details

Gene Information

Locus tag: Caci_3631
Symbol:
ID: 8334984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4062883
End bp: 4064028
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 71%
IMG OID: 644956772
Product: serine protease
Protein accession: YP_003114375
Protein GI: 256392811
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
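
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4062883, 4064028)
print(length, protein_length_aa(length))  # 1146 381
```

Both values agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields in the record.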


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.523915
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.916295
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGCTC GTATCAGGCT TCTGTTCTCC ATGCTCGTCG CGCTTCTGCT GGCCATGGCT 
GTGCCGGCGT TCGCCGATGC CAGTTCCTCG GTCGCGGCGA CCACGTACGC GCATCCGATG
TTCGTTCCCG GGCACGCCGC GCCCGGGCTG CACCCCGACC TCGTGCCGTC CGGTTACGGG
CCGGCTGATC TGCAGTCCGC CTACAAGCTG CCTTCGGGCA CCAACGGCGC CGGCCAGACG
GTGGCGATCG TCGACGCCAA CAACGACCCG ACCGCCGAGG CCGACCTCGG CGTGTACCGG
GCGCAGTACG GCTTGCCGGC GTGCACGACG GCGAACGGCT GCTTCAGGAA GGTGAACCAG
ACCGGCGGCA CGAGCTATCC GCCGACCGAC GCCGGGTGGG CGACCGAGAT ATCCCTGGAC
CTGGACATGG TCTCGGCGGT CTGCCCCAAG TGCCACATCC TGCTGGTCGA GGCCACGTCG
GCCTCGTACG CCAACCTGGG CCAGGCGGTC AACGAGGCGG CGGCGCTGCA CGCCACCACG
ATCTCCAACA GCTACGGCGG CGGCGACCTG TCCGACAGCT CGGCTCCGTA CTACAACCAC
CCCGGCATCA TGATCACGGC CAGCTCCGGT GACGCCGGCT ACGGCGTGGA GTTCCCGGCG
TCCTCGCGCT ACGTCACCGC GGTCGGCGGC ACCTCGCTGA CCCGGGCCTC CAACGCGCGC
GGCTGGAACG AGACCGCCTG GAGCGGCGCC GGCTCCGGCT GCTCGGCCTA CAACCCGGCA
CTGAGCGGCC AGGCCAGCTA CGGCACCGGC TGCGCCCGCC GCGCCGTGGC CGACGTGTCC
GCCGTGGCCG ACCCGGCGAC CGGCGTCGCG GTCTACGACT CGACCCCCTA CGGCGGCCGC
AGCGGCTGGC AGGTCTACGG CGGCACCTCG GTGGCCTCCC CGATCATCGC CTCCGTGTAC
GCCCTGGCCG GCAACGCCGC CAGCATCAAC AACAACTACC CCTACACCCA CTACTCCGCG
AGCACCTTCT TCGACATCAC GTCCGGCTCC AACGGCTCGT GCTCCCCGAC CCAGCTGTGC
CACGCCCGCG TGGGCTGGGA CGGCCCGACC GGCCTGGGCA CCCCCAACGG CGTCGGCGGG
TTCTGA
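
The GC content field is the percentage of G and C bases in a nucleotide sequence. The sketch below shows one way to compute it from the space-separated blocks in this listing. The function name is illustrative, and the example runs only on the first 10-base block of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the
    10-base blocks of the listing can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


first_block = "GTGACAGCTC"  # first block of the gene sequence
print(gc_content(first_block))  # 60.0
```

Passing the full gene sequence to the same function gives the whole-gene percentage, which can be compared against the GC content field.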
 
Protein sequence
MTARIRLLFS MLVALLLAMA VPAFADASSS VAATTYAHPM FVPGHAAPGL HPDLVPSGYG 
PADLQSAYKL PSGTNGAGQT VAIVDANNDP TAEADLGVYR AQYGLPACTT ANGCFRKVNQ
TGGTSYPPTD AGWATEISLD LDMVSAVCPK CHILLVEATS ASYANLGQAV NEAAALHATT
ISNSYGGGDL SDSSAPYYNH PGIMITASSG DAGYGVEFPA SSRYVTAVGG TSLTRASNAR
GWNETAWSGA GSGCSAYNPA LSGQASYGTG CARRAVADVS AVADPATGVA VYDSTPYGGR
SGWQVYGGTS VASPIIASVY ALAGNAASIN NNYPYTHYSA STFFDITSGS NGSCSPTQLC
HARVGWDGPT GLGTPNGVGG F
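
The gene sequence begins with GTG while the protein begins with M. Under the bacterial translation table (table 11), GTG can serve as a start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first three blocks of the gene sequence. It is a simplification: it uses the standard codon-to-amino-acid assignments (which table 11 shares) and treats the first codon as methionine without checking that it is a valid start codon. All names in it are illustrative.

```python
from itertools import product

# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(seq: str, start_as_met: bool = True) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is removed first. If start_as_met is True, the first
    codon is reported as M regardless of its sequence (a simplification).
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        if i == 0 and start_as_met:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


print(translate_cds("GTGACAGCTC GTATCAGGCT TCTGTTCTCC"))  # MTARIRLLFS
```

The output matches the first block of the protein sequence listed above.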