Gene Caci_1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1899 
Symbol 
ID8333242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2154531 
End bp2155970 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content69% 
IMG OID644955048 
Productprotease-like protein 
Protein accessionYP_003112660 
Protein GI256391096 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.116716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCC GCAAACGTGC CACCCGTGCC GGCGCGCTGT CCCTCGCGAT CGCGAGCGCG 
TTCGCCCTGG CTGCTTCCTC TGCTGCGTCC ACTGTGACGT CGGCGGGGAT CCAGGGGTCC
GGCAAGAGCG TCCAGGCGCA CCCCATGACC TGGGGGAGCC GGGAGGTGGC TGATCTGCCG
ACGCCGTTGC CGACCTCGCA ATGCAAGGCT CAGCTCGGTA TCAACTGCTA CAGCCCTCTG
CAGTACCGCA GTGCCTATGA CCTGAACCCC CTGTACCAGG CGGGGATCAC CGGTCGGGGG
AAGACGATCG TGATCGTCGA CTCCTACGGA TCGCCGACCA TCCAGGCCGA CTTGGACGTC
TTCGACAAGC AGTGGGGTCT GCCGGACACC AAGGTGGACG TGCGGCAGTT CGGGACCATC
CCGGCGTTCG ACCCGACCGA CTCCACCATG GTCGGGTGGG CCGACGAGAC GACGCTGGAC
GTCGAGTACG CGCACGCGAT CGCCCCCGGC GCGAAGATCG TGCTGGCCGA GACCGCGGTC
GCCGAGACCG AGGGCGTCAC GGGCCTGCCG GAGATGATGA ACGCTGAGAA GTCGCTCATC
GACGCCGGGG TCCCGGACGT GATCTCGCAG AGCTTCGGCG CGACCGAGGA CACGTTCCCC
GGGTTCGACC AGCACGACTA CTCCTCGCTG ACGAACCTGC GCTACGCGTT CAAGGACGCC
GCGGCGCACC ACGTGACCGT GCTGGGCTCC TCCGGCGACA ACGGCGTGAC CAGCCAGACC
CTCGACGGCA ACGGCTTCTT CCCGTACGCG GCGAACTCCT GGCCCTCCTC CGACCCGCTG
GTCACCTCGA TCGGCGGCAC GTACCCGGCG ATCGACGACA CCGGCAAGCG CCTGGCGCCC
GACGTCACCG GCAACGACAA CGACCTGCTC TACCCGGGCG GCGTCGTCGG CGGCGGCGGC
CAGTCCCACG TCTTCAAGCG CCCGGACTAC CAGAACAGCG TCAAGAGCGT CGTCGGCGCC
CAGCGCGGCA CCCCCGACGT CTCCTTCAGC GCCACCCTGT CCGGCGCGGC ATGGGTGTAC
TACAGCTTCA CCAACCCGGG CTGGCACCTG ATCGCCGGCA CCAGCGAGTC CTGCCCGATC
ATGTCCGGCG TCGTCGCCCT CGCCGCCCAG GCCGCCGGCC ACCGCCTCGG CAACATCAAC
CCGGCCCTGT ACGAACTGGG CCAGGTGTCG AAGAACCCGG CCTTCGGCAA GTACACCGGC
ATCCAGGACG TGACCGTCGG CAACATCAGC GACAACGGCG TCACCGGCCC GAACGCCGGA
CCCGGCTACG ACATGGCCAC CGGCTGGGGC ACCATCGACG GAGCCCGCTT CGTCCCGGCC
CTGGCGATCG CCGCCTCCGC CCCGAGCAAT CAGGGCAACC GGGAGGATCA GGGGCACTGA
 
Protein sequence
MSTRKRATRA GALSLAIASA FALAASSAAS TVTSAGIQGS GKSVQAHPMT WGSREVADLP 
TPLPTSQCKA QLGINCYSPL QYRSAYDLNP LYQAGITGRG KTIVIVDSYG SPTIQADLDV
FDKQWGLPDT KVDVRQFGTI PAFDPTDSTM VGWADETTLD VEYAHAIAPG AKIVLAETAV
AETEGVTGLP EMMNAEKSLI DAGVPDVISQ SFGATEDTFP GFDQHDYSSL TNLRYAFKDA
AAHHVTVLGS SGDNGVTSQT LDGNGFFPYA ANSWPSSDPL VTSIGGTYPA IDDTGKRLAP
DVTGNDNDLL YPGGVVGGGG QSHVFKRPDY QNSVKSVVGA QRGTPDVSFS ATLSGAAWVY
YSFTNPGWHL IAGTSESCPI MSGVVALAAQ AAGHRLGNIN PALYELGQVS KNPAFGKYTG
IQDVTVGNIS DNGVTGPNAG PGYDMATGWG TIDGARFVPA LAIAASAPSN QGNREDQGH