Gene Cpin_3678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3678 
Symbol 
ID8359846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4615173 
End bp4616873 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content48% 
IMG OID644965847 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003123341 
Protein GI256422688 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.294778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.160701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCTA ATAAAGCTAT GGCAGTCATG GCTGGTGTTG TGTTGATGGC TGCTCTAACC 
GCTAACCCGA CTTATGCCCA AACCACAGCG ATTCCGAGAA ACTGGCACTT GCTGAATTAT
GCACAGGATA GTGTATACGG TACCGGCGTT GAGAAGGCCT ACAACGAGTT ACTAAAAGGA
AGAAAAAGCA CCCCTGTACT GGTAGCGGTG ATCGATTCAG GTATTGACAC CCTGCATGAA
GATCTTAAAC CCATCCTTTG GGTAAATCCG AAAGAGATCC CTGGCAATGG TATTGACGAC
GACAAAAACG GCTATGTGGA CGACGTTCAT GGCTGGAATT TCCTCGGCGG CAAAGATGGT
CGTAGCGTAA AGGAAGACTC TGACGAGGCA ACCCGCGAAT ACTACCGCTA TAAGAACCTG
TACGGTAACC CGGATTCCGC ACTGGACAAA CAGTCCAAGG AATACCTTTA CTGGCAGAAA
TTGCAGGGTA AAGTAGTGAA ACCTTCCGCT AACGAAGCGA AAGTCACTTA CAAAACCATG
CTGAAATTGC AGGAAAGTCT GCGCAAATGC GAAACGCTGC TCACAGGATA CCTGAAACAA
AATGACTTCA CTGCCGCAAA ACTGGACAGC ATCCAGACGG CTGATGCAGA CGTAATGGTC
GCCAAGAAGT TCATGCAGCG TATCTTCCAG AACACCGGCG AAGAAAATAT CAGCTACTCT
GACCTGAAAT CTGAGTTTGA TGAATACCTC GCCGACCTGA AACGTAAAGC GGAAGCTGCT
GAAAGTGACG GTCCTTCTGA TAAACGCGCA GACATCATCG GTGATAACAT CAACGATATC
AATGATAAAT ACTACGGTAA CGCTGATGTA ATGGGACAAT TCGGTTTCCA TGGCACTCAC
GTTTCCGGTA TCATCGCCGC TGTAAGAGGC AATGGCGTTG GTATGGACGG TATCAATGAC
AACGTACGCA TCATGATGGT GAAAGCCGTT CCTGATGGTG ACGAACGTGA TAAAGACGTC
GCCCTCGCTA TCCGTTATGC GGTAGACAAC GGCGCACGTG TGGTGAATAT GAGTTTTGGT
AAAGGCTTCT CCCCTCACAA AGACTGGGTA GACGCAGCGG TTAAATATGC AGAAGAAAAA
GGCGTATTGC TGATTCATGC CGCGGGTAAC GATGGTAGTG ACAACGACGT GGTAGACAAC
TTCCCGAATC CTGATTTCGC AGATCATTCT CCAAGAGCAA ACAACTACAT CACCGTTGGC
GCCAGCAGCA ATGGCAGAGG CTCCAAAGTA GCCAGCTTCT CCAACTACGG TAAAAAGAAC
GTAGACGTCT TCGCACCCGG TGTACAGATC TACTCTACTG TTCCCGGCGG TAATAAATAC
GGTAGCGCCA GCGGAACAAG TATGGCCGCT CCTGTTGTTG CCGGTGTAGC AGCACTGGTA
CTGGCTTACC ATCCCAACCT GACCGCACAG CAGCTGAAAT ATATCCTCGT GAAATCATCC
ACCCCATTAC CGGATGGTAC GACCGAAGTA AATAAACCAG GTGCAGGCGA TACCAAAGTG
CCTTTCGCAG ATCTGTCTAT CTCCGGTGGC TTAGTAAATG CTTACGAAGC ACTGAAACTG
GCGGATACGA TCGATACCGA AAAAGGTACA AATCCAAAGA AAAAGAAAAA AGCAAAGATG
GAGTCTATTA AGAAAGGCTA A
 
Protein sequence
MKSNKAMAVM AGVVLMAALT ANPTYAQTTA IPRNWHLLNY AQDSVYGTGV EKAYNELLKG 
RKSTPVLVAV IDSGIDTLHE DLKPILWVNP KEIPGNGIDD DKNGYVDDVH GWNFLGGKDG
RSVKEDSDEA TREYYRYKNL YGNPDSALDK QSKEYLYWQK LQGKVVKPSA NEAKVTYKTM
LKLQESLRKC ETLLTGYLKQ NDFTAAKLDS IQTADADVMV AKKFMQRIFQ NTGEENISYS
DLKSEFDEYL ADLKRKAEAA ESDGPSDKRA DIIGDNINDI NDKYYGNADV MGQFGFHGTH
VSGIIAAVRG NGVGMDGIND NVRIMMVKAV PDGDERDKDV ALAIRYAVDN GARVVNMSFG
KGFSPHKDWV DAAVKYAEEK GVLLIHAAGN DGSDNDVVDN FPNPDFADHS PRANNYITVG
ASSNGRGSKV ASFSNYGKKN VDVFAPGVQI YSTVPGGNKY GSASGTSMAA PVVAGVAALV
LAYHPNLTAQ QLKYILVKSS TPLPDGTTEV NKPGAGDTKV PFADLSISGG LVNAYEALKL
ADTIDTEKGT NPKKKKKAKM ESIKKG