Gene Cphy_3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3062 
Symbol 
ID5743388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3744339 
End bp3745322 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content39% 
IMG OID641294163 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001560158 
Protein GI160881190 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000521678 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCTAG CGAGCAGGAC CATTCGGCTA AGGTACGCTC ATGATCGTAG AATATACGGA 
ACCGGGATAG GGGTAGCAAT TGTAGATACA GGGATATGTC AGCATCCGGA TTTTTTAAAA
AATTGTAATC GTAAAATTTA TTTTAAAGAC TTTATTCATC AGAGGACTTT AATGTATGAT
GACTGTGGTC ATGGAACTCA TGTAGCAGGA ATTATTGGAG GGAGTGGCTA TGCCTCTGGT
GGAAAATATA TAGGAGTGGC ACCAAACTGC AATCTGATTA TGGCAAAAAC CTTAAATTAT
AAAGGAGATG GCAATATATC AGATGTACTC ATAGCACTCG ACTGGATTGT GAAGCATAAG
GAAGAGCTTG GTATCCGAAT TGTAAACTTA TCTTTTGGAA TGGGCAACAA AGAAATTAGT
CAGGATGGCA GAAATTTAAT TAATGCAGTT GAAAATGTCT GGGATAGCGG AATCGTCGTG
GTAGCAGCAG CTGGAAATGG TGGACCGAAT TTAGGCAGCG TTGCAGTTCC TGGGAGTAGT
AAGAAAATAA TAACTGTAGG TGCGTCAGAT GATAATATTG AAGTTGACTT AATGGGAAAT
CGTGCGAGAA ATTACTCTGG AAGAGGTCCA ACTTTTGAAT GTATTAAAAA GCCAGATATC
GTTGCACCAG CAAGTAATAT TATGAGTTGC GCGCTAGCTA GGTCTTATGG AGCGGATAAG
TTTCGTGCAT TAAATAATAA TCACTTTTCT TCAGCTGCAA AGGATGGTAG GGAGCGATTT
GGAAATTTCT ATACTGAAAA AAGTGGAACC AGTATGGCAA CTCCAATTGT TAGTGGAGCA
ATTGCTTTGC TATTATCAAT TAAGCCAAAC TTGACGCCCA AAGAAGTCAA GATTAAGATA
CGAGATAGTT GCGTAAACAT TGGAGAACCA CAGAGTAAAC AAGGTTGGGG GCTATTAGAT
ATTGAGAATT TATTAAAGGC GTAG
 
Protein sequence
MDLASRTIRL RYAHDRRIYG TGIGVAIVDT GICQHPDFLK NCNRKIYFKD FIHQRTLMYD 
DCGHGTHVAG IIGGSGYASG GKYIGVAPNC NLIMAKTLNY KGDGNISDVL IALDWIVKHK
EELGIRIVNL SFGMGNKEIS QDGRNLINAV ENVWDSGIVV VAAAGNGGPN LGSVAVPGSS
KKIITVGASD DNIEVDLMGN RARNYSGRGP TFECIKKPDI VAPASNIMSC ALARSYGADK
FRALNNNHFS SAAKDGRERF GNFYTEKSGT SMATPIVSGA IALLLSIKPN LTPKEVKIKI
RDSCVNIGEP QSKQGWGLLD IENLLKA