Gene Cphy_3606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3606 
Symbol 
ID5742630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4452740 
End bp4453996 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content36% 
IMG OID641294716 
Productcarboxyl-terminal protease 
Protein accessionYP_001560692 
Protein GI160881724 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000172646 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA GATTTGTGAC CGGCGTAGTC TCCGGAGTGA TCTGTACCTT GATTATTTGT 
ACTCTTTCTT TTGGCATTCT ATATCGTAAT GCTTTAGCAC AAAAAGATAA TGCCTACACG
AATGAATCAG CACAGTCAAC AGATAATGGC ACGAAAACAG AAGGAAGTAC TGCAATTGAT
GAAGCAACGT TTCAAAAGAA ACTAAAGTAT ATTAAAAACT TAGTTAATAA TTATTATTTA
TGGGATGTAA ATGAGGGGGA TTTTCAAACC GGTATGTTAA AGGGGATGAT GAGTGCCCTA
AATGACCCAT ATTCTACTTA TTATACGAAA GAGGAATATG ACGCCTTGAT GGAAACTACC
AATGGTATTT ACTATGGTAT CGGTGCTACT GTTAGCCAGA ATGTAAATAC TGGTATCATT
ACTATAGTAA AGCCATTTGT CAATGGACCT GCAAATAAGG CTGGTGTTCT TCCGGGAGAT
ATTCTTTATA AGGTGGAAGA CGAAGAGGTA ACAGGTACTG AACTCACCAA AGTTGTTAGT
AAGATGAAAG GTGAAGAAAA TACCATAGTT AAAATCACTG TTATGAGAGA AGGCAAAAGT
GAACCAATTG AGATTTCGAT TACAAGAGGT CAGGTAGAGA TTCCAACCAT TGAACATGAG
ATGTTAAAAG ATAAAATTGG TTACATTAGC ATTCTAGAAT TTGATAAAAT AACAGTAGAT
CAATACATGG CAGCGATTAA TGACTTAGAA AAACAAGGAA TGAAAGGTCT TGTAATTGAC
CTTCGTGATA ATCCAGGTGG ATTATATGAT TCTGCAGTTA AGATGCTCGA TCGTATCATA
GGGAAAGGGC TATTAGTTTA TACTGAGACT AAGGATGGTA CTCGTTCCGA AGATTATGCG
ACTTCAAAAG AAGAATTAAA GGTTCCATTG ACTGTAATCG TAAATGGCAA TAGTGCAAGC
GCTTCCGAGA TCTTCGCTGG TGCAATTCAG GATTATAAGA AGGGTACTAT TGTAGGTACG
CAGAGTTTTG GAAAAGGTAT TGTACAGTCC CTCTTCCCAT TGTTTGATGG AAGTGCGGTG
AAGGTAACGG TATCCAACTA CTTTACTCCA AATGGAAGAA GCATTCATAA AACAGGAATT
ACCCCAGATG TGGTAGTAGA GTTAAATGAA GAATTAAAGA AAAAAGTAGT GATTACTCAT
GATGAAGATA ATCAGCTTCA GAAAGCTATT GAGGTCTTGA AAAGTCAAAT AAAATAA
 
Protein sequence
MKNRFVTGVV SGVICTLIIC TLSFGILYRN ALAQKDNAYT NESAQSTDNG TKTEGSTAID 
EATFQKKLKY IKNLVNNYYL WDVNEGDFQT GMLKGMMSAL NDPYSTYYTK EEYDALMETT
NGIYYGIGAT VSQNVNTGII TIVKPFVNGP ANKAGVLPGD ILYKVEDEEV TGTELTKVVS
KMKGEENTIV KITVMREGKS EPIEISITRG QVEIPTIEHE MLKDKIGYIS ILEFDKITVD
QYMAAINDLE KQGMKGLVID LRDNPGGLYD SAVKMLDRII GKGLLVYTET KDGTRSEDYA
TSKEELKVPL TVIVNGNSAS ASEIFAGAIQ DYKKGTIVGT QSFGKGIVQS LFPLFDGSAV
KVTVSNYFTP NGRSIHKTGI TPDVVVELNE ELKKKVVITH DEDNQLQKAI EVLKSQIK