Gene Cphy_0533 details

Gene Information

Locus tag: Cphy_0533
Symbol:
ID: 5743447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 678258
End bp: 679514
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 34%
IMG OID: 641291645
Product: TPR repeat-containing protein
Protein accession: YP_001557659
Protein GI: 160878691
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
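
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 678258
end_bp = 679514

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1257 bp
print(protein_length, "aa")  # 418 aa
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.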


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00172679
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGATACAT TGAAGAAGAA AAAAGCATTT CTAGCGGTAT TATGTTTATC GCTTGTTTTT 
AGCGGCTGTG GGCTAAGTAA GGCAAATCGA AGCTATAGTA AAGCGATGTC CTATTATGAA
AATGGAGAAT ATGAAAATGC TGAGAAAAGT TTTATAGATG CTATTAAAGC AAATCCGGAT
AAAGCAGAGT TTTATTTAGA TTATGGTTTT ACATTGATTA AACTCTCACG GTATGAAGAT
GCCATTAAGG AATTTGAAAG CATAATTATG GATAAAGAAA TAGCTATAGT AAAACAAAAT
AACAAAAAAG CTTATCGAGG TATTGGTATC GCTTATTTAT ATGCACAGTC TTATGAAGAA
GCGATTAAAA ATTTTGATTT AGCACTCGCA ATCTCAGAAG AGAAAAACTT AGATACGGAT
ATTCTATATT ACAAAGGGAA TGCACTAGAG AGAAGTGGAA ATCTTGAGGA AGCATCGAAT
ATCTATTCTG AAATTTTAGA AACCGAAAAA GATGATACTG CGATTTATAA TGCGAGAGCG
AATATTAATA GGATTCTTGG TAATTATGAA GAGAGTATCA AAGACTATGA TAAGGCAATA
GAAATTTCCA AGGGAGACTT TGACCTTTAT TTTGGAAAAT TTGCTGCTTT AAAAGAACTC
TCCCGTACAG CCGAGGCAGA GGAAGTATTA AAGATTGCAG CTTCGCTTCC AGTCCACACA
GAGAGAGATA GCTTTGAGTT AGCGAAGGTT TATTTTTATC AGAAAAATTA TGACCATGCA
AAGCTTCAAT TAGCACAGTC ATTAAAAAAT GGTTTTATAG AAGCAAATTA CTTCCTAGGA
GAAATAAGCA TAGAAGAAGG AGATTTTAAA CAAGCGATCC AGTATTTTGA GACCTTTGAG
GAATCAGGTG GTATGGTGTC TGCCATGTTT TATAATCAGC TTCTGACCTG TTATTTGAAT
GAGGAAGAGT ACGATAAGGC GAAAAATTGT CTAAAGAAGG CAAAAAAATA CTCTGATGTA
ACGATAGAAC AGCAACTTTT AAGAAATGAG ATTATACTAT TAGAAAAGAC CGGAGACTTT
AAAGAAGCAT ATGAAAAGAT GAAAAAGTAT CTGGTTCGCT ATCCAGACGA TGTGGATGCG
AAGAAAGACG CTACCTTCTT AAAAACTAGA GTGGAAGGTG CAAGTAACGA AACAAATACA
AATCAAACAA ATCAAACAGA GCAAGTAGGA ACATCTGAAA CTGTAAAGAA ACCATAA
 
Protein sequence
MDTLKKKKAF LAVLCLSLVF SGCGLSKANR SYSKAMSYYE NGEYENAEKS FIDAIKANPD 
KAEFYLDYGF TLIKLSRYED AIKEFESIIM DKEIAIVKQN NKKAYRGIGI AYLYAQSYEE
AIKNFDLALA ISEEKNLDTD ILYYKGNALE RSGNLEEASN IYSEILETEK DDTAIYNARA
NINRILGNYE ESIKDYDKAI EISKGDFDLY FGKFAALKEL SRTAEAEEVL KIAASLPVHT
ERDSFELAKV YFYQKNYDHA KLQLAQSLKN GFIEANYFLG EISIEEGDFK QAIQYFETFE
ESGGMVSAMF YNQLLTCYLN EEEYDKAKNC LKKAKKYSDV TIEQQLLRNE IILLEKTGDF
KEAYEKMKKY LVRYPDDVDA KKDATFLKTR VEGASNETNT NQTNQTEQVG TSETVKKP
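
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the method the database used: it relies on the fact that NCBI translation table 11 assigns amino acids to codons in the same way as the standard code (the tables differ in the start codons they permit), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
# Amino-acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with the display spacing intact.
gene_start = "ATGGATACAT TGAAGAAGAA AAAAGCATTT"
print(translate(gene_start))  # MDTLKKKKAF
```

Pasting the full gene sequence into `translate` and `gc_content` should allow a comparison with the protein sequence and the GC content field listed in the record.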