Gene Cphy_0042 details


Gene Information

Locus tag: Cphy_0042
Symbol:
ID: 5744962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 59873
End bp: 61063
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 37%
IMG OID: 641291124
Product: rhomboid family protein
Protein accession: YP_001557171
Protein GI: 160878203
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
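
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 59873
end_bp = 61063

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: one stop codon, which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1191 396
```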


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0740017
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAATTAT ATAATAAAAT ATTAGAATAT TTTTTAACCA AGGGATGTCA GCGTTTACAG 
GTAAATGCAT CTGGTATGGA TGTATTCTAT CAAACAAATC TAGAGCAATG CACCGTATAT
TGGCTGATTG AAATTTATAA TGGAACAGAA ATCTTAAATG ATCAGTATCA AAACATTAAA
CGTCAAATCG AAACAGCCTT TTTTAGCAAG GGTTATCAAA AGGTTTTGGT ATACGCAATT
CTTTGCACAA AGAATACAGA GGAAGCTTTA AAGATTTGTA GGATGAGCGA AAGCGCTTGT
ATCGTTGATA TGCTTCAGTA TCGTCTGATG CTTTTTGAGG GTCAGGCTAG GGATGAGAAT
GGAGTATTTC GTGGGTTAGA GCAACTAATT TCAGAATATG GAAGATTATA TTTTGCTACC
CATGGAGGGC AGGATTATTC AAGCCAAAAT ACTGATAGTC AAGGAAACAA TGAAAACTCT
TGGGGTGGTT ATGGCAGTGA TTACGAAAAT CAAAGAAGTC AAGGTACCCA TCAACAAGGC
TCCTATCAGC AAGGGTACGG TGATTATCAG TCAGGAAAAG CATATCAGAG AAAGAACGGA
TTTTTCTCTG GTGTTGGCAT AGTTACACTA GTTCTAATTG CACTAAACGC AGTGGTATTT
TTCTATACTG ACCTGTCCGG TAATTATAAT AAGATTATTT CAGAAGGTTG CATTTTCTGG
CCGCTTATAA AATTTAATAA CGAATACTAT CGACTTTTAA CCTATCAATT TTTACATGCG
AATATAAGTC ATCTAGTAAA TAACATGCTT ATTCTGGCAA TTATGGGGTC CACATTGGAG
CGACATGTCG GTAAATTTAA GTATCTATTG ATTTACTTTC TGTCTGGAAT TGTTGCAGGG
ATTGCCTCTA TGAGTTATAA TATGTGGAAA GGTTTATTTA GCAATAGTAT TGGCGCGTCA
GGAGCAGTAT TTGGCGTTAT AGGAGCAATC GCATTGATTG TAGTAGTGAA CAAAGGAAGA
CTAGAGACGA TTGGTACCAG GCAGATTATC ATCTTTATAG CACTTAGTTT GTATGGAGGA
TTTACAAGTC AGGGTGTTGA TAATGCTGCT CATGTGGGAG GTCTGCTTGC GGGCTTCTTT
ATTGCTATGC TTGTTTACCG AAAGAAGAGG GGGCGCATAC GTGAAGATTA A
 
Protein sequence
MELYNKILEY FLTKGCQRLQ VNASGMDVFY QTNLEQCTVY WLIEIYNGTE ILNDQYQNIK 
RQIETAFFSK GYQKVLVYAI LCTKNTEEAL KICRMSESAC IVDMLQYRLM LFEGQARDEN
GVFRGLEQLI SEYGRLYFAT HGGQDYSSQN TDSQGNNENS WGGYGSDYEN QRSQGTHQQG
SYQQGYGDYQ SGKAYQRKNG FFSGVGIVTL VLIALNAVVF FYTDLSGNYN KIISEGCIFW
PLIKFNNEYY RLLTYQFLHA NISHLVNNML ILAIMGSTLE RHVGKFKYLL IYFLSGIVAG
IASMSYNMWK GLFSNSIGAS GAVFGVIGAI ALIVVVNKGR LETIGTRQII IFIALSLYGG
FTSQGVDNAA HVGGLLAGFF IAMLVYRKKR GRIRED
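
The relationship between the gene and protein sequences can be spot-checked by translating the first and last blocks of the gene sequence. This is a minimal Python sketch, not the database's own method. It assumes the standard amino-acid assignments (which translation table 11 shares with the standard code) and that the initiator codon, here TTG, is read as methionine. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks that group bases in tens."""
    return "".join(block.split()).upper()


def translate(dna: str, is_gene_start: bool = False) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = "".join(CODON_TABLE[c] for c in codons)
    if is_gene_start and protein:
        # Assumption: the initiator codon is read as methionine.
        protein = "M" + protein[1:]
    return protein


# First 30 bases and final 51 bases, copied from the gene sequence above.
first = clean("TTGGAATTAT ATAATAAAAT ATTAGAATAT")
last = clean("ATTGCTATGC TTGTTTACCG AAAGAAGAGG GGGCGCATAC GTGAAGATTA A")

print(translate(first, is_gene_start=True))  # MELYNKILEY
print(translate(last))                       # IAMLVYRKKRGRIRED*
```

Both results agree with the start and end of the listed protein sequence, with the trailing `*` marking the TAA stop codon.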