Gene CPR_1438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1438 
Symbol 
ID4204270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1618370 
End bp1619398 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content28% 
IMG OID642565992 
Productrhomboid family protein 
Protein accessionYP_698757 
Protein GI110803398 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAT TTGAGCAAGA TTATTTCAAC TTACTTATAA ATAATTATGG TTTTTATGTA 
GAAGACTTAA AAGGAGAACA GGATAAAGAA CTATGGATAG CTTTAAAGAC AGTTAAAGAT
GATGGAAAAT ATGCAGTTAT AATATCTAAG TCTTATGAAG AGGAAGAAAA TTTAAAAATT
GCAGAAGATT ATTTAAAAAG CTTAGGTAAG TCATATTCTC TTCATAATAT AATTCTTTAT
AAAAGCTATG ATAGGGATGA AAAAAAGGAT GAAGACTTTT CTATAGATGA GAATTGTCAT
AGAGTAATTG TTGATGTTCA GAAAAGAGAA GTTTTAAAAA GTGATAGGAG CTCTGAGCCT
CTAGCAAAAA TACTAGAATT TCTATTAAAA AAGAAAGAAG AACCAAAGGT TCCTTGGTAT
AAAAAATTAA GATGTGGAAA AGTTACAGGA ATATTGATTG GTTTAAATAT TTTAGCTTTT
CTAGTTTGTC TTATTGTAGC TACTGCTTTA GGTGCTGGAT TCTTCAGAAA TATAGTAGAG
ATGAATCCCA AAATTCTATA TTGGATGGGT GCTAAGCATA ATAATGCAAT AATATTTCAT
GGAGAATATT ATAGATTAGT AACCTCTATG TTTTTGCATA GTGGAATAGT ACATCTTTTA
TTTAATATGT ATGCTCTCTA TATATTAGGA GATTTCATAG AAAGGATTTA TGGAGCGAAA
AAATATTTAG TTATCTATTT TGTTTCAGGA ATAGTAGCAA GTATATTTAG CTTATACTTT
TCACCAGTTA TGGGAGTTGG CGCTTCAGGA GCTATATTTG GACTTTTAGG GGCAGCTTTA
GTTTTTGCTT ATAATGAAAA AGATAGAATT GGTAAAGCTT TAGTAACTAA TATAATAGTT
ATTATATTGC TTAATGTATT TATCGGTCTA TCAATGTCTA ATATAGATAT ATCTGCTCAT
TTTGGCGGAT TTATAGCAGG AGCTATTTTA GGACTTTTCT TCCATAATTA TAAAATAATA
AGAAAATAA
 
Protein sequence
MSKFEQDYFN LLINNYGFYV EDLKGEQDKE LWIALKTVKD DGKYAVIISK SYEEEENLKI 
AEDYLKSLGK SYSLHNIILY KSYDRDEKKD EDFSIDENCH RVIVDVQKRE VLKSDRSSEP
LAKILEFLLK KKEEPKVPWY KKLRCGKVTG ILIGLNILAF LVCLIVATAL GAGFFRNIVE
MNPKILYWMG AKHNNAIIFH GEYYRLVTSM FLHSGIVHLL FNMYALYILG DFIERIYGAK
KYLVIYFVSG IVASIFSLYF SPVMGVGASG AIFGLLGAAL VFAYNEKDRI GKALVTNIIV
IILLNVFIGL SMSNIDISAH FGGFIAGAIL GLFFHNYKII RK