Gene CPF_1705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1705 
Symbol 
ID4200975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1927319 
End bp1928347 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content28% 
IMG OID638082577 
Productrhomboid family protein 
Protein accessionYP_696141 
Protein GI110799144 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.745116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAT TTGAGCAAGA TTATTTCAAC TTACTTATAA ATAATTATGG TTTTTATGTA 
GAAGACTTAA AAGGAGAACA GGATAAAGAA CTATGGATAG CTTTAAAGAC AGTTAAAGAT
GATGGAAAAT ATGCAGTTAT AATATCTAAG TCTTATGAAG AGGAAGAAAA TTTAAAAATT
GCAGAAGATT ATTTAAAAAG CTTAGGTAAG TCATATTCTC TTCATAATAT AATTCTTTAT
AAAAGCTATG ATAGGGATGA AAAAAAGGAT GAAGACTTTT CTATAGATGA GAATTGTCAT
AGAGTAATTG TTGATGTTCA GAAAAGAGAA GTTTTAAAAA GTGATAGGAG CTCTGAGCCT
CTAGCAAAAA TATTAGAATT TTTATTAAAA AAGAAAGAAG AACCAAAGGT TCCTTGGTAT
AAAAAATTAA GATGTGGAAA AGTTACAGGA ATATTGATTG GTTTAAATAT TTTAGCTTTT
CTAGTTTGTC TTATTGTAGC TACTGCTTTA GGTGCTGGAT TCTTCAGAAA TATAGTAGAG
ATGAATCCAC AAATTCTATA TTGGATGGGT GCTAAGCATA ATAATGCAAT AATATTCCAC
GGAGAATATT ATAGATTAGT AACCTCTATG TTTTTGCATG GTGGAATAGT ACATCTTTTA
TTTAATATGT ATGCTCTATA TATATTAGGA GATTTCATAG AAAGGATTTA TGGAGCGAAA
AAATATTTAG CTATCTATTT TGTTTCAGGA ATAGTAGCAA GTATATTTAG CTTATACTTT
TCACCAGTTA TGGGAGTTGG AGCTTCAGGA GCTATATTTG GACTTTTAGG GGCAGCTTTA
GTTTTTGCTT ATAATGAAAA AGATAGAATT GGTAAAGCCT TAGTAACTAA TATAATAGTT
ATTATATTGC TTAATGTATT TATCGGTCTA TCAATGTCTA ATATAGATAT ATCTGCTCAT
TTTGGCGGAT TCATAGCAGG AGCTATTTTA GGACTTTTCT TCCATAATTA TAAAATAATA
AGAAAATAA
 
Protein sequence
MSKFEQDYFN LLINNYGFYV EDLKGEQDKE LWIALKTVKD DGKYAVIISK SYEEEENLKI 
AEDYLKSLGK SYSLHNIILY KSYDRDEKKD EDFSIDENCH RVIVDVQKRE VLKSDRSSEP
LAKILEFLLK KKEEPKVPWY KKLRCGKVTG ILIGLNILAF LVCLIVATAL GAGFFRNIVE
MNPQILYWMG AKHNNAIIFH GEYYRLVTSM FLHGGIVHLL FNMYALYILG DFIERIYGAK
KYLAIYFVSG IVASIFSLYF SPVMGVGASG AIFGLLGAAL VFAYNEKDRI GKALVTNIIV
IILLNVFIGL SMSNIDISAH FGGFIAGAIL GLFFHNYKII RK