Gene CPF_0868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0868 
Symbol 
ID4203588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1032227 
End bp1033354 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content34% 
IMG OID638081751 
Productmetalloprotease 
Protein accessionYP_695318 
Protein GI110800906 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0549814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATC AAAGATTAAA TAAGTTAGCT AAACTGCTTG TAAATTATTC AACAGGAGTT 
AAGGAAGGAG ACTTTGTTTT TGTATCTTGT AATGAGGTTG CAAATCCTTG GCTTACTGAA
GTAGTAAAGG AAGCTACTAA GGTAGGAGCT CATGTTGAGT ATATTTTAGA ATCAGAAGAA
GCTAAGGAGG CAAGACTTAA ATTTTCTACA AAGGATCAAT TATTATCAGG GAATTTAATG
ATGGAAACTA TGCTTGAAAA GGCAGATGTT TGGTTAAGTG CATGGGGAGC TAGAAATACT
AGAGCCTTTA GCAATATAGA TTCAGAAAAA ATAAAAGATA GCAGAGCTGG AGAAAAGGGA
TGGAGAAAGT TCTATTCAGG AAGAATGGGA GATGGTTCTT TAAGATGGTG TGGAACTCAA
TTTCCTACAT ATGCAGATGC ACAGGAAGCT TCAATGAGTT TTAGTGAATA TGAAGATTTT
GTTTATGGAG CAGGACTTTT AGACGATGAA GATCCTGTGG CAGAATGGAA TAGAGTAAGC
AAAGAACAAG AAAGATGGGT TAAATATTTA GATACTAAAA AAGAACTTCA TATTTTAGCA
GAAGGAACTG ATATTAAGGT TTCAGTAGAG GGAAGAAAGT GGATAAATTG TGATGGTAGA
GTAAACTTCC CAGATGGTGA AATATTTACA TCACCAGTTG AAAATAAGAT AAATGGACAC
ATAACTTTTT CATTCCCAGG GATTTATGCA GGAAAGGAAA TAGAAGGTAT AGAACTTGAA
GTTAAAGATG GTAAAGTTGT TTCATATAAA GCTAAAAAAG GAGAAGATTT ATTAAAGGCT
TTATTAGAAA CTGATGAAGG AGCAAGCCAT TTTGGAGAAG TAGCTATAGG TACAAACTAT
GGAATTAAGA AGTTTACTAG AAATATGCTA TTTGATGAGA AAATAGGAGG AACAGTTCAT
ATGGCTATAG GAGATTCTAT GCCAGAGGCT GGTGGTAAAA ATAGATCATC ACTTCATTGG
GACATGCTTT GTGACATGAG AAATGGTGGA AGAATATATG CAGATGGAGA ACTTTTCTAT
GAAAATGGAG AGTTTAAAAA AGAAATATTA GAAAAATATA ATCTTTAA
 
Protein sequence
MADQRLNKLA KLLVNYSTGV KEGDFVFVSC NEVANPWLTE VVKEATKVGA HVEYILESEE 
AKEARLKFST KDQLLSGNLM METMLEKADV WLSAWGARNT RAFSNIDSEK IKDSRAGEKG
WRKFYSGRMG DGSLRWCGTQ FPTYADAQEA SMSFSEYEDF VYGAGLLDDE DPVAEWNRVS
KEQERWVKYL DTKKELHILA EGTDIKVSVE GRKWINCDGR VNFPDGEIFT SPVENKINGH
ITFSFPGIYA GKEIEGIELE VKDGKVVSYK AKKGEDLLKA LLETDEGASH FGEVAIGTNY
GIKKFTRNML FDEKIGGTVH MAIGDSMPEA GGKNRSSLHW DMLCDMRNGG RIYADGELFY
ENGEFKKEIL EKYNL