Gene CPR_0855 details

Gene Information

Locus tag: CPR_0855
Symbol:
ID: 4204317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 994737
End bp: 995864
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 34%
IMG OID: 642565414
Product: metalloprotease
Protein accession: YP_698180
Protein GI: 110801468
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
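
The coordinate and length fields in this record are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in Protein Length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 994737
end_bp = 995864

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.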


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGATC AAAGATTAAA TAAGTTAGCT AAACTGCTTG TAAATTATTC AACAGGAGTT 
AAGGAAGGGG ACTTTGTTTT TGTATCTTGT AATGAGGTTG CAAATCCTTG GCTTACTGAG
GTAGTAAAGG AAGCTACTAA GGCAGGAGCT CATGTTGAGT ATATTTTAGA ATCAGAAGAA
GCTAAGGAGG CAAGACTTAA ATTTTCTACA AAGGATCAAT TATTATCAGG GAATTTAATA
ATGGAAACTA TGCTTGAAAA GGCAGATGTT TGGTTAAGTG CATGGGGAGC TAGAAATACT
AGAGCCTTTA GCAATATAGA TTCAGAAAAA ATAAAAAATA ACAGAGCTGG AGAAAAGGGA
TGGAGAAAGT TCTATTCAGG AAGAATGGGA GATGGCTCTT TAAGATGGTG TGGAACTCAA
TTTCCTACAT ATGCAGATGC TCAAGAAGCT TCCATGAGTT TTAGTGAATA TGAAGACTTT
GTTTATGGAG CAGGTCTTTT AGACCATGAA GATCCTGTGG CAGAATGGAA TAGAGTAAGC
AAAGAGCAGG AAAGATGGGT TAAATATTTA GATACTAAAA AAGAACTTCA TATATTAGCA
GAAGGAACTG ACATTAAGGT CTCAGTAGAG GGAAGAAAGT GGATAAATTG TGATGGTAGA
GTAAACTTCC CAGATGGTGA AATATTTACA TCACCAGTTG AAAATAAGAT AAATGGACAC
ATAACTTTTT CATTCCCAGG TATTTATGCA GGAAAGGAAA TAGAGGGTAT AGAGCTTGAA
GTTAAAGATG GTAAAGTTGT TTCATATAAA GCTAAAAAAG GAGAAGATTT ATTAAAGGCT
TTATTAGAAA CTGATGAAGG AGCAAGCCAT TTTGGAGAAG TAGCTATAGG TACAAACTAT
GGAATTAAGA AGTTTACTAG AAATATGCTA TTTGATGAGA AAATAGGAGG AACAGTTCAT
ATGGCTATAG GAGATTCTAT GCCAGAGGCT GGTGGTAAAA ATAGATCATC ACTTCATTGG
GACATGCTTT GTGACATGAG AAATGGTGGA AGAATATATG CAGATGGAGA ACTTTTCTAT
GAAAATGGAG AGTTTAAAAA AGAAATATTA GAAAAATATA ATATTTAA
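
A GC content figure such as the one in this record is conventionally the fraction of G and C bases in the sequence. The sketch below shows one way to compute it from the space-separated 10-base blocks used above. The function name `gc_fraction` is hypothetical, and only the first row of the gene sequence is embedded to keep the example short, so its result describes that row alone and need not equal the value reported for the whole gene.

```python
# First row of the gene sequence, copied from the record (60 of 1128 bases).
first_row = "ATGGCAGATC AAAGATTAAA TAAGTTAGCT AAACTGCTTG TAAATTATTC AACAGGAGTT"


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces between blocks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


print(f"{gc_fraction(first_row):.0%}")
```

To compare against the GC content field, pass the complete gene sequence instead of a single row.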
 
Protein sequence
MADQRLNKLA KLLVNYSTGV KEGDFVFVSC NEVANPWLTE VVKEATKAGA HVEYILESEE 
AKEARLKFST KDQLLSGNLI METMLEKADV WLSAWGARNT RAFSNIDSEK IKNNRAGEKG
WRKFYSGRMG DGSLRWCGTQ FPTYADAQEA SMSFSEYEDF VYGAGLLDHE DPVAEWNRVS
KEQERWVKYL DTKKELHILA EGTDIKVSVE GRKWINCDGR VNFPDGEIFT SPVENKINGH
ITFSFPGIYA GKEIEGIELE VKDGKVVSYK AKKGEDLLKA LLETDEGASH FGEVAIGTNY
GIKKFTRNML FDEKIGGTVH MAIGDSMPEA GGKNRSSLHW DMLCDMRNGG RIYADGELFY
ENGEFKKEIL EKYNI
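
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline: it builds a codon table with the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits), and translates the first and last rows of the gene sequence. Each full row holds 60 bases, so every row begins in frame. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments of the standard code in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). Translation table 11 uses the same
# assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGGCAGATC AAAGATTAAA TAAGTTAGCT AAACTGCTTG TAAATTATTC AACAGGAGTT"
last_row = "GAAAATGGAG AGTTTAAAAA AGAAATATTA GAAAAATATA ATATTTAA"

print(translate(first_row))  # first 20 residues of the protein
print(translate(last_row))   # last 15 residues followed by '*' for the stop codon
```

The two outputs correspond to the beginning and end of the protein sequence listed above; the trailing TAA stop codon accounts for the difference between 376 codons and 375 residues.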