Gene CPR_1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1076 
Symbol 
ID4205402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1223066 
End bp1224472 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content31% 
IMG OID642565632 
Productputative aminopeptidase 1 
Protein accessionYP_698398 
Protein GI110802137 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1362] Aspartyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATG TAAAAGTTTT AAAAAAAGAA TATGAAAATG CTTGGACTAA ATATGAAGAG 
GCAGATTTAA AACAGGTATT TTCATTAAGT GATAGATATA GGGAGTTCAT GTCAGTTGCT
AAAACTGAAA GAGAATGTGT TAAGGTTTTA GCTAATATGG CAGAAAGTAA AGGGTTTAAA
AATTTTTATG AAGTTATTAA AAATGGAGAA AAAGTGACAG CTGGGGATAA ACTTTATTCT
ATAAACATGG ATAAAACAAT AACTTTAATA AAAGTAGGTT CAGAACCATT AGAAAATGGA
TTAAGAATAA TAGGAGCTCA CATAGATTCT CCAAGAATAG ATGTTAAACA AAATCCATTA
TATGAAGATT CAGGATTAGC ATTATTAGAT ACTCATTATT ACGGAGGAGT AAAAAAATAT
CAATGGGTAA CTATACCTCT TGCAATACAT GGAGTTGTTG TAAAAAAAGA TGGAACTAGG
GTAGATATTA AAATAGGAGA AGATGAAAAT GATCCTGTTT TAGGAATTTC AGATCTTTTA
ATCCATTTAT CAGCAGATCA ATTAGATAAA AAAGGAGCTA AAGTAGTAGA AGGAGAAGAC
CTAAATATTT TAGTTGGAAG TATGCCATTA AAAGGAACTG AGGAAAAAGA GGCTGTTAAA
GCTAATATAT TAGTTTTATT AAATGAAAAA TATGGCATAA CTGAGGAAGA TTTTGTATCA
GCTGAGTTAG AAGTAGTTCC TGCTGGTAAG GCAAGAGATT ATGGATTAGA TAGAAGTATG
ATTTTAGCAT ATGGTCACGA TGATAGAATA TGTGCATATA CTTCAGCAGA AGCTTTAATG
GATTTAGAAA ATGTTGATAA AACATGTGTA GCATTATTAG TTGATAAAGA AGAAATAGGT
AGTGTGGGAG CTACAGGAAT GCAATCAAGA TTTTTTGAAA ATATAATTGC AGAACTTATG
GATAGAAAAG GAGAATATTC TGAGTTAAAG CTTAGAAGAT GTCTTCAAAA TTCAATGATG
TTATCAGCTG ATGTTACTGC AGCTTTTGAT CCAAACTATC CTTCTGTATG TGAAAAGAAA
AACACAGCTT ATTTTGGACA TGGAGTAGTA TTTAGCAAAT ATACAGGAGC TAGAGGAAAA
GCAGGTTGCA ATGATGCTAA TGCAGAATAT ATAGCTCACT TAAGAAATAT AATGGATAAA
AATGGTGTTG TATGGCAAAC TGGAGAGCTT GGAAAAGTAG ACCAAGGTGG CGGTGGTACA
ATCGCTTATA TATTAGCTCA ATACAATATG GAAGTTATAG ATTGTGGAGT AGCATTACAA
AATATGCATG CGCCTTTAGA AGTAGCATCT AAAGCAGATT TATATGAAAC TAAAAAATGT
TATAAGGCAT TTTTTGAAGA AGCATAA
 
Protein sequence
MKDVKVLKKE YENAWTKYEE ADLKQVFSLS DRYREFMSVA KTERECVKVL ANMAESKGFK 
NFYEVIKNGE KVTAGDKLYS INMDKTITLI KVGSEPLENG LRIIGAHIDS PRIDVKQNPL
YEDSGLALLD THYYGGVKKY QWVTIPLAIH GVVVKKDGTR VDIKIGEDEN DPVLGISDLL
IHLSADQLDK KGAKVVEGED LNILVGSMPL KGTEEKEAVK ANILVLLNEK YGITEEDFVS
AELEVVPAGK ARDYGLDRSM ILAYGHDDRI CAYTSAEALM DLENVDKTCV ALLVDKEEIG
SVGATGMQSR FFENIIAELM DRKGEYSELK LRRCLQNSMM LSADVTAAFD PNYPSVCEKK
NTAYFGHGVV FSKYTGARGK AGCNDANAEY IAHLRNIMDK NGVVWQTGEL GKVDQGGGGT
IAYILAQYNM EVIDCGVALQ NMHAPLEVAS KADLYETKKC YKAFFEEA