Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1076 |
Symbol | |
ID | 4205402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1223066 |
End bp | 1224472 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642565632 |
Product | putative aminopeptidase 1 |
Protein accession | YP_698398 |
Protein GI | 110802137 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1362] Aspartyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATG TAAAAGTTTT AAAAAAAGAA TATGAAAATG CTTGGACTAA ATATGAAGAG GCAGATTTAA AACAGGTATT TTCATTAAGT GATAGATATA GGGAGTTCAT GTCAGTTGCT AAAACTGAAA GAGAATGTGT TAAGGTTTTA GCTAATATGG CAGAAAGTAA AGGGTTTAAA AATTTTTATG AAGTTATTAA AAATGGAGAA AAAGTGACAG CTGGGGATAA ACTTTATTCT ATAAACATGG ATAAAACAAT AACTTTAATA AAAGTAGGTT CAGAACCATT AGAAAATGGA TTAAGAATAA TAGGAGCTCA CATAGATTCT CCAAGAATAG ATGTTAAACA AAATCCATTA TATGAAGATT CAGGATTAGC ATTATTAGAT ACTCATTATT ACGGAGGAGT AAAAAAATAT CAATGGGTAA CTATACCTCT TGCAATACAT GGAGTTGTTG TAAAAAAAGA TGGAACTAGG GTAGATATTA AAATAGGAGA AGATGAAAAT GATCCTGTTT TAGGAATTTC AGATCTTTTA ATCCATTTAT CAGCAGATCA ATTAGATAAA AAAGGAGCTA AAGTAGTAGA AGGAGAAGAC CTAAATATTT TAGTTGGAAG TATGCCATTA AAAGGAACTG AGGAAAAAGA GGCTGTTAAA GCTAATATAT TAGTTTTATT AAATGAAAAA TATGGCATAA CTGAGGAAGA TTTTGTATCA GCTGAGTTAG AAGTAGTTCC TGCTGGTAAG GCAAGAGATT ATGGATTAGA TAGAAGTATG ATTTTAGCAT ATGGTCACGA TGATAGAATA TGTGCATATA CTTCAGCAGA AGCTTTAATG GATTTAGAAA ATGTTGATAA AACATGTGTA GCATTATTAG TTGATAAAGA AGAAATAGGT AGTGTGGGAG CTACAGGAAT GCAATCAAGA TTTTTTGAAA ATATAATTGC AGAACTTATG GATAGAAAAG GAGAATATTC TGAGTTAAAG CTTAGAAGAT GTCTTCAAAA TTCAATGATG TTATCAGCTG ATGTTACTGC AGCTTTTGAT CCAAACTATC CTTCTGTATG TGAAAAGAAA AACACAGCTT ATTTTGGACA TGGAGTAGTA TTTAGCAAAT ATACAGGAGC TAGAGGAAAA GCAGGTTGCA ATGATGCTAA TGCAGAATAT ATAGCTCACT TAAGAAATAT AATGGATAAA AATGGTGTTG TATGGCAAAC TGGAGAGCTT GGAAAAGTAG ACCAAGGTGG CGGTGGTACA ATCGCTTATA TATTAGCTCA ATACAATATG GAAGTTATAG ATTGTGGAGT AGCATTACAA AATATGCATG CGCCTTTAGA AGTAGCATCT AAAGCAGATT TATATGAAAC TAAAAAATGT TATAAGGCAT TTTTTGAAGA AGCATAA
|
Protein sequence | MKDVKVLKKE YENAWTKYEE ADLKQVFSLS DRYREFMSVA KTERECVKVL ANMAESKGFK NFYEVIKNGE KVTAGDKLYS INMDKTITLI KVGSEPLENG LRIIGAHIDS PRIDVKQNPL YEDSGLALLD THYYGGVKKY QWVTIPLAIH GVVVKKDGTR VDIKIGEDEN DPVLGISDLL IHLSADQLDK KGAKVVEGED LNILVGSMPL KGTEEKEAVK ANILVLLNEK YGITEEDFVS AELEVVPAGK ARDYGLDRSM ILAYGHDDRI CAYTSAEALM DLENVDKTCV ALLVDKEEIG SVGATGMQSR FFENIIAELM DRKGEYSELK LRRCLQNSMM LSADVTAAFD PNYPSVCEKK NTAYFGHGVV FSKYTGARGK AGCNDANAEY IAHLRNIMDK NGVVWQTGEL GKVDQGGGGT IAYILAQYNM EVIDCGVALQ NMHAPLEVAS KADLYETKKC YKAFFEEA
|
| |