Gene CPR_2539 details

Gene Information

Locus tag: CPR_2539
Symbol: pepP
ID: 4204058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2761707
End bp: 2762951
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 28%
IMG OID: 642567089
Product: xaa-pro aminopeptidase
Protein accession: YP_699786
Protein GI: 110801641
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
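
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2761707
end_bp = 2762951

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1245 414
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.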


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGTT TAGTTTTTAC TAAAAACAGA GAGAATTTAT TAAAGAAACT AGAAGACAAT 
TCATTATTAG TTTTATTTGC AGGAGAGGCT AAAAGAAAAA CAGCAGATGA ATATTTTCCA
TTTACTCCAA ACAGAAATTT TTATTATTTA ACAGGAGTGG ATGAAGAAAA GCATATACTA
ATGATAAAGA AAATAAATGG TGTGGTTGAT GAAGTCCTTT ATATACTAAA ACCAAATTTA
GAGCAAGAAA GATGGACTGG AAAGACTATA AGAGATTATG AAGCTAAAGA AGTATCTGGC
ATAGAAAATA TAAAATATTT AGAAGAGTTT AAAAGTGATT TAAATATGAT TTTTACTAAT
GGAATTGCAG AAAATCTTTA TTTAGATTTA GAAAGAGTTT CATTTGATGA AGAAATGAGT
AAAAGTCAAA GTTTTGCAAA GGAAATTAAG GAGAGATATC CTCAAGTAGT TATAAAAGAT
GTTTATTCTG ATATAGCTTC CTTAAGACAA ATTAAATGTA AAGAAGAAGT AGAAGAAATA
AAGAAGGCTG CTCACATAAC AGCTAAAGGT GTAGAACTTT TAATGAAAGA ATGTAAGCCT
GGAATGAAAG AATATGAATT AGAAGCATAT TTTGATTTTT ATTTAAAACA AAATGGAGTT
AAAGATTATG CTTTTAAAAC TATAGCAGCT GCTGGCGTAA ATGCTGCTAC TTTACATTAC
GTTGATAATA ATAGTGAAAT AAAAGATGGA GACTTAATTC TTTTTGATTT AGGGGCTCAA
GTAAATTATT ATAATGGAGA TATTTCAAGA ACATTCCCTG CTAATGGTAA GTTTACTAAG
AGACAAAAAG AGGTTTATGA AGAAGTTTTA AAAGTAAATG AAGAGATAAT AAACTCTATT
AGACCAGGGG TTGGATTCTA TGAAATAAAT GACAAAGCAA ATAATCTTTT AGCTGAAGCT
TGTGTAAGAT TAGGTCTTAT AGAGGACAAA AAAGATTATA GAAAGTATTA TTTCCACTCA
ATAGGACATA GTTTAGGTCT TGACACTCAT GATGTTGGTA AGAGAGATAT CATTCTTGAA
GAAGGTATGG TTTATACTGT AGAGCCAGGA TTATATATTG AAGAAGAAGC TATAGGAATA
AGAATAGAGG ACGATGTTTT AGTTACTAAA GATGGATGTG AAGTTTTAAC AAAAGAATGC
ATTAAGTCTG TAGAAGATAT AGAAAAGTTC ATGAGTAATA GATAA
 
Protein sequence
MKSLVFTKNR ENLLKKLEDN SLLVLFAGEA KRKTADEYFP FTPNRNFYYL TGVDEEKHIL 
MIKKINGVVD EVLYILKPNL EQERWTGKTI RDYEAKEVSG IENIKYLEEF KSDLNMIFTN
GIAENLYLDL ERVSFDEEMS KSQSFAKEIK ERYPQVVIKD VYSDIASLRQ IKCKEEVEEI
KKAAHITAKG VELLMKECKP GMKEYELEAY FDFYLKQNGV KDYAFKTIAA AGVNAATLHY
VDNNSEIKDG DLILFDLGAQ VNYYNGDISR TFPANGKFTK RQKEVYEEVL KVNEEIINSI
RPGVGFYEIN DKANNLLAEA CVRLGLIEDK KDYRKYYFHS IGHSLGLDTH DVGKRDIILE
EGMVYTVEPG LYIEEEAIGI RIEDDVLVTK DGCEVLTKEC IKSVEDIEKF MSNR
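
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the listed gene sequence is given 5' to 3' on the coding strand and uses the codon assignments of NCBI translation table 11, which match the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; this gene starts with ATG, so that does not matter here. The names `CODON_TABLE` and `translate` are illustrative and do not come from any particular library.

```python
from itertools import product

# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence, copied from above.
first_block = "ATGAAAAGTT TAGTTTTTAC TAAAAACAGA"
last_block = "ATGAGTAATA GATAA"

print(translate(first_block))  # MKSLVFTKNR
print(translate(last_block))   # MSNR
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the whole protein sequence.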