Gene CPF_2854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2854 
SymbolpepP 
ID4201533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp3120233 
End bp3121477 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content29% 
IMG OID638083721 
Productxaa-pro aminopeptidase 
Protein accessionYP_697218 
Protein GI110799449 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGTT TAGTTTTTAC TAAAAACAGA GAGAATTTAT TAAAGAAACT AGAAGACAAT 
TCATTATTAG TTTTATTTGC AGGAGAGGCT AAAAGAAAAA CAGCAGATGA ATATTTTCCA
TTTACTCCAA ACAGAAACTT TTATTATTTA ACAGGAGTAG ATGAAGAAAA GCATATACTA
ATGATAAAGA AAATAAATGG TGTGGTTGAT GAAGTCCTTT ATATACTAAA GCCAAATTTA
GAGCAAGAAA GATGGACTGG AAAAACTATA AGAGATTATG AGGCTAAAGA AGTATCTGGC
ATAGAAAATA TAAAATATTT AGAAGAATTT AAAAGTGATT TAAATATGAT TTTTACTAAT
GGAATTGCAG AAAATCTTTA TTTAGATTTA GAAAGAGTTT CATTTGATGA AGAAATGAGT
AAAAGTCAAA GTTTTGCAAA GGAAATTAAG GAGAGATATC CTCAAGTAGT TATAAAAGAT
GTTTATTCTG ATATAGCTTC TTTAAGACAA ATTAAATGTA AAGAAGAAGT AGAAGAAATA
AAGAAGGCTG CTCACATAAC AGCTAAGGGT GTAGAACTTT TAATGAAAGA ATGTAAGCCT
GGAATGAAAG AATATGAATT AGAAGCATAT TTTGACTTTT ATTTAAAACA AAATGGAGTT
AAAGATTATG CTTTTAAAAC TATAGCAGCT GCTGGAGTAA ATGCTGCTAC TTTACACTAT
GTTGATAATA ATAGTGAAAT AAAAGATGGA GACTTAATTC TTTTTGATTT AGGTGCTCAA
GTAAATTATT ATAATGGAGA TATTTCAAGA ACATTCCCTG CTAATGGTAA GTTTACTAAG
AGACAAAAAG AGGTTTATGA AGAAGTTTTA AAAGTAAATG AAGAGATAAT AAACGCTATT
AGACCAGGGG TTGGATTCTA TGAAATAAAT GACAAAGCAA ATAATCTTTT AGCTGAAGCT
TGTGTAAGAT TAGGTCTTAT AGAAGACAAA AAGGATTATA GAAAGTATTA TTTCCACTCA
ATAGGACATA GTTTAGGTCT TGACACTCAT GATGTTGGTA AGAGAGATAT CATTCTTGAA
GAAGGTATGG TTTATACTGT AGAGCCAGGA TTATATATTG AAGAAGAAGC TATAGGAATA
AGAATAGAGG ATGATGTTTT AGTTACTAAA GATGGCTGTG AAGTTCTAAC AAAAGAATGC
ATCAAGTCTG TAGAAGATAT AGAAAAGTTC ATGAGTAATA GATAA
 
Protein sequence
MKSLVFTKNR ENLLKKLEDN SLLVLFAGEA KRKTADEYFP FTPNRNFYYL TGVDEEKHIL 
MIKKINGVVD EVLYILKPNL EQERWTGKTI RDYEAKEVSG IENIKYLEEF KSDLNMIFTN
GIAENLYLDL ERVSFDEEMS KSQSFAKEIK ERYPQVVIKD VYSDIASLRQ IKCKEEVEEI
KKAAHITAKG VELLMKECKP GMKEYELEAY FDFYLKQNGV KDYAFKTIAA AGVNAATLHY
VDNNSEIKDG DLILFDLGAQ VNYYNGDISR TFPANGKFTK RQKEVYEEVL KVNEEIINAI
RPGVGFYEIN DKANNLLAEA CVRLGLIEDK KDYRKYYFHS IGHSLGLDTH DVGKRDIILE
EGMVYTVEPG LYIEEEAIGI RIEDDVLVTK DGCEVLTKEC IKSVEDIEKF MSNR