Gene CPR_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1965 
Symbol 
ID4204513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2171017 
End bp2172252 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content31% 
IMG OID642566515 
Productaminopeptidase 
Protein accessionYP_699274 
Protein GI110802497 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAT TTGAAAGCAT GTTAGAAAAT TATGCAAAAT TAGCTACTCA TATAGGAGTA 
AATGTTCAAG AGGGTCAAAC CCTTGTTATC TCTTCTCCTG TAGAATGTGC AGAATTTACA
AGAATGCTTG TTAAATCTGC TTATGAAAAA GGAGCAAAGG ATGTTGTTGT TCAATGGAAT
GATGAAATAT GCGGCAAAAT CAAATATGAA CACTCTCCAT TAGAAGTTTT TGAAAACTTT
CCAGATTGGA TGAAAGAATC AAGATTAAGT TATGCTAAAG AAGGAGCTTG CTTCTTAAGT
ATTTCTGCCT CTGATCCTGA ACTTCTAAAA AATATAGACC CTGCAAAAAT AGCAGCCTTT
AGAAAATCAT CAAGTACAGC TTTAAAAGAA TTTAGTGAAA TGTTAATGAG CAATAAAAAT
TCATGGTCAA TAGTTTCTAT TCCAACTAAA GCTTGGGCTA AAAAAGTTTT CTCTGATTTA
CCTGAGAAAG AAGCAGTAGA TAAATTATGG AATGAAATCT TTAAAATAGT TAGAGTTGAT
ACAGAGAACC CTGTTGAAGC TTGGAATAAA CATAAAGAAA CTTTAAAATA CCATATGGAT
TATTTAAATG AAAAGAATTT AAAATCACTT CATTTTGAAA ATTCACTTGG AACTGATTTA
ACTATAGAAT TACCAGAAAA TCATCTTTGG GCTGGTGGAG CTGAATACAC TCAAGATGGA
GTTGAATTCA TAGCTAATAT GCCTACTGAG GAAGTATTCT CTATGCCTTC TAAATTTGGA
GTTAATGGAA CAGTATTTAG TTCTAAACCT TTAAACTACG GTGGAAATTT AATAGATAAT
TTCTCAGTTA CTTTTAAAGA TGGAAAAGTT GTTGATTTCT CAGCTAAAAA AGGATACGAC
ACTTTAAAAC ATCTTCTAGA TACTGATGAA GGTGCTAAAT ACTTAGGAGA AGTAGCTCTT
GTTCCTTATA ATTCTCCTAT ATCAAACTCA GGAATAATTT TCTTCAACAC TCTATATGAT
GAAAATGCTT CTTGTCATTT AGCTTTTGGT AAAGCATATT CTCTATGCAT AAAAAATGGT
GAAAATATGA CTAATGAAGA GCTTGAAAAA GCTGGAGCTA ATGATTCATT AACTCATGTA
GATTTTATGA TAGGAACTAA AGATTTAAAA ATTACAGGTT TAACTCATGA TAATGTTGAA
ATTCCAGTAT TTAAAGATGG TAACTGGGCA TTTTAA
 
Protein sequence
MNKFESMLEN YAKLATHIGV NVQEGQTLVI SSPVECAEFT RMLVKSAYEK GAKDVVVQWN 
DEICGKIKYE HSPLEVFENF PDWMKESRLS YAKEGACFLS ISASDPELLK NIDPAKIAAF
RKSSSTALKE FSEMLMSNKN SWSIVSIPTK AWAKKVFSDL PEKEAVDKLW NEIFKIVRVD
TENPVEAWNK HKETLKYHMD YLNEKNLKSL HFENSLGTDL TIELPENHLW AGGAEYTQDG
VEFIANMPTE EVFSMPSKFG VNGTVFSSKP LNYGGNLIDN FSVTFKDGKV VDFSAKKGYD
TLKHLLDTDE GAKYLGEVAL VPYNSPISNS GIIFFNTLYD ENASCHLAFG KAYSLCIKNG
ENMTNEELEK AGANDSLTHV DFMIGTKDLK ITGLTHDNVE IPVFKDGNWA F