Gene CPR_1410 details


Gene Information

Locus tag: CPR_1410
Symbol:
ID: 4205511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1586942
End bp: 1587919
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 34%
IMG OID: 642565964
Product: T4 family peptidase
Protein accession: YP_698729
Protein GI: 110802634
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3191] L-aminopeptidase/D-esterase
TIGRFAM ID:
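
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with one untranslated stop codon. The record does not state either assumption, but both are consistent with its numbers. The variable names are illustrative.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 1586942
end_bp = 1587919
gene_length_bp = 978
protein_length_aa = 325

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)
print(expected_protein_length == protein_length_aa)
```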


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.568813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGAAA TAAAGATTAC AGATATAGAT GGTTTTAAAC TAGGCCACGC TCAAGATTTT 
GAAGGTGCTA CAGGATGTAC AGTATTACTA TGTGAAGAAG GTGCTTCTGG AGGAGTTGAT
GTTCGCGGTG GTGCTCCTGG AACTAGAGAA ACTGATTTAT TAAATCCTAT GGAAATGGTT
GATAAAGTTC ATGCTGTAGT ATTATCTGGT GGATCTGCCT TTGGACTTGA TTCCTGCTCA
GGGGTCATGG AATATCTAGA AAATAAAAAT GTTGGATTTG ACGTAGGGGT AACTAAAGTT
CCTATAGTAT GTGGTGCTGT TTTATTTGAT TTAGCCTGTG GTAATCCTAA AGTAAGACCT
AATAAGGAAA TGGGCTTAGA AGCTTGTAAA AATTCTGAAA CCTACTTTGA CTCAAAAAAC
GGTAATATAG GTTGTGGAAC AGGTGCCACA GTAGGTAAAG CCTTAAATCA AAAACTTGCT
ATGAAAGGCG GTTTTGGAAG CTATGCAGTT CAAATCGGAG ATTTAAAGGT AGGAGCTATT
GTAGGAGTTA ATAGCCTAGG TGATATTGTT GACCCTAATG ATAACAATAA AATAATAGCG
GGTGGATTAA GCCAAGATAT GAATTCCTTT ATGAACATAG AGAAAAGCTT ATTAGCTAAT
TATTCTAATC CTAAAAATGT TTTTAAAGGA AATACTACTA TTGGGTGCAT AGTGACTAAT
GGTGATTTTA ATAAAGCTGA GGCTAATAAA ATTGCATCTA TGGCTCAAAA TGGTTTTGGA
AGAACCATTC GCCCTGCTCA CACTATGTTT GATGGTGATA CAATATTTAC TTTATCTTCA
AATAAAGTTA AGGCAGATAT AAATGTAGTT GGTCTTTTAG CTGCTCAAGT TATGGAAAAA
GCTATTATAA AAGCTGTTAA AGAAGCTGAT TCTTCATATG GATTCTTATC ACATAAAGAT
TTAAAATTTA ATGTATAA
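
The GC content reported in the gene information can be recomputed from the gene sequence. The sketch below assumes GC content is the fraction of G and C bases over the total length, expressed as a percentage. How the database rounds the figure is not stated in the record. The function name `gc_content` is hypothetical, and only the first row of the sequence is used as sample input.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied as displayed (with spaces).
first_row = "ATGTTTGAAA TAAAGATTAC AGATATAGAT GGTTTTAAAC TAGGCCACGC TCAAGATTTT"
print(f"{gc_content(first_row):.1f}%")
```

Passing the full 978 bp sequence instead of the first row gives a value that can be compared with the reported figure.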
 
Protein sequence
MFEIKITDID GFKLGHAQDF EGATGCTVLL CEEGASGGVD VRGGAPGTRE TDLLNPMEMV 
DKVHAVVLSG GSAFGLDSCS GVMEYLENKN VGFDVGVTKV PIVCGAVLFD LACGNPKVRP
NKEMGLEACK NSETYFDSKN GNIGCGTGAT VGKALNQKLA MKGGFGSYAV QIGDLKVGAI
VGVNSLGDIV DPNDNNKIIA GGLSQDMNSF MNIEKSLLAN YSNPKNVFKG NTTIGCIVTN
GDFNKAEANK IASMAQNGFG RTIRPAHTMF DGDTIFTLSS NKVKADINVV GLLAAQVMEK
AIIKAVKEAD SSYGFLSHKD LKFNV
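
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds a codon table from the usual TCAG ordering. For internal codons, translation table 11 assigns the same amino acids as the standard code; it differs in the set of permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last fragments of the gene are used as sample input.

```python
from itertools import product

# Amino acids for all 64 codons, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTTTGAAA TAAAGATTAC AGATATAGAT"))  # MFEIKITDID
# Last row of the gene sequence, ending in the TAA stop codon.
print(translate("TTAAAATTTA ATGTATAA"))  # LKFNV
```

Both fragments match the corresponding ends of the protein sequence shown above.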