Gene CPR_0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0029 
SymbolpepT 
ID4205636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp37264 
End bp38484 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content30% 
IMG OID642564572 
Productpeptidase T 
Protein accessionYP_697374 
Protein GI110803414 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01882] peptidase T 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAG TTCATGAAAG ATTTTTAGAA TATGTAAAAG TAGATACTAA ATCAGATGAA 
ACAACAAGGG TTACTCCTAG TACAAAAGGT CAATTAGAAT TAGGAAAAAT CCTTGCAGAG
GAATTAAAGA AAATCGGAGT AGATGAAGTA AGAATAAGTG ATAAAGGATA TGTATATGCT
TGTTTAAAGA GTAATTGTGA TAAGGATATT CCGAAAATAG GATTTATTTC ACATATGGAT
ACTGCACCAG ATATGAGTGG AAAAAATGTT AATCCTAAAA TTGTTGAAAA TTATGATGGT
AAAGATATTG AACTTGGAAA TGGATATACA TTATCACCAA GTTTTTCACC AGAACTTCCA
ATGTATAAAG GGCAAACTTT AATAACTACT GATGGAACTA CTCTTTTAGG TGCTGATGAT
AAGGCTGGGG TAGCAGAAAT AATAACAGCT ATTGAATATT TAATAAATCA TCCAGAAATA
AAACATGGTG ATATTAAAAT AGGATTTACT CCAGATGAAG AAATTGGAGA AGGCGCAGAT
CATTTTGATG TTGAAGGCTT TGGAGCAGAT TTTGCTTACA CACTAGATGG TGGAAGAATA
GGTGAATTAG AATATGAAAA CTTTAATGCT GCAAGTGCTA AGGTTGAAAT AATAGGTAAA
AATGTTCACC CAGGAAGTGC TAAAGGAAAA ATGATCAACT CTATTTTAGT TGCTCATGAA
TTTGTTTCTA TGCTTCCTTT AAATGAAGTT CCAGAAAAAA CAGAAGGATA TGAAGGTTTC
TCATTCTTAT TAGATATACA AGGTGAAGTA GAAAAAACTT CATTATCATT TATAATAAGA
GATTTTGATA AAGAAGGATT TAAAAATAGA AAAGAAAGAT TTAATGAAAT AGCTAATGAA
TTAAATAAAA AATATGGAGA AGGTACTGTT ACAGTAACTT TAAAAGATCA ATACATGAAC
ATGAAGGAAA TGATAGAACC TAGAATGCAT ATTGTAGAAA CTGCTGAAAA AGCAATGAAA
CAATGTGGAA TTGAGCCAAT CAAAAAGCCT ATAAGAGGAG GTACTGATGG GGCAAGATTA
TCATTTATGG GACTACCAAC ACCAAATATA TTTACTGGTG GAGAAAACTT CCATGGAAGA
TATGAATATA TATCAGTAAA TTCAATGGAA AAAGCTGTTG AAGTAATACT GAATATAATA
AAAATTTATG CTGAAAAATA A
 
Protein sequence
MKKVHERFLE YVKVDTKSDE TTRVTPSTKG QLELGKILAE ELKKIGVDEV RISDKGYVYA 
CLKSNCDKDI PKIGFISHMD TAPDMSGKNV NPKIVENYDG KDIELGNGYT LSPSFSPELP
MYKGQTLITT DGTTLLGADD KAGVAEIITA IEYLINHPEI KHGDIKIGFT PDEEIGEGAD
HFDVEGFGAD FAYTLDGGRI GELEYENFNA ASAKVEIIGK NVHPGSAKGK MINSILVAHE
FVSMLPLNEV PEKTEGYEGF SFLLDIQGEV EKTSLSFIIR DFDKEGFKNR KERFNEIANE
LNKKYGEGTV TVTLKDQYMN MKEMIEPRMH IVETAEKAMK QCGIEPIKKP IRGGTDGARL
SFMGLPTPNI FTGGENFHGR YEYISVNSME KAVEVILNII KIYAEK