Gene CPF_0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0029 
SymbolpepT 
ID4203802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp37194 
End bp38414 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content31% 
IMG OID638080904 
Productpeptidase T 
Protein accessionYP_694498 
Protein GI110800422 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01882] peptidase T 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAG TTCATGAAAG GTTTTTAGAA TATGTAAAAG TAGATACTAA ATCAGATGAG 
ACAACAAGGG TTACTCCTAG TACAAAAGGT CAATTAGAAT TAGGAAAAAT CCTTGCAGAG
GAATTAAAGG AAATCGGAGT AGATGAAGTA AGAATAAGTG ATAAAGGATA TGTATATGCT
TGTTTAAAGA GTAATTGTGA TAAGGATATT CCGAAAATAG GATTTATTTC ACATATGGAT
ACTGCACCAG ATATGAGTGG AAAAAATGTT AATCCTAAAA TTGTTGAAAA TTATGATGGT
AAAGATATTG AACTTGGAAA TGGATATACA TTATCACCAA GTTTTTCACC AGAACTTCCA
ATGTATAAAG GTCAAACTTT AATAACTACT GATGGAACTA CTCTTTTAGG CGCTGATGAT
AAGGCAGGGG TAGCAGAAAT AGTAACAGCT ATTGAATATT TAATAAATAA TCCAGAAATA
AAACATGGTG ATATTAAAAT AGGATTTACT CCAGATGAAG AAATTGGAGA AGGAGCAGAT
CACTTTGATG TTGAAGGCTT TGGAGCAGAT TTTGCTTACA CATTAGATGG TGGAAGAATA
GGTGAATTAG AATATGAAAA CTTTAATGCT GCAAGTGCTA AGGTTGAAAT AATAGGTAAA
AATGTTCACC CAGGAAGTGC TAAAGGAAAA ATGATTAACT CTATTTTAGT TGCTCATGAA
TTTGTTTCTA TGCTTCCTTT AGATGAAGTT CCAGAAAAAA CAGAAGGATA TGAAGGTTTC
TCATTCTTAT TAGATATACA AGGTGAAGTA GAAAAAACTT CATTATCATT TATAATAAGA
GATTTTGATA AAGAAGGCTT TAAAAATAGA AAAGAAAGAT TTAATGAAAT AGCTAAAGAG
TTAAATAAAA AGTATGGAGA AGGTACTGTT ACAGTAACTT TAAAAGACCA ATACATGAAC
ATGAAGGAAA TGATAGAACC TAGAATGCAT ATTGTAGAAA CTGCTGAAAA AGCAATGAAA
CAATGTGGAA TTGAGCCAAT CAAAAATCCT ATAAGAGGAG GTACTGATGG GGCAAGATTA
TCATTTATGG GACTACCAAC ACCAAATCTA TTTACTGGCG GAGAAAACTT CCATGGAAGA
TATGAATATA TATCAATAAA TTCAATGGAA AAAGCTGTTG AAGTAATACT AAACATAATA
AAAATTTATG CTGAAAAATA A
 
Protein sequence
MKKVHERFLE YVKVDTKSDE TTRVTPSTKG QLELGKILAE ELKEIGVDEV RISDKGYVYA 
CLKSNCDKDI PKIGFISHMD TAPDMSGKNV NPKIVENYDG KDIELGNGYT LSPSFSPELP
MYKGQTLITT DGTTLLGADD KAGVAEIVTA IEYLINNPEI KHGDIKIGFT PDEEIGEGAD
HFDVEGFGAD FAYTLDGGRI GELEYENFNA ASAKVEIIGK NVHPGSAKGK MINSILVAHE
FVSMLPLDEV PEKTEGYEGF SFLLDIQGEV EKTSLSFIIR DFDKEGFKNR KERFNEIAKE
LNKKYGEGTV TVTLKDQYMN MKEMIEPRMH IVETAEKAMK QCGIEPIKNP IRGGTDGARL
SFMGLPTPNL FTGGENFHGR YEYISINSME KAVEVILNII KIYAEK