Gene CPF_0147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0147 
Symbol 
ID4201966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp173666 
End bp175234 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content30% 
IMG OID638081028 
Productputative surface protein 
Protein accessionYP_694611 
Protein GI110800491 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.389776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA ATATATTAAA AAAGTTTAGT TTTATAATGG TGTTTATATT TGTTTTTATT 
ACAACATCAA TACCTGTATT TGCGGCTACG CCATCAATTT CTAAAGATGC TCCTATTAAG
GGAAGTATCA CAATATCAAA AAAAGGTGCT ACCTTTACTG CATATAAGTT ATTAGATGCA
ACTAAAAGTG GGGATGCATA TGAATACTCA GTAAATAGTG ATTTAAAAGA CTTCTTTAAT
AATTCTAATT ATGGTTCTTA TAGCCAAGAA TCAATTCAAA AATTAAGTGG AGAAGAAGTA
AAAGAGTTTG CTGTTAACTT ACATAAATAT ATTCTTGATA ACAAAAAGAG TGGACAAGAA
CTTACAGATG GACAAAAAAA TACTGTTGAC TTAGGTTATT ACCTAGTTAC AGAAACTTCA
AGTGATTCAG AGGGGGCAGC AGTTGCTTCA ACACCTATAA TAGTTTCAGT TCCTCAAGTT
TCAGGAGATT CATGGAATTA TGATGTAACT ATTAATCCAA AGGATAACAC TCCTATATTA
GAAAAAAACA TAGTTAAAGA GAATCAAAGA GTTAAAACTT CATCTGAGAA TATTGGAGAT
GTTGTTAAAT ACGAAGTAAA AGCTTCTATA CCAGTTTATC AAAAGAATGC ACAGAATATA
ATGTACAAAT TTACTGACAC TATGAGCAAG GGATTAACAT ATGATGAGAA AACTGGCTTT
AAGGTGACTT CAGGAGATAA AGTTTTTGAT AAGGACAATG ATTATACTGT AGATGTAAAA
AAGCAAAGAG ATGGATCAAC AGTTATTACA ATAAACTTTA ATTATGATAA TATAAAAGCT
TATGCAGAGA CTGGAATAAC TTTAAATTAC CAAGCTACAT TAAATAAGGA TGCAGTTATT
AGTAATAAAG AAAACTTAGG AAATACTAAC AACATACAAT TAGACTATAC AAATAACCCA
CATGTTAAGG ATAGTTATAA AAAATTAACT GATAAGGTTA CTACTTATAC TTTTGGATTT
GGAATTACAA AGGTTGATTC AGAGCTTAAT TCTAAGCTTT TACAAGGTGC TGAGTTTTCA
GTAAAAGATG CAGGTGGAAA GACTGTAGCT AAATATACAT ATGATGAAAA AGGACAAGTA
GTTTCTTTAA GTGGAAATGG GGTAACTAAC TCAAAGGGAA TTACAACATT TGTAGGCTTA
AAAGAAGGAA AGTACCTTAT TACAGAGGAA GTAGCTCCAT CTGGATATAG CTTATTAAAA
AATCCAGTAG AAGTGACAAT TACAGCTAAT AAAGATGAGT CTGGAAATTA CACAGGAGCT
GCAACTATAG AGATTTCTAA TGGAAATAAA GCAGGACAAA TAATAAATGA TATTTCAGAG
AATGATGGAA ATATATTATT TAATGTTCAA ATAGAAAACC ACGCTGGATT TTCTCTTCCA
TCAACAGGAG GATTAGGAAA TACAGGATTT ATAAAAATAG CCATTATCTT ATTAAGTATA
GTTTGTGTTT TATCTATACT AGGACTTGGC TACACTAAAT TTGAAAATAG CAGAAAAACC
AAGAACTAA
 
Protein sequence
MKNNILKKFS FIMVFIFVFI TTSIPVFAAT PSISKDAPIK GSITISKKGA TFTAYKLLDA 
TKSGDAYEYS VNSDLKDFFN NSNYGSYSQE SIQKLSGEEV KEFAVNLHKY ILDNKKSGQE
LTDGQKNTVD LGYYLVTETS SDSEGAAVAS TPIIVSVPQV SGDSWNYDVT INPKDNTPIL
EKNIVKENQR VKTSSENIGD VVKYEVKASI PVYQKNAQNI MYKFTDTMSK GLTYDEKTGF
KVTSGDKVFD KDNDYTVDVK KQRDGSTVIT INFNYDNIKA YAETGITLNY QATLNKDAVI
SNKENLGNTN NIQLDYTNNP HVKDSYKKLT DKVTTYTFGF GITKVDSELN SKLLQGAEFS
VKDAGGKTVA KYTYDEKGQV VSLSGNGVTN SKGITTFVGL KEGKYLITEE VAPSGYSLLK
NPVEVTITAN KDESGNYTGA ATIEISNGNK AGQIINDISE NDGNILFNVQ IENHAGFSLP
STGGLGNTGF IKIAIILLSI VCVLSILGLG YTKFENSRKT KN