Gene CPF_0108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0108 
Symbol 
ID4201619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp128962 
End bp130527 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content31% 
IMG OID638080989 
Producthypothetical protein 
Protein accessionYP_694572 
Protein GI110800767 
COG category[S] Function unknown 
COG ID[COG4086] Predicted secreted protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0689797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA ATAAATTAAT AACAGCAATG ATATTAGCTG GAGCAATAAG TATTGGCTCA 
TTTACAACAG TTTTTGCCGA TACTAAGGAA GTAGTAACTT TAGGAGCTAA TCTAAATTCA
AGTCAAAAAC AAGAGATGTT TAAGGAATTC GGAGTTAAAC CTAATGATGT TAAGGTTATA
ACTATGAATG TTAATGAAAT AAGAGAGCAA CTTGGATTGC CTAAAATAGT AGGAGAATTT
AAAGGCAATG CTTACTCAAG TGCTTTTGTT AAACTAGAGG AAAAAGGGTA TGGAATAAAG
GTAAAAACAA ATAACTTAAC AGAGGTTACT AAAACTATGT TAAGTAATGC CTTGTTAACA
TCAGGGGTTA CAGATGCTGA TGTTATTGCT ACAGCGCCAT TCCCTGTAAC TGGGACATCT
GCTTTGGCAG GAGTTCTTCA AGCTTTTGAA AAGGCCACTG GAGAAAATAT ACCTGTTGAA
AATAAGGAAG TAGCAAGACA AGAGTTAAGT ATAACAAATA ATTTAGCTAA GGCTAAAAAT
AGTGAAGGAC AAGATATTGG AAGAGATGGA GCAAGTGCTA TAGTAAATCA AGCTAAAGAA
GAAGTTATAA AGGATAAACC TAAAAATGAC AAAGAGGTTG GTGAAATAGT AAATAATATT
ACAAATAACT ATAATATTAA GTTAACTCCA ACACAAGAGC AAGAATTAGT TGCTCTTTTA
GCCAATATAA ATTCTTTAGG ATTAGATTAT TCTAAATTAA AAGGAGAACT TGATAGCCTT
TCTAATAATA TACAGGAAGC TTTAAAAGAA AATGGACAAG AACTTAAAGA AAGTGGAACT
TTAGATAGAA TCTTAAATAA AATTCTAGGA GTTTGTACTG ATATAAAAAA TTGGTTTGTA
GCACACTTTG GAGATGGAGA AGTTACTATA AATGGAGTTA CATATGATAA AGATGGTAAT
ATGATAAACA AGGATCAATT AAACAATATA GGAACTTCAA TAAGTGATGA GGACAATGAC
ACTAGTAATA ATGAAAGTAA AAATAATATA GAAGATAAAT CACAAAAGAG TGATTCTAAG
GAGAAAGAAA CTGATAAGGC TAATAGTAAT AATCAAGAGG ATTCTAATTC TCAAGGAGAA
CAAGAAAGTA ATGGCAATGG TTCTTTAAAT AATGATACCA AAAGTAATAA AGAATCTAAA
AATAAAGATG CACAGGGAGC TAAAGAGTCA AAAAATAGTA AACCTAATAA AAATAGTGTA
AGTTCATCAA AGAATAATTC AAATAAAAAT AATAGTGGAA GCAGTAATAA TTCAAAGGAT
AAAAGTAAGG AAACTATAAA GCTTGAGGAT GGTAATGTAA TTCCTAAGTA TAATTCAAAG
GGTGAAGAAT ATAATCCTGT TACTGGTGGA TATGGACATA TGCAAGAAAC TGAAGACGCT
GGAAAGGATT CTGATCAAAC TATAGATTAT ATAATAGTTG ATGGGAAGAA GGTTCCACTT
CATGATGAAC AGGGAAGAGA GTATAATCCT GAGACAGGAA GATATGGAGA TCCTGATGAT
AATTAA
 
Protein sequence
MKKNKLITAM ILAGAISIGS FTTVFADTKE VVTLGANLNS SQKQEMFKEF GVKPNDVKVI 
TMNVNEIREQ LGLPKIVGEF KGNAYSSAFV KLEEKGYGIK VKTNNLTEVT KTMLSNALLT
SGVTDADVIA TAPFPVTGTS ALAGVLQAFE KATGENIPVE NKEVARQELS ITNNLAKAKN
SEGQDIGRDG ASAIVNQAKE EVIKDKPKND KEVGEIVNNI TNNYNIKLTP TQEQELVALL
ANINSLGLDY SKLKGELDSL SNNIQEALKE NGQELKESGT LDRILNKILG VCTDIKNWFV
AHFGDGEVTI NGVTYDKDGN MINKDQLNNI GTSISDEDND TSNNESKNNI EDKSQKSDSK
EKETDKANSN NQEDSNSQGE QESNGNGSLN NDTKSNKESK NKDAQGAKES KNSKPNKNSV
SSSKNNSNKN NSGSSNNSKD KSKETIKLED GNVIPKYNSK GEEYNPVTGG YGHMQETEDA
GKDSDQTIDY IIVDGKKVPL HDEQGREYNP ETGRYGDPDD N