Gene CPF_1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1146 
Symbol 
ID4203849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1309503 
End bp1310750 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content30% 
IMG OID638082027 
Producthypothetical protein 
Protein accessionYP_695592 
Protein GI110800007 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00151555 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAT TTTTAGAAAA AAGCTCAAAA ATCCAAGACC ATTTCACAGA CTGGAGAAAT 
ATTTACGCAA AACCTTATAA TAAAAACGAA GTTGATCCTT ATACAAAAAC TAGAATAATT
TTAATGAATG GAGCTGAATT TGAAGCAAAC TGGTTTTCTC ATCAATTCTC TAGAAACTGT
AATAACAATG AACTTAGAAG AGAACTTGCC CTTGCTAGAA GATTAGATAA ACAACAACAA
ATGCTAATTG GTTCATTAAG ACCTGCTAAT GAAAGTATTT TAGAGACTAC TATAAGCTAT
GAACAACTAG CTGTAGATTT AACTGCTAGA CTTGCAAAGC GTGAACCTAA TGAACATGTT
AAAAAAGCTT TAGATTTTGC ATTACTTGAA GATTTTGACC ATTTATATAG ATATTCAGAT
TTATTATTTA TGGAAGAAGG AACAAAAGCA GAAAATCTAG TTGGACATTA TACAGAAATA
ATGCCAGGTA GACCAACCAT ATCTGAACAT AGATGCCCCG CTGACAACAT AAGAAACTTT
GTTGATTTTA AAACAGCAGA CCTTATTACT AAACTAGATA TATCAATAAT AACTGCGGCA
GAACAACAAA CTATGAATTA TTACATGAAT ATAGCAGGTT TCTATACTAG TGATATTGGA
AGAAATCTTT ATCAAGAAAT AGGCTTAATA GAAGAACAAC ACGTTTCTCA CTATGGAAGT
CTTTTAGATC CTAACTGTAC ATGGCTTGAA AATCTACTTA TGCATAAATA CACTGAAGCA
TATTTGTATT ATTCTTGTTA TAATTCTGAA GTTGATCCAT ATATTAAAGG ACTATGGGAA
CAATGCTTCG TTCAGGAAGT TGCTCAATTA CATAAAGCTT GTGATCTTCT TAAAAAATAT
GAAAATAAAG AATGGCAAGA AGTTATTCCA AATGGTGAAT TCCCAGAACT TCTAACACTT
GGAGAAAATA TATCTTATGT TAGAGATATA TTAGATAATA CTGTTAATAA TACAACTATA
AAAGATGATT ACGTTGATGT AAGTAAATTA GGTCCTGATT CATCGTTCCA TGAATTCCAA
AATAAAGTTA ATAAAAATGT TGAAGATGTT CCAAGTCATA AGGTCATAGT TGATTTTATT
TCAAAAAATA ATGAAGATTA TAGATTTGAA ACAAAAGAAA ATCCAATTGT TGCTTTAAGA
GATAGAAAAT CTGATAATAC TTCTATTGGA AGAACATCTT TAAGTTAG
 
Protein sequence
MNPFLEKSSK IQDHFTDWRN IYAKPYNKNE VDPYTKTRII LMNGAEFEAN WFSHQFSRNC 
NNNELRRELA LARRLDKQQQ MLIGSLRPAN ESILETTISY EQLAVDLTAR LAKREPNEHV
KKALDFALLE DFDHLYRYSD LLFMEEGTKA ENLVGHYTEI MPGRPTISEH RCPADNIRNF
VDFKTADLIT KLDISIITAA EQQTMNYYMN IAGFYTSDIG RNLYQEIGLI EEQHVSHYGS
LLDPNCTWLE NLLMHKYTEA YLYYSCYNSE VDPYIKGLWE QCFVQEVAQL HKACDLLKKY
ENKEWQEVIP NGEFPELLTL GENISYVRDI LDNTVNNTTI KDDYVDVSKL GPDSSFHEFQ
NKVNKNVEDV PSHKVIVDFI SKNNEDYRFE TKENPIVALR DRKSDNTSIG RTSLS