Gene CPF_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2068 
Symbol 
ID4201670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2306987 
End bp2308147 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content29% 
IMG OID638082933 
Productstage IV sporulation protein B 
Protein accessionYP_696497 
Protein GI110800995 
COG category 
COG ID 
TIGRFAM ID[TIGR02860] stage IV sporulation protein B 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA TGAACAAAAG AATAAAGATT ATATCAATAA TAATGATGTC TTTAATATTA 
CTTTTATCCT CTGTAACATT TGCAAGGGAT TATTGTGAGA GTAATAATGT TTTTGCAAGT
AGCAATTTCT ATTCTTTAAA TTCTAAGAGT AGTAATGATG AAAAGTTTAA GAACAGATAT
GGCGTAGCTC TTGTAAATAG TGAACAGGAA AAGAAAGATA TAGAGCTTTA TGCTGGAGGA
AATTCAGTAG GTGTAAGGGT TTCAACAGAT GGTGTATTAG CAGTAGGTTA TTCAGACTTA
ACAACAGGAG AAGGAGAAGT AGAGAGTCCA GCTCAAAATG GTGGAATACA AATTGGTGAT
AGACTTATAA GTGTAAATGG AAATAAAATA AAAAATTCAA AAGATTTATC AAAAAAAATC
AACGAGAGTA AATCAGAAAA TGTTGAAATA TTAATTGAGA GAAATGGTGA AGAAATAACT
AAAAATATAA ATTTATCAAA AAATGCAGAT GGTGATTATA AAATAGGTCT TTGGGTAAGA
GATTCTACTG CTGGTGTAGG TACACTTACT TTCTATGATA AAGAAAGTGG AAAATATGGA
GCAATAGGTC ATCCAATAAC AGATAGTGAA ACAGAAAAAA TTCTTTCAAT AAAAAATGGA
GATCTTTTAA ATTCTTCAAT AATAAGCATA AAAAAAGGTG TTAAAGGTAA TCCAGGAGAA
TTAAGAGGAA TTTTTTCAAG TGATAAGAAA CCAATAGGAA ATGTTACAGG AAATACACAA
TGTGGAATAT TTGGTAGCAT GAATACAGAA AATTTAAAAA ATATTAATAA TAAAACTTAT
AAAGTTGGTT GGAGAGATGA AATTCAGCCA GGACCAGCAC AAATTATAAC TACTATTGAT
GAAGAAGGTC CTAAGCTTTA TGATATTGAA ATTGTAAAAC TTGCAAAGCA AGATAGCATT
AGTACAAAGA GTATGGTAAT TAAGATTACA GATGAAAGAT TATTAGAAAA AACTGGTGGT
GTTGTCCAAG GAATGAGTGG AAGTCCAATT ATACAAAATG ATAAAATAAT TGGTGCTGTG
ACACATGTTT TGGTTAATAA ACCTGAAGTA GGATATGGAA TTTATATAGA GTGGATGTTA
AAAGATGCAA AAATTATATA A
 
Protein sequence
MKNMNKRIKI ISIIMMSLIL LLSSVTFARD YCESNNVFAS SNFYSLNSKS SNDEKFKNRY 
GVALVNSEQE KKDIELYAGG NSVGVRVSTD GVLAVGYSDL TTGEGEVESP AQNGGIQIGD
RLISVNGNKI KNSKDLSKKI NESKSENVEI LIERNGEEIT KNINLSKNAD GDYKIGLWVR
DSTAGVGTLT FYDKESGKYG AIGHPITDSE TEKILSIKNG DLLNSSIISI KKGVKGNPGE
LRGIFSSDKK PIGNVTGNTQ CGIFGSMNTE NLKNINNKTY KVGWRDEIQP GPAQIITTID
EEGPKLYDIE IVKLAKQDSI STKSMVIKIT DERLLEKTGG VVQGMSGSPI IQNDKIIGAV
THVLVNKPEV GYGIYIEWML KDAKII