Gene CPF_2368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2368 
Symbol 
ID4202078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2635900 
End bp2637654 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content31% 
IMG OID638083233 
Productsubtilase family protein 
Protein accessionYP_696791 
Protein GI110799254 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTTA AGTCTTCAGC TCCAGAGAAT GTATTTAATA ATGAAAATTA TTTAAATTAT 
TTAGTTCAGT ATCAAGGTGA TATTATAGGA GAGTTTAGTG GTGAAAATGG AATATATGCA
ACCTTAATTA ATAATAGATA TGCCATAATA ACTATTGATA AAAACTTACA GGAATATGAA
AATGATATAT CTATGATTAA ATATAAAAAA GAAAATGGAA GAGATGTAGA AATAGATAGC
ATTACTTATA TAAAAAATCC AGAAGGATAT GTACTTCAGG AAATAAGCCC TTTAGAGGCA
GCAAATGTTG AATATGTTCA AATACAATCA TACTTTAATC TAACAGGTAG AGGGGTTATT
GTTGGAATTT TAGATACTGG AATAGATTAT TTAAACGAAG AGTTTATGGA TTCAGATGGT
AATACAAGAA TTTTAGGAAT TTGGGATCAA ACAATTTCAA GTGAGAGTTC AAGTAATGAT
AATTTACCTT ATGGAACTTT TTATTCAGAA GAAGATATAA ATAGAGCAAT AAGACTTAGT
AGAGAAGGAG GAGATCCATA TACAATAGTT CCATCAAAAG ATGAAGTGGG TCATGGAACA
TCAATGGCAG GAATAATTGG ATCATCAGGA AAAAATCCTA GGTTAAAGGG GGTTGCACCA
GATTGTAAGT TCTTAGTTGT AAAGCTTGCA CAGTCACTTT ATTATAAAAA AGAGTATGAA
ATAAACATAC CTATTTATAA TATAACTGAG ATATTTACAG GAATACAATA TTTATATTCA
TATTTTTTAA AGGGATCTCA AAGTATGATT ATATATTTAC CTTTGGGAAC TAATAGAGGA
AGTCATAAAG GAACAAGTAT GTTAGAAGAA TTTTTAGATT CTATATTGAT AAATAGAGGA
ATTGCTTTAG TTACCGGTGC TGGAAATGAA GGAACTGCAT TATTACACGG ATCAGGAACT
ATAAAACCTG ATGGTCAAGT AACTACCCAT GAATTTAATA TAGATGAAAA TCAAAAAAAG
ATTATTATTG AAGTTTGGGT ACAAATACCT AGTATTGCTT CCATAGAAAT AGTTTCACCT
ACTGGAGGAA CAACAGGAAT AATTCAACCT TTTTTTGGAA AAGGGGATAG GTATGATTTT
ACAATTGAGC GTACTACAGT GTTAGTAAGT TATTATGTAC CTGAAGAAAT ATATGAAGAT
TCTCTCATAT TAGTTATACT AGACAATGTT CAAGCTGGAA CATGGAGTTT TAAATTTAGG
GGGTTAAAGG AAATAGAGGG AAGATATGAT ATTTGGTTAC CTCCAAGGGG AGTAAGTAAA
ACTGCAACTA AGATGATACC TTCTGACCCA TATGGAACAG TAACTGTCCC AGGTACTAGT
TCTTCTGTAA TAACAGTAGC TGCATATAAT CAATTAAATA ATACACAGTT AAATTATTCA
GGGAGAGGAT TTCAGGATAA TTATATTGAT ATAATAGATG TGGCTGCAGG AGGAGTAGAT
GCACTTACTG TTGCACCAGA TAATAAAACA ACTTTAGCAA ATGGTACAAG TGTTGCAGCG
GCTATAGTAG CTGGGATTTG TGTTTTACTT TTTCAATGGG GAATTGTTGA AGGAAATTAT
CCATATATGT TCTCTCAAAC TTTAAAAGCT TTTATTGCAA GAGGTACAAG AAAGAGAAAA
GGGGATACTT ATCCAAATCC AGAGTGGGGA TATGGAATAG TTGATATATT TAATATGTTT
AATCTTACTA ATTAA
 
Protein sequence
MEFKSSAPEN VFNNENYLNY LVQYQGDIIG EFSGENGIYA TLINNRYAII TIDKNLQEYE 
NDISMIKYKK ENGRDVEIDS ITYIKNPEGY VLQEISPLEA ANVEYVQIQS YFNLTGRGVI
VGILDTGIDY LNEEFMDSDG NTRILGIWDQ TISSESSSND NLPYGTFYSE EDINRAIRLS
REGGDPYTIV PSKDEVGHGT SMAGIIGSSG KNPRLKGVAP DCKFLVVKLA QSLYYKKEYE
INIPIYNITE IFTGIQYLYS YFLKGSQSMI IYLPLGTNRG SHKGTSMLEE FLDSILINRG
IALVTGAGNE GTALLHGSGT IKPDGQVTTH EFNIDENQKK IIIEVWVQIP SIASIEIVSP
TGGTTGIIQP FFGKGDRYDF TIERTTVLVS YYVPEEIYED SLILVILDNV QAGTWSFKFR
GLKEIEGRYD IWLPPRGVSK TATKMIPSDP YGTVTVPGTS SSVITVAAYN QLNNTQLNYS
GRGFQDNYID IIDVAAGGVD ALTVAPDNKT TLANGTSVAA AIVAGICVLL FQWGIVEGNY
PYMFSQTLKA FIARGTRKRK GDTYPNPEWG YGIVDIFNMF NLTN