Gene CPF_1407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1407 
Symbol 
ID4203294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1581901 
End bp1583889 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content33% 
IMG OID638082287 
Producthypothetical protein 
Protein accessionYP_695852 
Protein GI110801217 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00374356 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAA ATAAGTTTTT GCCAATTTGC AAAGATGATA TGATAGAAAG AGGATGGGAA 
CAATGTGACT TTGTACTAGT TACAGCAGAT GCATACATAG ACCATCATAG TTTTGGTACA
GCAATTATAT CTAGGGTTTT AGAGAATGCT GGATATAAGG TTGGAATAAT AGCTCAACCA
GATTGGAAGA GCGTTGATGA TTTTAAAAAA TTAGGTAGAC CAAGATTAGG ATTCTTAGTT
AATGGTGGTA ACATGGATCC TATGGTTAAT CACTATACAG TAAGCAAAAA GTTAAGAAAG
AAAGATCTAT ATACTCCTAA GGGGGAAATG GGTAAGAGAC CTGATAGAGC TACAATAGTT
TATTGTAATA AAATAAGAGA AGCTTATAAG GATGTTAACA TAGTAATTGG TGGAATTGAA
GCTAGTTTAA GAAGATTTGC TCATTATGAT TACTGGGATA ACAAAGTGAG AAAATCAATC
TTAGTTGATA GTGGAGCGGA CCTTTTAGTA TATGGAATGA GTGAAAAGCA AATCGTTGAA
GTGGCTGATT TCTTAAATCA AGGATTTGAT GGAAAGTACA TAAGACATAT ACCAGGAACA
TGTTACATAG CCGATAGTTT AGATGAAATC TATGAGGAGC ATATAGTTCT GCCATCATTT
AAAGAAGTTT CAAGTGATAA GAGAACTTAT GCAGAATGCT TTAAAATTCA ATATGATGAG
CAAGATCCTG TAAGAGGAAG AACTTTAGTT CAAGAACATA ATGGAAAATA TGTTGTTATA
AATAAACCAG AAATGCCTCT TTCAAGGGAA GAATTAGATA GAGTATATGC TCTTCCATAT
CAAAAAACTT ACCATCCTAT TTATGAGAAA GATGGTGGTA TAGCTGCTAT TGAAGAGGTT
AAGTTTAGTA TAGTAAGTTC AAGGGGATGC TCAGGAAACT GTTCATTCTG TGCAATAACC
TTCCATCAAG GAAGAATTGT AACTAGTAGA AGTGAAGATT CTATAGTAGA AGAAGCTGAA
GAAATAACTA AATATGATGA TTTTAAAGGA TATATACACG ATATAGGGGG ACCTACAGCT
AACTTTAGAA AGCCAGCATG TAAGAAGCAA CTAACTTTAG GAGCTTGTAA ACATAAAAGA
TGTATGTCAC CAGGTATATG CAAGAATATG GAGGTAGATC ATAGAGAATA CCTTCATTTA
TTAAGAAGAG TAAGAAAATT ACCAGGAATT AAAAAGGTAT TTATACGTTC AGGACTAAGA
TATGATTATA TAATGGCAGA TAAGGATGAT ACTTTCTTTA AGGAATTAGT TGAGCATCAT
GTAAGTGGTC AATTAAAAGT TGCACCAGAG CATGTATCTC CAAATGTTTT AAAATACATG
GGTAAACCAG CAGGAAAAAC TTATGATGAG TTTAGAAGAA AATTCTTTAG AATCACAGAA
AGATTAGGAA AGAAACAATT CATCATTCCT TATTTAATGT CAAGTCATCC AGGATGCAAG
TTAGAAGATG CAATTATGCT TGCTGAATAT TTAAGAGATA TAAATTATCA ACCAGAGCAG
GTACAAGATT TCTATCCAAC ACCAGGAACA TTATCAACTA CAATGTTCTA TACTGGATTA
GATCCTTTAA CAATGGAAGA AGTTTATATT CCTAGAAGTA AAGAAGAAAA AGCAATGCAA
AGGGCTTTAT TACAATTTAA AAATCCAAAG AATTACAACA TAGTTTATGA TGCTTTAGTT
AAGGTAGGTA GAGAGGATTT AATTGGTAAT GGTCCAAAAT GCTTAATTAG AGATAAAAAT
AGCTTTGGAA AAGGAAATAA TCATAGTAAT CACAAAAGTG GTGGAAGAAA GAGTAGAAAT
GAGAACAGCG GAAGAAGAGA GTCAGAAGAT AAGAAAAGAA GTTCTCATAG TAAAAAACAA
AGAGGAAACA AATCAAGAGG ATTTGATCAA AAGAGCCAAA GAAGCTCAAA GGGCAAGAAA
AGAAGATAA
 
Protein sequence
MSENKFLPIC KDDMIERGWE QCDFVLVTAD AYIDHHSFGT AIISRVLENA GYKVGIIAQP 
DWKSVDDFKK LGRPRLGFLV NGGNMDPMVN HYTVSKKLRK KDLYTPKGEM GKRPDRATIV
YCNKIREAYK DVNIVIGGIE ASLRRFAHYD YWDNKVRKSI LVDSGADLLV YGMSEKQIVE
VADFLNQGFD GKYIRHIPGT CYIADSLDEI YEEHIVLPSF KEVSSDKRTY AECFKIQYDE
QDPVRGRTLV QEHNGKYVVI NKPEMPLSRE ELDRVYALPY QKTYHPIYEK DGGIAAIEEV
KFSIVSSRGC SGNCSFCAIT FHQGRIVTSR SEDSIVEEAE EITKYDDFKG YIHDIGGPTA
NFRKPACKKQ LTLGACKHKR CMSPGICKNM EVDHREYLHL LRRVRKLPGI KKVFIRSGLR
YDYIMADKDD TFFKELVEHH VSGQLKVAPE HVSPNVLKYM GKPAGKTYDE FRRKFFRITE
RLGKKQFIIP YLMSSHPGCK LEDAIMLAEY LRDINYQPEQ VQDFYPTPGT LSTTMFYTGL
DPLTMEEVYI PRSKEEKAMQ RALLQFKNPK NYNIVYDALV KVGREDLIGN GPKCLIRDKN
SFGKGNNHSN HKSGGRKSRN ENSGRRESED KKRSSHSKKQ RGNKSRGFDQ KSQRSSKGKK
RR