Gene CPF_0406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0406 
Symbol 
ID4201126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp485340 
End bp487358 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content27% 
IMG OID638081290 
Productheparinase II/III-like protein 
Protein accessionYP_694863 
Protein GI110798789 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.242919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATT ATTCTAATAA AATAGAGGGA TTAAAAGCAA AGGTTAATGA GTATATGCAA 
GTATATGATG CTGAGTTTGT AAGGTATTAT ATAAATCATA ATTGTAAGAA TGAAGTAGAT
AAAAAGTTAA TAGGTTCTAA TTTAATTCTT AATAATTCTT TTATATTTGA TGATGAATGG
GATATGGAAC AATGTAAAAT TCCATATTTA AATAGAGATT TAGATTGGAA CTTTACTCCT
AATGGAGATG AGGAATGGGT ATTTATGCTT AATAGACATG AATACTTTGA AAAATTAATT
GCAGCCTATT ATTTTAGTAA TGATGAAAAA TATTTAGATA AGTTAAAGGA ATTAATATTT
AATTGGATTG AAAAGAATGA AATAAAGGAG TGTGGAGGAC CAACAATAAG AACAATAGAT
ACTGGAATAA GATGTTTTAG TTGGATGAAG TCTCTTTTAC ACTTGATTCA TGAAAATAAA
TTAGAAGATG AAGAAATATT AAAAATAATT TCTAGTATAA AAGAACAATT AGAGTATTTG
AAAAAATCTT ATATTGATAA ATATGTTCTT AGCAATTGGG GAGTATTACA GACAACAGCA
ATAATTACTT GCTCACTATG GTTAAAAGAA TTTATTGAGG ATGAGGAATT ATATAAGTGG
GCACTTGAGG AACTATACAG GGAAATAAAT CTTCAAGTTT TAGAAGATGG TTCACATTGG
GAGCAGTCTG TAATGTATCA TATTGAAGTA CTAAATTGTT CTATGGCAAC TATACACTAT
GTTAAGTATT TTAATGTTGA TTTAGATGAG GAGTTCTTAG AAAAAATACA TTCTATGGCT
AAATATTTAG TTTACTGTGG AGATTCAAAT TCAATTCAAG TTGCTCAAGG GGACAGTGAC
AGAAGTGATA TTAGAGATGT TCTTCTTAGG GCATCTATAT TATTTAATGA TCCTCATTTA
AAATTTAGAG CATATGAAAC TATGGATTTA ACCAGTATAT TATTGTTTGG AAGAGATGGA
TTTTTAAAAT ATACAAATAT GGATGCAGAG GAAATTACAA AGCTTAATAA GTCCTTTATA
GACTCTGGTA ATATTTATAT AAGAAGTGGA TGGGACAAAG AGGCAAGTTT TACTTATTTA
CAAAATGGTA CTCTTGGAAG TGGACATGGA CATTCAGATT TATGCCATTT TTCAATTCAT
TATGGTGGAG AACCATTTTT AATAGATTCA GGTAGGTATA CCTATGTTGA AAGTGATCTT
TTAAGGGAAT ATTTAAAATC TGTTAAGGCT CATAATGTAT CTGTAATTGA TGATTCTCCT
TTTGCTATTC CTAAGAATTC ATGGAAATAT AATAAATATC CAGATGTTAT GAAAAATTAT
TTTAATGAAA AAGATAATAT AGCTTATGCT GAAATGGCTT ATTTAGCAAC TTTAAGTGAT
GGAACACCAT ATACGGTTAT TAGAAAGGTG TTAGTTATAT CTCCAGATAT ATGGGTAATA
GTTAATGATA TAAGATGTAG TGGAAAGCAT ATATGCAAAA ATTATTATAA TTTAGATTAT
AAGGTTAAGG TGATTAAAGA AGAGGGTTAT TTTAGATGTG TAAATAAAGA AAGTGAAATT
AAGATTTATA ATAACAATGT AGATAAGAAA TATATAGAAA ATACTTTAAT TTCAACAAAT
TATAATAGTA TAAATAATAG TAAAAAGATA GTAACACAAG GTACTTTTGA AAATAATTTT
GTGAACTATG ATATAATTTT AGGGCAGAAT TTAAAAAGTA TAGAAATAAA GGATCCTAGT
ATTGTACAAT ATAATTCAAA AGAAAAGATT GATACAAGTG TGGCAATAAC TAAAGAATTT
GTAATAAACG AAAATGAGAG TTATACTGTT ATAATATTTA ATAAAGAAAC ATTTAAAGGG
GCTAAGGTAT ATATGTATGA CGATTTGGCT TTATATGGTA AGGTTATTGT TGTGCATAGA
GTTAAAGACA AGAGAGAAAT AATTCGTTTG AAAGCATAA
 
Protein sequence
MSNYSNKIEG LKAKVNEYMQ VYDAEFVRYY INHNCKNEVD KKLIGSNLIL NNSFIFDDEW 
DMEQCKIPYL NRDLDWNFTP NGDEEWVFML NRHEYFEKLI AAYYFSNDEK YLDKLKELIF
NWIEKNEIKE CGGPTIRTID TGIRCFSWMK SLLHLIHENK LEDEEILKII SSIKEQLEYL
KKSYIDKYVL SNWGVLQTTA IITCSLWLKE FIEDEELYKW ALEELYREIN LQVLEDGSHW
EQSVMYHIEV LNCSMATIHY VKYFNVDLDE EFLEKIHSMA KYLVYCGDSN SIQVAQGDSD
RSDIRDVLLR ASILFNDPHL KFRAYETMDL TSILLFGRDG FLKYTNMDAE EITKLNKSFI
DSGNIYIRSG WDKEASFTYL QNGTLGSGHG HSDLCHFSIH YGGEPFLIDS GRYTYVESDL
LREYLKSVKA HNVSVIDDSP FAIPKNSWKY NKYPDVMKNY FNEKDNIAYA EMAYLATLSD
GTPYTVIRKV LVISPDIWVI VNDIRCSGKH ICKNYYNLDY KVKVIKEEGY FRCVNKESEI
KIYNNNVDKK YIENTLISTN YNSINNSKKI VTQGTFENNF VNYDIILGQN LKSIEIKDPS
IVQYNSKEKI DTSVAITKEF VINENESYTV IIFNKETFKG AKVYMYDDLA LYGKVIVVHR
VKDKREIIRL KA