Gene CPF_0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0914 
Symbol 
ID4202446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1077932 
End bp1079059 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content24% 
IMG OID638081796 
Productcapsular polysaccharide biosynthesis protein 
Protein accessionYP_695363 
Protein GI110800082 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000157523 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA TTAGAGTTTT GCATATGGTT TCAACTTTAA GTAATGGAAG TGGAGTAATG 
GGCTTTATAA TGAATGCTTA TAGAAATATT GATAGAAATA AAATTCAGTT TGATTTCATT
TATTTTGATA ATGAAGAGAG AAGTATAACC TATATAGATG AAATATTAAA GCTTGGTGGA
AAAGTAAATT ATATAACAAA ACCAAATAAT TTAAGGAATA TAAATGAATT TAAAAATGAA
TTAAGTGAAA TTTTAAAGAA AGAAAATTAT AAAATAATTC ATTTACATGA AGTATATTTA
AATAAATTTG TAAATGATGA AGCGAAAAAA GTAGTAGGTG CTAAGGTAAT AGCCCATAGC
CATGCAACAA AATATTCAGA TAACAAAATA AAGGCTATTA GAAATAAAAT ATTATGTTTT
AATTTAAAGA AGAATGTAGA TATATTTTTT GCTTGTTCTA AGGCGGCAGG AAAGTTCTTA
TATGGGAAAA AAGCTTTTTA TGATAATAGA GTATTTGTAA TTAATAATGC AATTGAAATT
GATAAGTTTA AATATAATGA GAATATAAGA AATAAAGTAA GAAAAGAACT TAATTTAGAA
GAAAAATTTG TTATTGGTAA TATAGGAAGA TTTGCTAAAC AAAAGAATCA TAAGTTTTTA
ATTGATATTT TTTATGAGGT TAAAAAGAAA AAAGAAAATG CTTTTTTATT ATTAATAGGA
GAAGGAGACT TAAGAGAAAG TATAGAAAGA AAATTAGAAA AATTAAATCT AAGAAATTCT
GTTTTATTTT TAAGCTCTAG AAAAGATGTT AATGAAATTT TACAAGGTAT GGATGTTTTT
GTATTACCAT CATTATATGA GGGACTTCCG GTATCCGTAA TTGAGGCGCA AACTTCAGGG
TTACCTTGTA TTATTTCTAA TAAGGTAACT GATGAGGTCA ATATAATTGA TTGTAAGTTT
TTAAGTATTA CTAATGCGAA AGTATGGTGT AAATATATTT TAAAGTCAGA GGATCACCTT
AGAGTAGATA CTAATGAAAG TATTACAAAA GCAGGATATG ATATCAAATA TGAAGCTTTG
AGAATTCAAA ACATTTATGA AAAACTTTAT GCAGGAAGGG ATATTTAA
 
Protein sequence
MNKIRVLHMV STLSNGSGVM GFIMNAYRNI DRNKIQFDFI YFDNEERSIT YIDEILKLGG 
KVNYITKPNN LRNINEFKNE LSEILKKENY KIIHLHEVYL NKFVNDEAKK VVGAKVIAHS
HATKYSDNKI KAIRNKILCF NLKKNVDIFF ACSKAAGKFL YGKKAFYDNR VFVINNAIEI
DKFKYNENIR NKVRKELNLE EKFVIGNIGR FAKQKNHKFL IDIFYEVKKK KENAFLLLIG
EGDLRESIER KLEKLNLRNS VLFLSSRKDV NEILQGMDVF VLPSLYEGLP VSVIEAQTSG
LPCIISNKVT DEVNIIDCKF LSITNAKVWC KYILKSEDHL RVDTNESITK AGYDIKYEAL
RIQNIYEKLY AGRDI