Gene CPF_0085 details


Gene Information

Locus tag: CPF_0085
Symbol:
ID: 4203164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 101318
End bp: 102325
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 32%
IMG OID: 638080966
Product: myo-inositol 2-dehydrogenase
Protein accession: YP_694549
Protein GI: 110799733
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
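
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 101318
end_bp = 102325

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1008
print(protein_length)   # 335
```

Under these assumptions the computed values agree with the reported gene length of 1008 bp and protein length of 335 aa.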


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCATGA TTAATATTGG TGTTATTGGA GCAGGAAGAA TAGGAAGAAT TCATGCTCAA 
TCTATTCAAG AAAAAGTACA AGGAGCTCAA GTTAAAACTA TAGCAGATGT ATTTGAAGAA
TCAGCTAAAA AAGCTGCTGA GGAATTTAAA ATTCCAAACT ATACAGGAGA TTACATGGAA
ATCTTAAACG ATCCAGAAAT TGATGCTGTA ATAATATGTT CATCAACAGA TACTCACTCA
AAAATATCAA TTGAAGCTGC TGAAAGAGGT AAACACATAT TCTGTGAAAA ACCTATAGAT
TACGATGTTG CTAAAATTGA AAAAACTTTA GAAGCTGTTA AAAAAGCTGG AGTAAAATAT
CAAGTTGGAT TCAATAGAAG ATTTGACCAC AACTTTGCTA AATTAAAAGA ACTTATAGAA
GAAGGTAAAA TAGGAAATCC ACACATAATA AAAATTACTT CAAGAGATCC ACAAGCTCCA
CCAATAGAAT ACGTAAAAGT TTCAGGTGGT ATGTTCTTAG ATATGACAAT TCATGATTTT
GATATGGCTG CATTCTTAAG TGGAAGTAGA ATTTCAGAAG TATATGTACA AGGTGCATGT
ATGGTAAATC CTGAAATAGG AGAAGCTGGA GATGTTGATA CAGCAATTAT ATCTTTAAAA
TTTGAAAACG GATGCATAGG TGTAATAGAT AATAGTAGAG AAGCTGCTTA TGGATATGAC
CAACGTGTTG AAGTATTTGG TGGAAAAGGA TATGTAATGG CTGATAATGA CTCAGATACA
ACAGTTACAA TAGCTTCAGT TGATGGTATC GTTGGAGAAA AACCAAAATA TTTCTTCTTA
GAAAGATACA TGGATGCCTA TGTTAATGAA ATGCAACAGT TTATAAACTG CTTAGTTAAT
GATACTGAAG TTCCAGTTGG AGCTAATGAA GGATTATATT CAGTTTTAGT TGGACTAGCT
GCTACTAAAT CATTAAAAGA AGGTAGACCA GTTAAAATAG AATACTAA
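
The GC content reported for this gene can in principle be recomputed from the gene sequence. The sketch below is one way to do that in Python, assuming GC content is defined as the percentage of G and C bases among all bases in the sequence. The function name `gc_percent` is hypothetical, and how the database rounds its figure is not stated in the record.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and line breaks used to group the sequence
    in blocks of ten) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Small illustrative inputs (not taken from the record):
print(gc_percent("ATGC"))        # 50.0
print(gc_percent("AAAA TTTT"))   # 0.0
```

Passing the full gene sequence, copied as shown with its spaces and line breaks, gives a value that can be compared with the reported GC content.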
 
Protein sequence
MVMINIGVIG AGRIGRIHAQ SIQEKVQGAQ VKTIADVFEE SAKKAAEEFK IPNYTGDYME 
ILNDPEIDAV IICSSTDTHS KISIEAAERG KHIFCEKPID YDVAKIEKTL EAVKKAGVKY
QVGFNRRFDH NFAKLKELIE EGKIGNPHII KITSRDPQAP PIEYVKVSGG MFLDMTIHDF
DMAAFLSGSR ISEVYVQGAC MVNPEIGEAG DVDTAIISLK FENGCIGVID NSREAAYGYD
QRVEVFGGKG YVMADNDSDT TVTIASVDGI VGEKPKYFFL ERYMDAYVNE MQQFINCLVN
DTEVPVGANE GLYSVLVGLA ATKSLKEGRP VKIEY
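
The protein sequence corresponds to the gene sequence translated codon by codon. The following Python sketch illustrates this, assuming the gene is read in frame from its first base. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which codons may act as start codons), so the standard assignments are used here. The names `CODON_TABLE` and `translate` are illustrative, not part of any library.

```python
# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code; "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame, stopping at the first stop codon.

    Whitespace is ignored. Codons containing characters other than
    A, C, G, T raise a KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGTCATGA TTAATATTGG TGTTATTGGA GCAGGAAGAA TAGGAAGAAT TCATGCTCAA"
print(translate(first_line))  # MVMINIGVIGAGRIGRIHAQ
```

The output matches the first 20 residues of the protein sequence. Translating the final nine bases of the gene, `GAATACTAA`, gives `EY`, matching the last two residues, with `TAA` acting as the stop codon.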