Gene CPR_0789 details


Gene Information

Locus tag: CPR_0789
Symbol:
ID: 4205550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 914818
End bp: 916650
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 30%
IMG OID: 642565348
Product: cymH protein
Protein accession: YP_698114
Protein GI: 110802979
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
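
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a CDS that ends with a single stop codon; both assumptions are consistent with the values in this record. The function names `gene_length` and `protein_length` are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the CDS includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(914818, 916650)
print(length_bp)                  # 1833
print(protein_length(length_bp))  # 610
```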


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA TTACAAGAGA GGCAATACAT CATATAGCTC AAAGCAATTA TTCTTATGGA 
TATGATAATG AAACTTTGCA TTTAAGGGTT AGGACTAAAA AAGGTGAAGT AAATAAAGTG
GAAATAAGAA TTGGAGATCC TTACATATGG GACGAAGGTG GTTGCGATGG AGGAAATATG
AATGCCACTG GAGGACGATG GACAGGTGGA AAAAGTTATC CTATGAGAAA GGAATGTGAA
ACAAAATACT TTGATCACTG GATAGTTTAT TATAAACCAT TAACCAAACG TTCAAGATAT
GGATTTATAT TATATGGAGA TGAAGAAACT CTTTTATGCA CAGAAAAAAG AATAGAGGAG
TTAGATGGAA AGTATGATGA AGAAAAATTA AGTGCTATAG GAAACTTTTA TTGTTTTCCA
TATTTAAATG CCATAGATGT TGCTAAAACA CCACAATGGG TAAAAGATAC TGTTTGGTAT
CAAATATTCC CAGATAGATT TTGCAATGGA GATAAATCAA TAGATCCAGA AAATGTTGAG
CCATGGGGGA CAGAGCCTAC TAGGGATAAT TTTATGGGAG GAGATTTACA GGGAGTTTTA
GATAAATTAG ATTACTTATG CAATCTTGGA ATTAATGGAC TATATTTTTG TCCTGTATTT
GAAGCTACTG AGAATCATAG GTATGAAACC ATAGATTATT TTAAAGTAGA TCCAGCGCTT
GGTGGAAATG AAGTCTTTAA AAAACTTGTA AGTGAAGCTC ACAAAAGAGG AATGAAAATA
ATGTTAGATG CAGTATTTAA TCATATAGGT TATTTTTCAC CAAAATGGCA AGATGTATTA
AAGAATAATG AAAAATCAAG ATATAAGGAT TGGTTTTGTA TAAAGAAGTT TCCAGTACTT
GAAAATGGCT TAGAAAATGT TGACGGAAAT AATTTAAATT ATGAAACCTT TGGAAGAATA
GCCACAATGC CTAAACTAAA CACAGAAAAT CCAGATGTTG TAGAATATTT ATTAAAAGTT
GCTAAGTTCT GGGTTGAAGA AATGGATATA GATGGTTGGA GACTTGATGT ATGTAATGAA
GTAGACCATG TATTTTGGAG AAAATTTAGG GAAGTAGTAA AGGAAACTAA TAAAGAAGTT
TATATATTAG GAGAAGTTTG GCATGATGGA CTTCCATGGC TTATGGGAGA TCAGTTTGAT
GCAGTTATGA ATTACCCTGT TACAGATGCA GTAAAAGAAT ATTTCTGCTT AAATCAATCT
AATCCAGAAG ATTTTAAATA TATGATAGAA GCTAATAAGG TTAGCTATTT AAGACAAATA
GGAGAAACGA TATTTAACTT ATTAGATAGT CATGATACTC CAAGAATATT AACTGTTGCT
GGAGGAAACA AGAATAAGAT GAAACTAGCT TATCTATTTA TGTTTACTCA AGCTGGTTCT
CCATGTATAT ATTATGGAGA TGAAGTTGGT ATGGAAGGAA ATCAAGGAAT GGGTATGGAA
TTCCATAGAA GATGTATGAT TTGGGATGAA AATAAACAAG ACAAAGATAT GCTTAAGTTT
ATGAAACAAA TAATAAAAAT AAGAAAAGAA AATAAGGAAT TAAATTTATT AGATAACAAT
TGGATAAGAG CTAATAGAGA TGAAAATATA CTTATATATT CAAAGAAAAA TATATTCATC
ATTATGAACA ATTCAGATAA GGAAGAAAAG GTATTTTTAC CTAAAGAGAT TAAAAATAAT
AAGGTTAAGG ATTTATTTGA AGAAAAAATT GAATCTTTAA AAGAAGATGT AGAATTAAAG
CCCTTTGCAT TTAGAGTTTA TAAAAAGCTT TAA
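
The GC content field can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown here; the helper name `gc_content` is hypothetical, and the sample input is only the first line of the gene sequence, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        return 0.0
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only; paste the full sequence for the gene-wide value.
first_line = "ATGAGTAAAA TTACAAGAGA GGCAATACAT CATATAGCTC AAAGCAATTA TTCTTATGGA"
print(f"{gc_content(first_line):.1f}%")
```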
 
Protein sequence
MSKITREAIH HIAQSNYSYG YDNETLHLRV RTKKGEVNKV EIRIGDPYIW DEGGCDGGNM 
NATGGRWTGG KSYPMRKECE TKYFDHWIVY YKPLTKRSRY GFILYGDEET LLCTEKRIEE
LDGKYDEEKL SAIGNFYCFP YLNAIDVAKT PQWVKDTVWY QIFPDRFCNG DKSIDPENVE
PWGTEPTRDN FMGGDLQGVL DKLDYLCNLG INGLYFCPVF EATENHRYET IDYFKVDPAL
GGNEVFKKLV SEAHKRGMKI MLDAVFNHIG YFSPKWQDVL KNNEKSRYKD WFCIKKFPVL
ENGLENVDGN NLNYETFGRI ATMPKLNTEN PDVVEYLLKV AKFWVEEMDI DGWRLDVCNE
VDHVFWRKFR EVVKETNKEV YILGEVWHDG LPWLMGDQFD AVMNYPVTDA VKEYFCLNQS
NPEDFKYMIE ANKVSYLRQI GETIFNLLDS HDTPRILTVA GGNKNKMKLA YLFMFTQAGS
PCIYYGDEVG MEGNQGMGME FHRRCMIWDE NKQDKDMLKF MKQIIKIRKE NKELNLLDNN
WIRANRDENI LIYSKKNIFI IMNNSDKEEK VFLPKEIKNN KVKDLFEEKI ESLKEDVELK
PFAFRVYKKL
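
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch using only the Python standard library. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in the permitted start codons, so the standard codon-to-residue mapping is used here. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes an uppercase DNA sequence on the coding strand.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated with bases T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding DNA sequence; '*' marks a stop codon."""
    bases = "".join(sequence.split()).upper()
    codons = (bases[i:i + 3] for i in range(0, len(bases) - len(bases) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGTAAAA TTACAAGAGA GGCAATACAT"))  # MSKITREAIH
# Last four codons of the gene sequence, ending in the stop codon.
print(translate("AAAAAGCTTTAA"))                      # KKL*
```

The first output matches the first ten residues of the protein sequence, and the second matches its final three residues followed by the stop.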