Gene Information |
Locus tag | CPR_0789 |
Symbol | |
ID | 4205550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 914818 |
End bp | 916650 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642565348 |
Product | cymH protein |
Protein accession | YP_698114 |
Protein GI | 110802979 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
| |
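The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 914818
end_bp = 916650
gene_length_bp = 1833
protein_length_aa = 610

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
computed_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be a stop codon and is not counted as an amino acid.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print("gene length (bp):", computed_length)
print("leftover bases:", remainder)
print("protein length (aa):", computed_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the table.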
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA TTACAAGAGA GGCAATACAT CATATAGCTC AAAGCAATTA TTCTTATGGA TATGATAATG AAACTTTGCA TTTAAGGGTT AGGACTAAAA AAGGTGAAGT AAATAAAGTG GAAATAAGAA TTGGAGATCC TTACATATGG GACGAAGGTG GTTGCGATGG AGGAAATATG AATGCCACTG GAGGACGATG GACAGGTGGA AAAAGTTATC CTATGAGAAA GGAATGTGAA ACAAAATACT TTGATCACTG GATAGTTTAT TATAAACCAT TAACCAAACG TTCAAGATAT GGATTTATAT TATATGGAGA TGAAGAAACT CTTTTATGCA CAGAAAAAAG AATAGAGGAG TTAGATGGAA AGTATGATGA AGAAAAATTA AGTGCTATAG GAAACTTTTA TTGTTTTCCA TATTTAAATG CCATAGATGT TGCTAAAACA CCACAATGGG TAAAAGATAC TGTTTGGTAT CAAATATTCC CAGATAGATT TTGCAATGGA GATAAATCAA TAGATCCAGA AAATGTTGAG CCATGGGGGA CAGAGCCTAC TAGGGATAAT TTTATGGGAG GAGATTTACA GGGAGTTTTA GATAAATTAG ATTACTTATG CAATCTTGGA ATTAATGGAC TATATTTTTG TCCTGTATTT GAAGCTACTG AGAATCATAG GTATGAAACC ATAGATTATT TTAAAGTAGA TCCAGCGCTT GGTGGAAATG AAGTCTTTAA AAAACTTGTA AGTGAAGCTC ACAAAAGAGG AATGAAAATA ATGTTAGATG CAGTATTTAA TCATATAGGT TATTTTTCAC CAAAATGGCA AGATGTATTA AAGAATAATG AAAAATCAAG ATATAAGGAT TGGTTTTGTA TAAAGAAGTT TCCAGTACTT GAAAATGGCT TAGAAAATGT TGACGGAAAT AATTTAAATT ATGAAACCTT TGGAAGAATA GCCACAATGC CTAAACTAAA CACAGAAAAT CCAGATGTTG TAGAATATTT ATTAAAAGTT GCTAAGTTCT GGGTTGAAGA AATGGATATA GATGGTTGGA GACTTGATGT ATGTAATGAA GTAGACCATG TATTTTGGAG AAAATTTAGG GAAGTAGTAA AGGAAACTAA TAAAGAAGTT TATATATTAG GAGAAGTTTG GCATGATGGA CTTCCATGGC TTATGGGAGA TCAGTTTGAT GCAGTTATGA ATTACCCTGT TACAGATGCA GTAAAAGAAT ATTTCTGCTT AAATCAATCT AATCCAGAAG ATTTTAAATA TATGATAGAA GCTAATAAGG TTAGCTATTT AAGACAAATA GGAGAAACGA TATTTAACTT ATTAGATAGT CATGATACTC CAAGAATATT AACTGTTGCT GGAGGAAACA AGAATAAGAT GAAACTAGCT TATCTATTTA TGTTTACTCA AGCTGGTTCT CCATGTATAT ATTATGGAGA TGAAGTTGGT ATGGAAGGAA ATCAAGGAAT GGGTATGGAA TTCCATAGAA GATGTATGAT TTGGGATGAA AATAAACAAG ACAAAGATAT GCTTAAGTTT ATGAAACAAA TAATAAAAAT AAGAAAAGAA AATAAGGAAT TAAATTTATT AGATAACAAT TGGATAAGAG CTAATAGAGA TGAAAATATA CTTATATATT CAAAGAAAAA TATATTCATC ATTATGAACA ATTCAGATAA GGAAGAAAAG GTATTTTTAC CTAAAGAGAT TAAAAATAAT AAGGTTAAGG ATTTATTTGA AGAAAAAATT GAATCTTTAA AAGAAGATGT AGAATTAAAG 
CCCTTTGCAT TTAGAGTTTA TAAAAAGCTT TAA
|
Protein sequence | MSKITREAIH HIAQSNYSYG YDNETLHLRV RTKKGEVNKV EIRIGDPYIW DEGGCDGGNM NATGGRWTGG KSYPMRKECE TKYFDHWIVY YKPLTKRSRY GFILYGDEET LLCTEKRIEE LDGKYDEEKL SAIGNFYCFP YLNAIDVAKT PQWVKDTVWY QIFPDRFCNG DKSIDPENVE PWGTEPTRDN FMGGDLQGVL DKLDYLCNLG INGLYFCPVF EATENHRYET IDYFKVDPAL GGNEVFKKLV SEAHKRGMKI MLDAVFNHIG YFSPKWQDVL KNNEKSRYKD WFCIKKFPVL ENGLENVDGN NLNYETFGRI ATMPKLNTEN PDVVEYLLKV AKFWVEEMDI DGWRLDVCNE VDHVFWRKFR EVVKETNKEV YILGEVWHDG LPWLMGDQFD AVMNYPVTDA VKEYFCLNQS NPEDFKYMIE ANKVSYLRQI GETIFNLLDS HDTPRILTVA GGNKNKMKLA YLFMFTQAGS PCIYYGDEVG MEGNQGMGME FHRRCMIWDE NKQDKDMLKF MKQIIKIRKE NKELNLLDNN WIRANRDENI LIYSKKNIFI IMNNSDKEEK VFLPKEIKNN KVKDLFEEKI ESLKEDVELK PFAFRVYKKL
|
| |
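The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: the function name translate and the table construction are illustrative. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in which codons may act as start codons), and it strips the spaces that separate the 10-base groups in the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order, as laid out in the NCBI
# genetic code tables. Table 11 uses the same assignments as table 1.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    # Remove the whitespace between 10-base groups and normalise case.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_30_bases = "ATGAGTAAAA TTACAAGAGA GGCAATACAT"
print(translate(first_30_bases))
```

Passing the full Gene sequence field to translate and comparing the result with the Protein sequence field (after removing its spaces) gives an end-to-end consistency check of the record.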