Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1967 |
Symbol | |
ID | 4205992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2173511 |
End bp | 2174521 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642566517 |
Product | hypothetical protein |
Protein accession | YP_699276 |
Protein GI | 110803936 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAGAG ATAGTAGAAA GAAGAGGAGA AGTAAAAAGA AATGGAAGAC TATAGCAATA TTCATTGCTT TTGAATTCAT ATTTACTGCT GTTACAGCAC CTTTTATACT TTTATATGGA CCATTTAAAA ATGCTAAAAG GACTTATGTA GGTGCAGCAA TGACTAGTTT TAATCATCAG TGGATGGCAA CTACATTTTT ATCAGATGAA AAAATTAATG ATATATTAAA TTCTAATATT GAGGATACAA ATACAAATCA TAAAAATACC AATATAAAGA CAAATGTTAA TTTACCAACT AAACATGATA ATAGTATAGA ATTATACTCT TTTGAAAACT TTAAATATAG TGGATATTAT ATAGTTGTAA AAGATCCTAC TAGAGTAAAA ATAGGAGTTT CTAAATACCT AGGAGAAGAA GGACAAACTA CTTCTGAGAT AGCTAGAGAA TACAATGCTG TTGCTGCTGT AAATGGAGGA GCTTTTACAG ATAAATCTAG TACGGCTCAA TGGACTGGTA ATGGAGGAAC TCCTGCTGGA ATAGTTATAT CAGAGGGGAA ATTAGTTTAT AAAGATGTAC CAGATGACAA GAAAATTGAG TTAGTTGGTA TAACAAAAGA AGGAAAAATG ATTGCAGGAA TGTATTCATT TAATAATCTT AAAGAATTAA ATGTTAAGGA AGCTGTAAGT TTTGGTCCTG TTTTAGTTAA AGAAGGAGAA CCTACACCTA TGAAAGGTGA TGGTGGATGG GGAGTTGCTC CAAGAACTGC TATGGGACAA AGAGCTGATG GATCAATAGT AATGTTAGTT ATTGATGGTA GAAGTTTAAC AAGTGGAGGA GCTACTTTAA AGGAATTACA GGAAGTATTA TTAAATACTT GTAATGTAGT TACTGCTATG AACCTTGATG GTGGTAAATC AACTACTATG TACTTAAATG GAAAAGTAAT AAATAATCCA GCATCAAATG TAGGGGAGAG ATCTATTCCT TCAGCTATAA TAGTAAAATA A
|
Protein sequence | MGRDSRKKRR SKKKWKTIAI FIAFEFIFTA VTAPFILLYG PFKNAKRTYV GAAMTSFNHQ WMATTFLSDE KINDILNSNI EDTNTNHKNT NIKTNVNLPT KHDNSIELYS FENFKYSGYY IVVKDPTRVK IGVSKYLGEE GQTTSEIARE YNAVAAVNGG AFTDKSSTAQ WTGNGGTPAG IVISEGKLVY KDVPDDKKIE LVGITKEGKM IAGMYSFNNL KELNVKEAVS FGPVLVKEGE PTPMKGDGGW GVAPRTAMGQ RADGSIVMLV IDGRSLTSGG ATLKELQEVL LNTCNVVTAM NLDGGKSTTM YLNGKVINNP ASNVGERSIP SAIIVK
|
| |