Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0588 |
Symbol | |
ID | 4205463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 701139 |
End bp | 702272 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 22% |
IMG OID | 642565148 |
Product | glycosyl transferase, group 1 family protein |
Protein accession | YP_697915 |
Protein GI | 110803088 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000108704 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTAT TATTTTTAAT TGATAATTTA TGTGGAGGTG GAGCTGAAAA AGTTTTAATT GATATTTTAA AGAATTTAAA TTTGCAAAAA TATAATATAG AAATTTTTTT AATAAGAAAT AAGGGAGTTT ATTTGAAAGA TTTACCTCAA AATATAAAAG TGAATTATAT ATATAATGAT AGAAATTTTG AAGAAACCAA ATTATATAAT ATATATTATT GGATAACAAA AACTCATTTT AAAATTAATA GAATATTAGG GTTAAAAAGA TTATTTAAGA AAAATATAAA AAAACAATAT GATGCTTATA TTCCTTTTTT AGAAGGAGCA TGTATTCAGT TAGTATCAGA TTCTAATTTA GAAGGGAAAA AAATTGCTTG GATACATACA GATTTAATTA AACATAATAT AATGAGTTTA AAAGAAGAAC GTAAGGCACT TAATTCAATG GATAAATTAA TATGTGTATC AGAAGGAAGC AAAAGATCTC TATTAAAAAA GTATCCGGAA TTCAATCATA AAGTTATGGT TATAAATAAT CCAATAGATT TAGATAATAT AGAGAAAAAA GCAAATGAAA AAATTGAAGA AGTAGTATTT AACAATTCTT ATCCTACTTT TATTGCGATT GGAAGAGTAG AAAGAGTAAA AGGACATGAC CTATTAATAG AAGCACATAA AAAACTTATA AATGATGGAT TTAAACATAA TATTGCAATA TTGGGAGTAG GACAAGAAAT GGAAAGCTTA AAAAATTTAA TTAATGAAAA TTCTTTACAA AGTAGTTTTA AATTTTTAGG CTTTAAATCA AATCCATATA AATATTTAAA AGAAGCTGAT TTTTATATTA TGCCATCAAG ATATGAAGGT TATCCATTAA GCTTATGTGA AGCCATAGCT TTAGAAAAAC CAATTATAGC TACTAATTTT GAATCGGCTA AAGACATACT TAAAAATGGC AAATTAGGAC TAATAGCAGA ATTAGAAGAT GTAGATGATA TAACTTTTAA GATGAAAAAG CTTTTAGAAG ATTATAATTT AGTAAAAGAA TTAAAAAATA ATTGTAGTAT TTTTAAGCAT ACATTAGGAT TTAAAGAAAA AATATTGGAT ATAGAAAATG TTATAGATGG TTAG
|
Protein sequence | MNLLFLIDNL CGGGAEKVLI DILKNLNLQK YNIEIFLIRN KGVYLKDLPQ NIKVNYIYND RNFEETKLYN IYYWITKTHF KINRILGLKR LFKKNIKKQY DAYIPFLEGA CIQLVSDSNL EGKKIAWIHT DLIKHNIMSL KEERKALNSM DKLICVSEGS KRSLLKKYPE FNHKVMVINN PIDLDNIEKK ANEKIEEVVF NNSYPTFIAI GRVERVKGHD LLIEAHKKLI NDGFKHNIAI LGVGQEMESL KNLINENSLQ SSFKFLGFKS NPYKYLKEAD FYIMPSRYEG YPLSLCEAIA LEKPIIATNF ESAKDILKNG KLGLIAELED VDDITFKMKK LLEDYNLVKE LKNNCSIFKH TLGFKEKILD IENVIDG
|
| |