Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1103 |
Symbol | |
ID | 4203528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1259269 |
End bp | 1261191 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638081984 |
Product | glycosy hydrolase family protein |
Protein accession | YP_695549 |
Protein GI | 110799785 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00087575 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGTTA AATTAGTTGG ATTAAATGAG GAAATGTTAG AAGCTGTCAA TGGATTAGAA GAACTAATTG ATTTTCAGCT TTATAATGAG AAGGGGAGTA ATAATGAAAA TTTAGAAATT ATAAATGTTT CTAAGCTTCC TGAAGAATCA GAAAATGTAA TAGAAGTCCT TAGAGGAAAA AATAATGAAA TAAGATATAA GAAAAAACAT CACTTTTTTA GGGCATTTAG TTTATATCTT CAATTTCTTA AAAAGGGAGA GGAGAGTTTT AGAAAAAGTG AAAAAAGTTA TATTGATTCA GTAGGAGCAA TGATAGATGC CTCAAGAAAT GCTGTCTATA GAGTTTCTGA GGTTAAAAAG ATATTAGGAT ATATGGCATT AATGGGACAT AATAGGTGTA TGCTTTATAC TGAAGATACT TATGAGATTG AGGGTTATCC ATACTTTGGT TATATGAGAG GAAGATATAC CAAAGAGGAA TTAAGGGAAA TTGATGATTA TGGATACTCT TTAGGTATAG AGGTTGTACC TTGTATTCAA ACTTTAGCTC ACTTAAAACA AACTTTAAGA TGGCCTTATG GGGAAGGGGT GAAAGACACT CAAGATGTAC TTTTAGTAGG AGAGGAAAAG ACTTATAAGT TTATAGAGGC AATGATTTCT TCTTTAAGAG AGTGCTTTAG AAGTAAAAAT ATTCATATAG GTATGGATGA GGCTTTTGAT TTAGGAAGAG GCCATTATTT AACTAAACAT GGACATGTTC CTCACCAAGA GCTCATGGTA GAGCATTTAA ATAAGGTTAA TGAAATAGCA AAAAAATATG ATTTTAAGCC AATGATTTGG GATGATATGT TTTTAAGATG TGGAGCTCCA GATGGTGGAT ATTACGATTT AGATATAGTT ATAACTCCTG AAATAGCAAA TAATATTCCA GAGGAAGTTT CCTTAGTTTA TTGGGATTAT TATAATTCTG ATGAAGAAAA ATATAAGAAG CTTTTAGATA TTAGAGATGA TTTTAATAAT AATATTATCT TTGCAGGAGG ATGCTGGAGG TGGAGTGGAT TTGCTCCAAA CTATTCAAAG ACCTTTGAAA CAACAAATGC TGCCTTAAAT CAATGTAAGG CTAAAGGAAT AAAAGAAGTC TTTGCAACGG CTTGGGGAGA TGATGGATCA GAAACGCCAA TATATTCAAT AATAGTAGGT CTTATATTAT TTGGAGAGCA TGGCTATTAT AACAAGGTAG AGAAAGAATG GATAGATGAA AGATGCAAAT CTTTAACAGG GCTTAGCATG GAGGACTTTA CTTCTTTAGA AGAATTAGAT CTTGTTCCAT CAGTTAAAAT TCCTAATATG GAGGTTTGTA ATCCTTCAAA ATATATAGCT TACCAAGATT TGCTTTTAGG AGCCTTTGAT AAGCATTTAG AGGGGCTAGA CTTAGAAGAA CATTATATTA ATCTTTCTAA AAAATATGAG GAAATAGGAG AGAAAAGTGA GAGGTTTAAA TTAATGTTTA CTATGTATTC TAAACTTGCT GCTTACCTTT CAGTTAAAAG TGAAATAGGA TTAGAAATAA GAAAAGCTTA TCTAGAAAAG GATAAGGATG CATTAAGGCT TATAGCATAT AACTTTATTC CAGAAATACA AGAAAAGCTA AAGAGTTTTC ATAAGAGTTT TAGGGATCTA TGGTATAAAG AATGTAAAGG ACAAGGCTTT GAAGTTATGG ATATTAGACT TGGTGGAGTT ATGGCAAGAT GCGATTCTGC TATGTATAGA ATAAAAGCTT ATCTTAAAGG AAATATTGAT AAAATAGAAG AGCTAGAAGA GGAAAGACTT TATTTTTCTG AACATTTTGG GGGAGATGAT TGTAAGCTTA TATGTTGCAA TGAATATCAA AAAATAGCAA CACAAAATAT TTTAAGTTGG TAA
|
Protein sequence | MRVKLVGLNE EMLEAVNGLE ELIDFQLYNE KGSNNENLEI INVSKLPEES ENVIEVLRGK NNEIRYKKKH HFFRAFSLYL QFLKKGEESF RKSEKSYIDS VGAMIDASRN AVYRVSEVKK ILGYMALMGH NRCMLYTEDT YEIEGYPYFG YMRGRYTKEE LREIDDYGYS LGIEVVPCIQ TLAHLKQTLR WPYGEGVKDT QDVLLVGEEK TYKFIEAMIS SLRECFRSKN IHIGMDEAFD LGRGHYLTKH GHVPHQELMV EHLNKVNEIA KKYDFKPMIW DDMFLRCGAP DGGYYDLDIV ITPEIANNIP EEVSLVYWDY YNSDEEKYKK LLDIRDDFNN NIIFAGGCWR WSGFAPNYSK TFETTNAALN QCKAKGIKEV FATAWGDDGS ETPIYSIIVG LILFGEHGYY NKVEKEWIDE RCKSLTGLSM EDFTSLEELD LVPSVKIPNM EVCNPSKYIA YQDLLLGAFD KHLEGLDLEE HYINLSKKYE EIGEKSERFK LMFTMYSKLA AYLSVKSEIG LEIRKAYLEK DKDALRLIAY NFIPEIQEKL KSFHKSFRDL WYKECKGQGF EVMDIRLGGV MARCDSAMYR IKAYLKGNID KIEELEEERL YFSEHFGGDD CKLICCNEYQ KIATQNILSW
|
| |