Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0280 |
Symbol | |
ID | 4205309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 337838 |
End bp | 340060 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642564837 |
Product | glycosy hydrolase family protein |
Protein accession | YP_697609 |
Protein GI | 110803187 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4724] Endo-beta-N-acetylglucosaminidase D |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAACA AACAAAAAAT ACGTAAACGT AGAAGGGCTT TACTTTGTGT TTTTGCAGCA TTTGCATTTT CTTTTAATCC TTTAGGAAAG ATAGCTTACG CAACACCTAA ACAAGATACA AGTGTTGTAA ATTATGTGAA AACTCAAGAA GAAACTTCAA ACTCAATCCA GAATCAACCA ATCTCATCTT ATTGGTATCC AGAAGATTTA TTAAAATGGA GTGCTAACAA TGATAAAGAT GCAAAATTTA ATAAAAGCAC AGTTCCATTA GCAAAAAGAG TCGAAAAAGA TAAACTTGAC ACTATTAATG ATACTCAAAA TAAAGATGTT AAAGTTGTAG CAATTTCAAT AATGAATGCT AACACTAGTG GTAATCCATC ACAAGGCTCA AATAAATTTA GTGCTAATAC ATTCTCTTAT TGGCAATACA TAGATAAATT GGTTTACTGG GGTGGATCAG CTGGAGAAGG ATTAATAGTT CCTCCAAGTC CAGATGTTAC TGACTCAGCA CACAGAAATG GGGTTCCTGT TTTAGGTACT GTATTCTTTC CTATGACAGC TCATGGTGGA AAAATGGAAT GGTTAAATAA ATTCCTTGAA AAAGATTCTA ATGGAAACTT CCCTATAATA GATAAGTTAA TTGAAGTTGC AGAAAACTAT GGCTTTGATG GTTGGTTCAT AAATCAAGAA ACAGAAGGAA CTGAACAAGA ACCTTTAACT CCTGAACATG CTAAGCTTAT GCAAGAATTA ATAAGGCAAT TTAAAGCTAA GTCTAACGGA AAATTTGAAA TCATGTGGTA TGATTCCATG ACAAAAGATG GGAATATGGA TTGGCAAAAT GCTTTAACTG ATAAAAATGA ATATTTCTTA TTAGATGGAG ATAAAAACAA AGTTGCTGAT AGTATGTTCT TAAATTTCTG GTGGACATAT AATAGTTTAA AAGATAAAGA TTTATTAAAA GTATCTAATG AAAAAGCTAA AGAAATAGGA ATAAACCCTT ATGACTTATA TGCTGGAATA GATGTTCAAG AAAATGGATA TAAAACTCCT CTTAGATGGA CTCATTATCA AAGAGATGAC CAAGCTCCAT TTACTTCATT AGGTTTATAT TGTCCAAGTT GGACTTATTT TGATGCTAAA ACACAGGAAC AATTCCAAGA AAATGAAAGT AGATTATGGG TTAATGAACA TGGTGATCCA TCAAAGGCAA CTAATGCTCA AGGAGTTGAT TGGAGAGGAA TTTCAACATA CTCTGTTGAA AAAAGTGCTG TAACTAGCCT TCCATTTACA ACTAACTTTA ATATGGGGAA TGGATATGAT TTTTATGTTG ATGGTAATAA AGTATCTACT CAAGATTGGA ATAACAGAAG TTTAAACGAT ATAATGCCAA CTTACAGATG GGTTATTTCA AATGAAGAAA ATAATAATGT AAAAGCTGAC ATAGATTATA CTACTGCTTA CTATGGTGGT AACTCAATTA AACTTTCTGG TTACCTAGCT GAAGGAAAAG CTTCTACAAT AAAACTTTAT AGTTCTGATT TAACTTTACC TGAAGGTGTG CAATTCACTA CTACTGCTAA GGCTAATGGA AATACTGTAG ATATGGATTT AGTTCTTACA TTCCATGATG GTACAGAAAC TACTATTTCT GCTGATAAGA AACTTAGTAC TGATTGGACT AAACTTACTT ATGATATTTC ACCATATGTA GGAAAATCAA TTAAAACTAT TTCTTATAAA CTTTCAAGCC CAGTATGTGT TGATAACTTC TTAGCAAACT TAGGTAATAT AACAATAGAA ATTCCTAACT CAAGTTCAAA AGTAAATATA TCTAATGCTA AATTAAATGA TGTTGACTTT AAAGATGGAA TATATGCAGG AGCTAGACTT TCATGGAGTC CTGAAGGAAA TTCTTCTGAT GTACATCATT ATGAAATTTA CAGAGTTATG AATGATGGAA CTAAAGTTTT ACTAGGTGCA ACTCCTAATA CAAGTTATTA TGTTAGTGAT TTAAGAAGAA ATGATAAAGA AACAAATACT AATTTTGAGG TAGTTGCTGT TAATAAAAAC TATAACAGAG GAAATAGTCA AAGTATAAAT ATAGAATGGC CTTCTTACCC TTCTCCAGCA GCATCGTTTA AGGTTTCAAA TACACTAATA GCACCAGGAC AATCAGTAAC TTTTACTAAC ACTAGTTCTG AAGTTACAGA AGAAATCCAA TGA
|
Protein sequence | MTNKQKIRKR RRALLCVFAA FAFSFNPLGK IAYATPKQDT SVVNYVKTQE ETSNSIQNQP ISSYWYPEDL LKWSANNDKD AKFNKSTVPL AKRVEKDKLD TINDTQNKDV KVVAISIMNA NTSGNPSQGS NKFSANTFSY WQYIDKLVYW GGSAGEGLIV PPSPDVTDSA HRNGVPVLGT VFFPMTAHGG KMEWLNKFLE KDSNGNFPII DKLIEVAENY GFDGWFINQE TEGTEQEPLT PEHAKLMQEL IRQFKAKSNG KFEIMWYDSM TKDGNMDWQN ALTDKNEYFL LDGDKNKVAD SMFLNFWWTY NSLKDKDLLK VSNEKAKEIG INPYDLYAGI DVQENGYKTP LRWTHYQRDD QAPFTSLGLY CPSWTYFDAK TQEQFQENES RLWVNEHGDP SKATNAQGVD WRGISTYSVE KSAVTSLPFT TNFNMGNGYD FYVDGNKVST QDWNNRSLND IMPTYRWVIS NEENNNVKAD IDYTTAYYGG NSIKLSGYLA EGKASTIKLY SSDLTLPEGV QFTTTAKANG NTVDMDLVLT FHDGTETTIS ADKKLSTDWT KLTYDISPYV GKSIKTISYK LSSPVCVDNF LANLGNITIE IPNSSSKVNI SNAKLNDVDF KDGIYAGARL SWSPEGNSSD VHHYEIYRVM NDGTKVLLGA TPNTSYYVSD LRRNDKETNT NFEVVAVNKN YNRGNSQSIN IEWPSYPSPA ASFKVSNTLI APGQSVTFTN TSSEVTEEIQ
|
| |