Gene CPR_0280 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0280 
Symbol 
ID4205309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp337838 
End bp340060 
Gene Length2223 bp 
Protein Length740 aa 
Translation table11 
GC content31% 
IMG OID642564837 
Productglycosy hydrolase family protein 
Protein accessionYP_697609 
Protein GI110803187 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4724] Endo-beta-N-acetylglucosaminidase D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAACA AACAAAAAAT ACGTAAACGT AGAAGGGCTT TACTTTGTGT TTTTGCAGCA 
TTTGCATTTT CTTTTAATCC TTTAGGAAAG ATAGCTTACG CAACACCTAA ACAAGATACA
AGTGTTGTAA ATTATGTGAA AACTCAAGAA GAAACTTCAA ACTCAATCCA GAATCAACCA
ATCTCATCTT ATTGGTATCC AGAAGATTTA TTAAAATGGA GTGCTAACAA TGATAAAGAT
GCAAAATTTA ATAAAAGCAC AGTTCCATTA GCAAAAAGAG TCGAAAAAGA TAAACTTGAC
ACTATTAATG ATACTCAAAA TAAAGATGTT AAAGTTGTAG CAATTTCAAT AATGAATGCT
AACACTAGTG GTAATCCATC ACAAGGCTCA AATAAATTTA GTGCTAATAC ATTCTCTTAT
TGGCAATACA TAGATAAATT GGTTTACTGG GGTGGATCAG CTGGAGAAGG ATTAATAGTT
CCTCCAAGTC CAGATGTTAC TGACTCAGCA CACAGAAATG GGGTTCCTGT TTTAGGTACT
GTATTCTTTC CTATGACAGC TCATGGTGGA AAAATGGAAT GGTTAAATAA ATTCCTTGAA
AAAGATTCTA ATGGAAACTT CCCTATAATA GATAAGTTAA TTGAAGTTGC AGAAAACTAT
GGCTTTGATG GTTGGTTCAT AAATCAAGAA ACAGAAGGAA CTGAACAAGA ACCTTTAACT
CCTGAACATG CTAAGCTTAT GCAAGAATTA ATAAGGCAAT TTAAAGCTAA GTCTAACGGA
AAATTTGAAA TCATGTGGTA TGATTCCATG ACAAAAGATG GGAATATGGA TTGGCAAAAT
GCTTTAACTG ATAAAAATGA ATATTTCTTA TTAGATGGAG ATAAAAACAA AGTTGCTGAT
AGTATGTTCT TAAATTTCTG GTGGACATAT AATAGTTTAA AAGATAAAGA TTTATTAAAA
GTATCTAATG AAAAAGCTAA AGAAATAGGA ATAAACCCTT ATGACTTATA TGCTGGAATA
GATGTTCAAG AAAATGGATA TAAAACTCCT CTTAGATGGA CTCATTATCA AAGAGATGAC
CAAGCTCCAT TTACTTCATT AGGTTTATAT TGTCCAAGTT GGACTTATTT TGATGCTAAA
ACACAGGAAC AATTCCAAGA AAATGAAAGT AGATTATGGG TTAATGAACA TGGTGATCCA
TCAAAGGCAA CTAATGCTCA AGGAGTTGAT TGGAGAGGAA TTTCAACATA CTCTGTTGAA
AAAAGTGCTG TAACTAGCCT TCCATTTACA ACTAACTTTA ATATGGGGAA TGGATATGAT
TTTTATGTTG ATGGTAATAA AGTATCTACT CAAGATTGGA ATAACAGAAG TTTAAACGAT
ATAATGCCAA CTTACAGATG GGTTATTTCA AATGAAGAAA ATAATAATGT AAAAGCTGAC
ATAGATTATA CTACTGCTTA CTATGGTGGT AACTCAATTA AACTTTCTGG TTACCTAGCT
GAAGGAAAAG CTTCTACAAT AAAACTTTAT AGTTCTGATT TAACTTTACC TGAAGGTGTG
CAATTCACTA CTACTGCTAA GGCTAATGGA AATACTGTAG ATATGGATTT AGTTCTTACA
TTCCATGATG GTACAGAAAC TACTATTTCT GCTGATAAGA AACTTAGTAC TGATTGGACT
AAACTTACTT ATGATATTTC ACCATATGTA GGAAAATCAA TTAAAACTAT TTCTTATAAA
CTTTCAAGCC CAGTATGTGT TGATAACTTC TTAGCAAACT TAGGTAATAT AACAATAGAA
ATTCCTAACT CAAGTTCAAA AGTAAATATA TCTAATGCTA AATTAAATGA TGTTGACTTT
AAAGATGGAA TATATGCAGG AGCTAGACTT TCATGGAGTC CTGAAGGAAA TTCTTCTGAT
GTACATCATT ATGAAATTTA CAGAGTTATG AATGATGGAA CTAAAGTTTT ACTAGGTGCA
ACTCCTAATA CAAGTTATTA TGTTAGTGAT TTAAGAAGAA ATGATAAAGA AACAAATACT
AATTTTGAGG TAGTTGCTGT TAATAAAAAC TATAACAGAG GAAATAGTCA AAGTATAAAT
ATAGAATGGC CTTCTTACCC TTCTCCAGCA GCATCGTTTA AGGTTTCAAA TACACTAATA
GCACCAGGAC AATCAGTAAC TTTTACTAAC ACTAGTTCTG AAGTTACAGA AGAAATCCAA
TGA
 
Protein sequence
MTNKQKIRKR RRALLCVFAA FAFSFNPLGK IAYATPKQDT SVVNYVKTQE ETSNSIQNQP 
ISSYWYPEDL LKWSANNDKD AKFNKSTVPL AKRVEKDKLD TINDTQNKDV KVVAISIMNA
NTSGNPSQGS NKFSANTFSY WQYIDKLVYW GGSAGEGLIV PPSPDVTDSA HRNGVPVLGT
VFFPMTAHGG KMEWLNKFLE KDSNGNFPII DKLIEVAENY GFDGWFINQE TEGTEQEPLT
PEHAKLMQEL IRQFKAKSNG KFEIMWYDSM TKDGNMDWQN ALTDKNEYFL LDGDKNKVAD
SMFLNFWWTY NSLKDKDLLK VSNEKAKEIG INPYDLYAGI DVQENGYKTP LRWTHYQRDD
QAPFTSLGLY CPSWTYFDAK TQEQFQENES RLWVNEHGDP SKATNAQGVD WRGISTYSVE
KSAVTSLPFT TNFNMGNGYD FYVDGNKVST QDWNNRSLND IMPTYRWVIS NEENNNVKAD
IDYTTAYYGG NSIKLSGYLA EGKASTIKLY SSDLTLPEGV QFTTTAKANG NTVDMDLVLT
FHDGTETTIS ADKKLSTDWT KLTYDISPYV GKSIKTISYK LSSPVCVDNF LANLGNITIE
IPNSSSKVNI SNAKLNDVDF KDGIYAGARL SWSPEGNSSD VHHYEIYRVM NDGTKVLLGA
TPNTSYYVSD LRRNDKETNT NFEVVAVNKN YNRGNSQSIN IEWPSYPSPA ASFKVSNTLI
APGQSVTFTN TSSEVTEEIQ