Gene CPF_0285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0285 
Symbol 
ID4201313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp340337 
End bp343744 
Gene Length3408 bp 
Protein Length1135 aa 
Translation table11 
GC content32% 
IMG OID638081172 
Productendo-beta-N-acetylglucosaminidase 
Protein accessionYP_694745 
Protein GI110798707 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4724] Endo-beta-N-acetylglucosaminidase D 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAACA AACAAAAAAT GCGTAAGCGT AGAAGGGCTT TACTTTGTGT TTTTGCAGCA 
TTTGCATTTT CTTTTAATCC TTTAGGAAAG ATAGCTTACG CAACACCTAA ACAAGATACA
AGTGTTGTAA ATTATGTGAA AACTCAAGAA GAAACTTCAA ACTCAATTCA GAATCAACCA
ATCTCATCTT ATTGGTATCC AGAAGATTTA TTAAAATGGA GTGCTAACAA TGATAAAGAT
GCAAAATTTA ATAAAAGCAC AGTTCCATTA GCAAAAAGAG TCGAAAAGGA TAAACTTGAT
ACTATTAATG ACACTCAAAA TAAAGATGTT AAAGTTGTAG CAATTTCAAT AATGAATGCT
AATACTAGTG GCAATCCATC ACAAGGTTCA AATAAATTTA GTGCTAATAC ATTCTCTTAT
TGGCAATACA TAGATAAATT AGTTTACTGG GGTGGATCAG CTGGAGAAGG ATTAATAGTT
CCTCCAAGTC CAGATGTTAC TGACTCAGCA CACAGAAATG GGGTTCCTGT TTTAGGTACT
GTATTCTTCC CTATGACAGC TCATGGTGGA AAAATGGAAT GGTTAAATAA ATTCCTTGAA
AAAGATTCTA ATGGAAACTT CCCTATAGTA GATAAGTTAA TTGAAGTTGC AGAAAACTAT
GGCTTTGATG GTTGGTTCAT AAATCAAGAA ACAGAAGGAA CTGAACAAGA ACCTTTAACG
CCTGAACATG CTAAGCTTAT GCAAGAATTA ATAAAGCAAT TTAAAGCTAA GTCTAATGGA
AAATTTGAAA TTATGTGGTA TGATTCCATG ACAAAAGATG GGGATATGGA TTGGCAAAAT
GCTTTAACTG ATAAAAATGA ATATTTCTTA TTAGATGGAG ATAAAAACAA AGTTGCTGAT
AGTATGTTCT TAAATTTCTG GTGGACATAT AATAGTTTAA AAGATAAAGA TTTATTAAGA
GTATCTAATG AAAAAGCTAA AGAAATAGGA ATAAACCCTT ATGACTTATA TGCTGGAATA
GATGTTCAAG CAAATGGATA TAAAACTCCT CTTAGATGGG CTCATTATCA AAGAGATGAC
CAAGCTCCAT TTACTTCATT AGGTTTATAT TGTCCAAGCT GGACTTATTT TGATGCTAAA
ACTCCAGAAC AATTTCAAGA AAACGAAAGT AGATTATGGG TTAATGAACA TGGTGATCCA
TCAAAGGCAA CTAATGCTCA AGGAGTTGAT TGGAGAGGAA TTTCAACATA CTCTGTTGAA
AAAAGTGCTG TAACTAGCCT TCCATTTACA ACTAACTTTA ATATGGGTAA TGGATATGAT
TTTTATGTTG ATGGTAATAA AGTATCTACT CAAGATTGGA ATAATAGAAG TTTAAATGAT
ATAATGCCAA CTTACAGATG GGTTATTTCA AATGAAGAAA ATAATAATGT AAAAGCTGAC
ATAGATTATA CTAATGCTTA CTATGGTGGT AACTCAATTA AACTTTCTGG TTACCTAGCT
GAAGGAAAAG CTTCTACAAT AAAACTTTAT AGTTCTGATT TGACTTTACC TGAAGGTGTA
CAATTCACTA CTACTGCTAA GGCTAATGGA AATACTGTAG ATATGGATTT AGTTCTTACA
TTCCATGATG GTACAGAAAC TACTATTTCT GCTGATAAGA AACTTGGTAC TGATTGGACT
AAACTTACTT ATGATGTTTC ACCATATGTA GGAAAATCAA TTAAAACTAT TTCTTATAAA
CTTTCAAGCC CAGTATGTGT TGATAACTTC TCTGCAAACT TAGGTAATAT AACAATAGAA
ATTCCTAACT CAAGTTCAAA GGTAAATATA TCTAATTCTA AATTAAATGA TGTTGACTTT
AAAGATGGAA TATATGCAGG AGCTAGACTT TCATGGAGTC CTGAAGGAAA TTCTTCTGAT
GTACATCATT ATGAAATTTA CAGAGTTATG AATGATGGAA CTAAAGTTTT ACTAGGTGCA
ACTCCTAATA CAAGTTATTA TGTTAGTGAT TTAAGAAGAA ATGATAAAGA AACAAATACT
AATTTTGAGG TAATTGCTGT TAACAAAAAC TATAACAGAG GAAATAGTCA AAGTATAAAT
ATGGAATGGC CTGCTTACCC TGCTCCAACA GCATCATTTA AGGTTTCAAA TACACTAATA
GCACCAGGAC AATCAGTAAC CTTTACTAAC ACTAGTTCTG AAGTTACAGA AGAAATCCAA
TGGAAATTCC CAGGTGCTAA AGTTGAAAGT AGTACAGAAC AAAATCCAAC TGTTACTTAT
GAAAAGGAAG GTGTTTATCC AGTAACACTT ACTGCTAAAA ATTCTTCTGG AGAAAATGTG
GAAACTAAAA CTGAACTTAT AACTGTTACA AATGCTGCAA AAGATGGTTT AGTAAACTTA
TCACTTAATA AATCAGCTAC AGCATCAAGC TTTGTAAACG AGAATGAAGC TCCTCAATAT
GCTTTAGATG GAAATGTTAA AACTAAGTGG TGTGCCGTAG GCTCTGCTCC TCATACATTA
ACAATAGATT TAGGAGATAT TAAAACTATA GGAGAATTAG AAATAAGCCA TGCAGAAGCA
GGTGGAGAAA GCAGTGGTAT GAACACTAAA GCTTACTCTC TTGAAGTTAG TAATGATGGT
GAAAACTTTA CTCCTGTATT AAATGTAGAT GATAATACAA AGGCTATAAG CAATGATGCT
TTCCCTGTTA CTAAAGCTAG ATATGTTAGA TTAAATATTA TTCAGCCAAC TCAAGGTGCA
GACTCTGCAG CTAGAATATA CGAGGTTGCT GTAAAGGGAT TAGATGGAAA TGTGGATTTA
CCACCTGTAA TAAACCCTGA TGATTCAAAT AAACCAGTTG ATCCTGATAA TCCAAATAAT
CCAGACAAGC CAAACAATCC TGAAAGCCCA GATAATCCTG GTAATACAGA TAAACCTGTT
AATCCTGATC AACCTGGCGA TTCTGAAAAA CCAGAAAATC CAAACAAACC TGGTGATACA
GAAAATTCAG GTGAAAAACC TTCTGAGCCA TCTGCTTTAG TTAGTAAAGA AATTACTGAA
AATAGTGCTT TATTAAGTTG GAAGGCTCCT GAAAAAGTAG ACTTAATTAA AGAGTATGTT
ATTTATCAAG ATGGAAAAGA AATTGGTAGA GTTCCTGCTG ACAAAACTCC ACTTGAGTTC
TTAGCTAAAG ACTTAAAACC TAATACTAAA TATAACTTTC AAGTATCTTC AATTGATAAA
AACGGAAAAG AATCAGATAA AATATCATTA GAAGTTACTA CAAATAAAAA AGATAATGGA
AATTTACCTA ATACAGGATC ACCTATAGGG GCTGGTGCTT TAGCTACTAC TGGAATTGCT
TTATCATCTG CTGGTGTTTA TTTAACTTTA AAGAAAAAAA GAAAATAA
 
Protein sequence
MTNKQKMRKR RRALLCVFAA FAFSFNPLGK IAYATPKQDT SVVNYVKTQE ETSNSIQNQP 
ISSYWYPEDL LKWSANNDKD AKFNKSTVPL AKRVEKDKLD TINDTQNKDV KVVAISIMNA
NTSGNPSQGS NKFSANTFSY WQYIDKLVYW GGSAGEGLIV PPSPDVTDSA HRNGVPVLGT
VFFPMTAHGG KMEWLNKFLE KDSNGNFPIV DKLIEVAENY GFDGWFINQE TEGTEQEPLT
PEHAKLMQEL IKQFKAKSNG KFEIMWYDSM TKDGDMDWQN ALTDKNEYFL LDGDKNKVAD
SMFLNFWWTY NSLKDKDLLR VSNEKAKEIG INPYDLYAGI DVQANGYKTP LRWAHYQRDD
QAPFTSLGLY CPSWTYFDAK TPEQFQENES RLWVNEHGDP SKATNAQGVD WRGISTYSVE
KSAVTSLPFT TNFNMGNGYD FYVDGNKVST QDWNNRSLND IMPTYRWVIS NEENNNVKAD
IDYTNAYYGG NSIKLSGYLA EGKASTIKLY SSDLTLPEGV QFTTTAKANG NTVDMDLVLT
FHDGTETTIS ADKKLGTDWT KLTYDVSPYV GKSIKTISYK LSSPVCVDNF SANLGNITIE
IPNSSSKVNI SNSKLNDVDF KDGIYAGARL SWSPEGNSSD VHHYEIYRVM NDGTKVLLGA
TPNTSYYVSD LRRNDKETNT NFEVIAVNKN YNRGNSQSIN MEWPAYPAPT ASFKVSNTLI
APGQSVTFTN TSSEVTEEIQ WKFPGAKVES STEQNPTVTY EKEGVYPVTL TAKNSSGENV
ETKTELITVT NAAKDGLVNL SLNKSATASS FVNENEAPQY ALDGNVKTKW CAVGSAPHTL
TIDLGDIKTI GELEISHAEA GGESSGMNTK AYSLEVSNDG ENFTPVLNVD DNTKAISNDA
FPVTKARYVR LNIIQPTQGA DSAARIYEVA VKGLDGNVDL PPVINPDDSN KPVDPDNPNN
PDKPNNPESP DNPGNTDKPV NPDQPGDSEK PENPNKPGDT ENSGEKPSEP SALVSKEITE
NSALLSWKAP EKVDLIKEYV IYQDGKEIGR VPADKTPLEF LAKDLKPNTK YNFQVSSIDK
NGKESDKISL EVTTNKKDNG NLPNTGSPIG AGALATTGIA LSSAGVYLTL KKKRK