Gene CPF_1487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1487 
Symbol 
ID4201001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1686237 
End bp1689755 
Gene Length3519 bp 
Protein Length1172 aa 
Translation table11 
GC content32% 
IMG OID638082365 
Productputative hyaluronoglucosaminidase 
Protein accessionYP_695930 
Protein GI110801141 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAAA AAGGTAACAA ACTTATTGAA TTTGTTAGTA AGATCATGGC TTTTGTTATG 
GCAGTGACTT TATTATCAAG CTTACCTGTA CAAAATGTAT TAGCTAATGG CAGCAAAGAA
ACAAAGGGAG ATGAGTATGA AATCTATCCA ATTCCTCAAA GTATAAAATA TGATAATAGC
ATAGTAACTT TAGGGACTGA TGCTAATGTT GTATTTGAAG AAGGAATTGA TGAGGCAACA
AAAAACAGAC TATTAGAGGT TTTATCAATT AAGGGAATTA ATCATGAAGA GAGCAATGAA
ATAAAAGAAG ATAAGACAAA CTTTCTAATA GGAATAAATA ACTCAGAAGG AGTTGTAGAC
AAGTACTTTA CAGATAATAA TTTAGTAAAT GATTCTCATT TTGAAAATCA TGATGCCCAT
GTAGTATCAG TTAAGGGAAA TGTAATAGCT GTATTAGGTA AAAATACTGA TTCAGCATTT
TATGGTATCA CATCTTTAAA AGCAATATTT AATCAATTAG AAGGAAATGA ATTAAAGGAA
TTATTAATTG AGGACTATTC AGATGGTCAA TGGCGTGGTT TTATTGAAGG ATATTATGGA
ATTCCTTGGA GCAATGAAAA TCGTAAGGAC TTAATGAAAT TTGGTGGGGA TTTTAAGATG
AATTCATACA TCTTTGCACC AAAGGATGAT CAGTATCACA GTCTTAAGTG GAGAGAGCCA
TATCCTGCTG AGAAGTTAGC TGAAATAAAG GAAATGGTTG ATGTTGGAAT AGCAACAAAG
AATAAATTCA TATGGACAAT ACACCCATTC TTAAAAGATG GTATGAACTT TGGATCAGAA
GAGAGTTATA AAGCTGATTT AGAAAAGATA ATAGCTAAGT TTGAACAATT ATATAGTGTG
GGAGTACGTC AATTTGGTGT GCTTGCAGAT GATGCTGAGG GAGAAGCTAA TAATCAAGTA
AAACTTATGG AAGACTTAGA AAAATGGCGT TTACAAAAAG GTGATGTATA CGAATTTATA
TTTGTTCCTA AGGTTTATAC AAAGGAATCA GCTGGTGGAG ATGTTAATAA TGAATACTTA
AAAACTATTG GTACAATGCC AGAAACTATT GATATCATGT GGACTGGTGA TGTAATACTT
GGTTATGTAA CTCAAGAGAC ATTTGAATTC TTTGAAGAAG CTGTAGGACG TCAAGCCTTT
ATGTGGTTAA ACTGGCCAGT AAATGATATT AACAATAAAC GTTTACTAAT GGGTAAGGGT
GAGATGTTAG ATCCAACAGT TACAAACTTT AAGGGAATTG TAACAAATCC AATGCAAGAA
GCTCAAGCTT CAAAGGTAGC TTTATTTGCT ATAGCTGATT ATGGATGGAA TAGAGCAGAT
TTTGATATGG ACAAGAGTTG GAAAGATTCT TTCAAATATA TAGAGCCAGA TGCAAGTGAA
GAATTATATA CCTTTGCAAA ACACATGAGT GATCCAGCTC CAAACTGGCA TGGATTATCT
TTAGAAGAAT CTGAAGAGTT AAGACCAGTA ATTGAAGAGT TTACAAGAAG ATTATGGGAA
AAAGAATCTG TTTTAGATTA CAGTAAGGTT ATTTTAGATG AATATCAAGA GATTTTAGAT
GCAACTAATA ATTTTGCAAC TAAATCTAAG AATGAATTAT TAAAGAGTGA AATTAAAGGA
TGGGTTGATT CTTTAAGAGA TTTAGCAGAA TCAACTATAG CATATATAAA TTCAGCAGTA
GCTTTTGAAA AAGGTAACTA TGAGGAAGCT ATGAAATACT ATGTTCTAGG AGAAGAAGAG
TACACTGCTT CAAGATCTCA TAGAACTCCT GTAATAAATG GACAATCTAG ACCAGAACCA
GGTACAAGAC ACTTAATACC ATTTATTAAG GATTTATCTA AAATCATAGG AGATAATATT
GACCAAGTTA TAAATCCAGA TACTACTAAG TTAACACTTC GTCCTTATAC AAATATGTCA
CCTCTTTATT GGGGACATGT TCAAAATATA GCTGATGGAG ATAATACTAG ATATTCATGC
ATGTGGATTC GTAATTCAGC TAAAGAAGGA GACTATGTTG CTGTAGAATT AAATGAGGTA
ACTAAGGTTA ATAGCATTAC TTTTGAACAA GGACAAGATG AGGGAGATGC ATTTAACTAT
GGTAAATTCC AATATTCTAT GGATGGTGAA AATTGGACTG ATGTAGATGG TGTAGATTAT
GGACCAAAAA TGCATAAGAT TGTAGTTGAA GGTTTAGATA TAACAGCAAA ATATTTAAGA
TTTATTCCTA CTAAAGAAAT TTTAAATAAC TGGATTGCTG TTAGAGAATT CTCTGTTAAT
AAAAAAGATG AAAATAATAT GAAAGTTAAT GCATATACAA ATGTAGAAGC TTTAGCTAAG
AATGAAGTAA GTATTTCAGA GGAAAAAGCT ACACTTTCAG ATTTAAATGA TGTTACTTTA
TCAAAGGGAG AATATGTTGG AATTAAGTTA AATAAGCTTA GAGAAGTAAC TAACATAGTT
TCAGATTTAA CAAATAAGGA TAATTTAACA TTAGAAACAT CAATTAATGG AGTTGAGTGG
GTAGAAGCAA AAACTTTATC AGAAACTATA AATGCTCGCT ATGTTCGTAT TATAAATAAT
ACAGATAAGG ATGTAACTTT TAATTTAAAT AAATTAGAAG CTGAATACTC TAATAATGAT
GTGAATTTTG ATATTAGACC AACAGCAGAA GCAAAGTTTG AGCCTAAGAA CTTAATTGAT
GGAAAATTAA ATACAGCATT TAAACCACTA GAAAGTGCAC CTAAATCTGG CCAATTAACT
TATAGAATTT CTGACAAAAC AGATATTAAG AAGTTTACAA TAGTACAAAA TCCAAATACA
ATTTCTAATG CAATTGTATC TGTAAGAAAT GAAAATGGAT GGAAAGAGAT AGGAAGTTTA
GGAAAGAGCT TTAATGAGTT TAATACAGAA GATTTTGAAA ATGTCTTTGA AATAAAGGTA
GAGTGGGATG GATTTGCACC TACTATTTAT GAAATAGGAC TTTCAACAAT TAAAGAAGAG
GTTCAAGTAG ATAAGTCTAA ATTAGAAGAA GCTATTAAGG AAGTAGAAAA GCTTAAAGAG
GAAGACTATA CAAAGGACTC TTGGAGTAAC TTAATAGAAA AGTTAAATTT AGCTAAAGAG
GTATTATCTA AGGAAGATGC TACTCAAGAT GAGGTAGATA ATGCTATAAA AGCTTTAAAT
GAAGCCCTTA ATGGATTAGT TAAAAAAGAA GAGACTGAAG GACCAGTAGA TCCAGACAAA
CCAGAAGGCC CAATTGATCC AGACAAGCCA GTTGATCCTG AGAATCCAGA TAATACAGAG
AAACCTGAAA ATCCTGAAGG TACAGATAAA CCAGAAACTC CAGACAAACC AGAGGGTAAT
TTACCAAACA CTGGAGGAGC TTCATCATCA ATATTCTTAC AATTAGGAGT AGTAATGATG
GCTTCAGGAG CTTTTGTATT AAAAAGAAAG AAAAGATAA
 
Protein sequence
MGKKGNKLIE FVSKIMAFVM AVTLLSSLPV QNVLANGSKE TKGDEYEIYP IPQSIKYDNS 
IVTLGTDANV VFEEGIDEAT KNRLLEVLSI KGINHEESNE IKEDKTNFLI GINNSEGVVD
KYFTDNNLVN DSHFENHDAH VVSVKGNVIA VLGKNTDSAF YGITSLKAIF NQLEGNELKE
LLIEDYSDGQ WRGFIEGYYG IPWSNENRKD LMKFGGDFKM NSYIFAPKDD QYHSLKWREP
YPAEKLAEIK EMVDVGIATK NKFIWTIHPF LKDGMNFGSE ESYKADLEKI IAKFEQLYSV
GVRQFGVLAD DAEGEANNQV KLMEDLEKWR LQKGDVYEFI FVPKVYTKES AGGDVNNEYL
KTIGTMPETI DIMWTGDVIL GYVTQETFEF FEEAVGRQAF MWLNWPVNDI NNKRLLMGKG
EMLDPTVTNF KGIVTNPMQE AQASKVALFA IADYGWNRAD FDMDKSWKDS FKYIEPDASE
ELYTFAKHMS DPAPNWHGLS LEESEELRPV IEEFTRRLWE KESVLDYSKV ILDEYQEILD
ATNNFATKSK NELLKSEIKG WVDSLRDLAE STIAYINSAV AFEKGNYEEA MKYYVLGEEE
YTASRSHRTP VINGQSRPEP GTRHLIPFIK DLSKIIGDNI DQVINPDTTK LTLRPYTNMS
PLYWGHVQNI ADGDNTRYSC MWIRNSAKEG DYVAVELNEV TKVNSITFEQ GQDEGDAFNY
GKFQYSMDGE NWTDVDGVDY GPKMHKIVVE GLDITAKYLR FIPTKEILNN WIAVREFSVN
KKDENNMKVN AYTNVEALAK NEVSISEEKA TLSDLNDVTL SKGEYVGIKL NKLREVTNIV
SDLTNKDNLT LETSINGVEW VEAKTLSETI NARYVRIINN TDKDVTFNLN KLEAEYSNND
VNFDIRPTAE AKFEPKNLID GKLNTAFKPL ESAPKSGQLT YRISDKTDIK KFTIVQNPNT
ISNAIVSVRN ENGWKEIGSL GKSFNEFNTE DFENVFEIKV EWDGFAPTIY EIGLSTIKEE
VQVDKSKLEE AIKEVEKLKE EDYTKDSWSN LIEKLNLAKE VLSKEDATQD EVDNAIKALN
EALNGLVKKE ETEGPVDPDK PEGPIDPDKP VDPENPDNTE KPENPEGTDK PETPDKPEGN
LPNTGGASSS IFLQLGVVMM ASGAFVLKRK KR