Gene CPF_1439 details

Gene Information

Locus tag: CPF_1439
Symbol: entD
ID: 4201944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1619394
End bp: 1622543
Gene Length: 3150 bp
Protein Length: 1049 aa
Translation table: 11
GC content: 31%
IMG OID: 638082319
Product: mannosyl-glycoprotein endo-beta-N-acetylglucosamidase domain-containing protein
Protein accession: YP_695884
Protein GI: 110801032
COG category: [G] Carbohydrate transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG3103] SH3 domain protein; [COG4193] Beta-N-acetylglucosaminidase
TIGRFAM ID:
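
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
print(gene_length(1619394, 1622543))  # 3150
print(protein_length(3150))           # 1049
```

Both results agree with the Gene Length (3150 bp) and Protein Length (1049 aa) fields.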


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00896108
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGAA ATAGATTAAG CTGTCTTATA GTAGGTGCTG TAATTGGAGC TGGAGCAATA 
GTTTGTACAA CTAATACTAA AGTACATGCG AAACCAGTGA ATGAGGTTAA AAACATTAAT
ACTTCTAAAG GAAATTCATT TGGAGAAATT ATTTCATCAG AAGACCTAGG ATTAAGAAAA
GGTGCAGATT CTTCTCATGA AATAATAACT TCAATACCTA GGGGCGCTAG AGTTAACATA
ATAGACAAGG TATCTGATAA CTGGTATAAA GTTGGTTATA AAGATTTTGT AGGTTATGTA
GAAGCTAAGG ACATAAGAGT ACTAGGAGAT AATTTAAATC AAGATAATGT TGGTTTAATT
TCTGCTAATC AATTAAATGT TAGAACTAGC CCAAATGAAA ATGGTCAAGT GATTGGAACT
TTACATAAAA ATGATAAGGT AAATGTATTA GATAAATCAA TTGATGGTTG GTATAAAATT
GATTTTAATG GTAGAAGAGC ATATGTATCT AGTAAATATG TTAATTTAAT TTCATATAAA
AATAATGAAG TTAAGACAGA AGTAAAAAAA GAGCCAATTG AAGGAACAGG TAAGGTTAAT
ATAAATACAG CTTTAAATGT TAGACAAGCG TCTACTACAA ATAGTAGGAT TATTGGTAGC
TTAAAAGGTG GAGAAAAAGT TAACATAATA AGTGAAAGTA ATGGATTTTA TAAAATAGAG
TTTAATAATT CATATGGCTA TGTTTATTCT AAATACATAT CAAAAGATGG AGACAGTGAA
AAGGTTCAAG TTGTAAAACA AGAAGAAGTT AAAAAAGAAA AAGTTGATGA ATCTAAAAAA
GAAGCTAAAG CTACTCCTAA AGCAGAACCT GTAGTTTTGG CTGTTAGATC TCTTAATAAG
ACAGGAATAG TAAATGTAAG TAGTTCACTA AATGTAAGAA GCAGCGCAAG TACAAGCAGT
AAAGTTATAG GAAGCTTAAG CGGAAACACA AAGGTAACAA TAGTAGGAGA AGAAGGAGCC
TTCTATAAAA TAGAATATAA AGGTTCTCAT GGGTATGTAG CTAAGGAATA TGTTAAAGAT
GTTACAGAAA GCAATAATAG TAACCAAGGT ACACAGACTC CAGAAAAACC AAGTACTCCT
GAAACTACTA AAAAGACAGG AATAGTAAAT GTAAGTAGTT CTCTAAACGT AAGAGAAGGA
GCAAGTACAA GCAGTAAAGT TATAGGAAGC TTAAGCGGAA ACACAAAGGT AACAATAGTA
GGAGAAGAAG GAGCCTTCTA TAAAATAGAA TATAAAGGTT CTCATGGGTA TGTAGCTAAG
GAATATGTTA AAGATGTTAC AGAAAGCAAT AATAGTAACC AAGGTACACA GACTCCAGAA
AAACCAAGTA CTCCTGAAAC TACTAAAAAG ACAGGAATAG TAAATGTAAG TAGTTCTCTA
AACGTAAGAG AAGGAGCAAG TACAAGCAGT AAAGTTATAG GAAGCTTAAG CGGAAACACA
AAGGTAACAA TAGTAGGAGA AGAAGGAGCA TTCTATAAAA TAGAATACAA AGGTTCTCAT
GGATATGTAG CTAAGGAATA CGTTAAAGAT GTTACAGAAA GTAATAATAG TAACCAAGGT
ACACAGACTC CAGAAAAACC AAGTACTCCT GAAAGTACTG AAAAAACAGG AATAGTGAAT
GTAAGTAGTT CACTAAATGT AAGAAGCAGC GCAAGTACAA GCAGTAAAGT CATAGGAAGC
TTAAGCGGAA ACACAAAAGT AACAATAGTA GGAGAAGAAG GAGCATTCTA TAAAATAGAA
TACAAAGGTT CTCATGGATA TGTAGCTAAG GAATATGTTA AAGATGTTAC AGAAAGTAGT
AATAGTAACC AAGGTACACA GACTCCAGAA AAACCAAGTA CTCCTGAAAG TACTGAAAAA
ACAGGAATAG TGAATGTAAG TAGCTCTTTA AACGTAAGAG AAGGAGCAAG TACAAGCAGT
AAGGTTATAG GAAGCTTAAG CGGAAACACA AAGGTAACAA TAGTAGGAGA AGAAGGGGCA
TTCTATAAAA TAGAATATAA AGGTTCTCAT GGATATGTGG CTAAGGAATA TATCAAGGAT
ATTAAAGACG AGGTAGTAAC AGAACCAGAA AAACCAAGTA ACCCTGAAAA TAGTAAGAAA
ACTGGTGTTG TAACTGCATC TAAAGGATTA AATGTAAGGA AAGAAGCTAA TACTTCATCT
CAAATTATTG GAATTTTAAA TAGTGGAGAA AGTGTTGAAA TAATAGGAGA AGAAAATGGT
TTCTATAAGA TAACTTATAA AGGACAAGAA GCTTATGCAT CTAAAAATTA TATAAATATT
TTTGATGGTA ACTCAAATGT TAATCCTGGA TTAGATATAG GAAATGCTTC AAAAACAAAT
TATGGAGTAT CACTTAACGA ATATATAAAA TTACAACAAA GAAATAATCC TTCAAATTAT
TCATACTCAG AATTTGAAAA ATATATAAAT CCAGCTAAAG CTACTAATAA GCTACAGTTC
TTAAGAATAG ATAAATTTAG ATCAGTTAAT GTAAGTGGAT TAAGTAGTAG ATTAAGTAAC
AAAGGAGTTT TAACAGGACA AGGACAAGCT TTTGTAAATG CTGCTAAAGC CTTTAACATA
GATCCTATAT ACTTAGTTGC TCAATGTTTA CACGAAACAG GTAATGGAAC AAGTAAACTT
GCAAAGGGTG TAACAATTAC TGAAATTGCA GATGAAAGTA AACCTATATA TAATGGTAAT
GGTCAATTAG TAGGATATCA TATGATTAAA TTATCTAAGC CAGTAACAGT TTATAATTTA
TTTGGAATAG GGGCTAAGGA TAATTCATCA GTTTTTCCAA ATAGAGCTTT AATATTAGGA
ACAACATATG CTTATAATAG AGGTTGGACA AGTATTGAAA ATGCTATAAA GGGTGCTGCA
GAATTTGTTT CATTAAATTA TGTTCATAGT TCAAGATATA GTCAAAATAC TCTTTATAAG
ATGAGATATA ATCAAAATGT ATCAAATATA TGGCATCAAT ATGCTACAAC ACCATGGTAT
GCATCAAGTA TTGCTGATAT TATGAGGAGT TATCAAGATT TATATTTAGA AAATAATTTC
ACATTTGATG TACCTGTTTT TGCAGGATAA
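
The gene sequence above is printed in blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed for comparison with the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and how the record itself rounds its percentage is not stated.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string.

    Removes spaces, tabs, and newlines; nothing else is altered.
    """
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as printed in this record.
first_line = "ATGAATAGAA ATAGATTAAG CTGTCTTATA GTAGGTGCTG TAATTGGAGC TGGAGCAATA"
print(len(clean_sequence(first_line)))        # 60
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content reported under Gene Information.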
 
Protein sequence
MNRNRLSCLI VGAVIGAGAI VCTTNTKVHA KPVNEVKNIN TSKGNSFGEI ISSEDLGLRK 
GADSSHEIIT SIPRGARVNI IDKVSDNWYK VGYKDFVGYV EAKDIRVLGD NLNQDNVGLI
SANQLNVRTS PNENGQVIGT LHKNDKVNVL DKSIDGWYKI DFNGRRAYVS SKYVNLISYK
NNEVKTEVKK EPIEGTGKVN INTALNVRQA STTNSRIIGS LKGGEKVNII SESNGFYKIE
FNNSYGYVYS KYISKDGDSE KVQVVKQEEV KKEKVDESKK EAKATPKAEP VVLAVRSLNK
TGIVNVSSSL NVRSSASTSS KVIGSLSGNT KVTIVGEEGA FYKIEYKGSH GYVAKEYVKD
VTESNNSNQG TQTPEKPSTP ETTKKTGIVN VSSSLNVREG ASTSSKVIGS LSGNTKVTIV
GEEGAFYKIE YKGSHGYVAK EYVKDVTESN NSNQGTQTPE KPSTPETTKK TGIVNVSSSL
NVREGASTSS KVIGSLSGNT KVTIVGEEGA FYKIEYKGSH GYVAKEYVKD VTESNNSNQG
TQTPEKPSTP ESTEKTGIVN VSSSLNVRSS ASTSSKVIGS LSGNTKVTIV GEEGAFYKIE
YKGSHGYVAK EYVKDVTESS NSNQGTQTPE KPSTPESTEK TGIVNVSSSL NVREGASTSS
KVIGSLSGNT KVTIVGEEGA FYKIEYKGSH GYVAKEYIKD IKDEVVTEPE KPSNPENSKK
TGVVTASKGL NVRKEANTSS QIIGILNSGE SVEIIGEENG FYKITYKGQE AYASKNYINI
FDGNSNVNPG LDIGNASKTN YGVSLNEYIK LQQRNNPSNY SYSEFEKYIN PAKATNKLQF
LRIDKFRSVN VSGLSSRLSN KGVLTGQGQA FVNAAKAFNI DPIYLVAQCL HETGNGTSKL
AKGVTITEIA DESKPIYNGN GQLVGYHMIK LSKPVTVYNL FGIGAKDNSS VFPNRALILG
TTYAYNRGWT SIENAIKGAA EFVSLNYVHS SRYSQNTLYK MRYNQNVSNI WHQYATTPWY
ASSIADIMRS YQDLYLENNF TFDVPVFAG
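
The protein sequence can be reproduced from the gene sequence by codon translation. This sketch is not the tool used to build the record: it builds the codon table from the NCBI amino-acid string shared by translation tables 1 and 11 (table 11 differs in its alternative start codons, which this sketch does not handle), and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    Codons with unexpected characters are rendered as 'X'.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAATAGAA ATAGATTAAG CTGTCTTATA"))  # MNRNRLSCLI
```

The output matches the first 10 residues of the protein sequence listed above.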