Gene CPF_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1117 
Symbol 
ID4202661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1274949 
End bp1277291 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content30% 
IMG OID638081998 
Productglycosy hydrolase family protein 
Protein accessionYP_695563 
Protein GI110799795 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1554] Trehalose and maltose hydrolases (possible phosphorylases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGATT ATAGAGAGCC TATTTATAAA ATAGAGCCTT GGAATATAAC AGAAGAAGAA 
TTTCTTTTAA AAAATAATTA TAGAAATGAA ACAACATTTT CATTGGCTAA TGGATATATA
GGTACTAGAG GCACCTTTGA AGAGGAATAT GATTTTGATG TTGAAACAGG ATTAGAAGGA
AACTTTGTAA ATGGCTTTTA TGAAAGTGAG CATATAAGAT ATGGGGAATG GAATTTTGGT
TTCCCTACAG AAAGTCAATC ACTTTTAAAT CTTCCAAATG CTAAAATAAT AAAGCTATTT
ATAGAAGATG AAGAATTCAG TATGCTTACA GGTGAAATAG AAGATTATAA GAGAGTTTTG
CATATGAAAG AGGGAAGGAT AACTCGAGAT CTTATATGGG TATCACCTAA GGGTAAAAAA
ATAAAAATTA GCATAAGTAG ATTTGTTAGT TTTAATAATA AAAATTTAAT GGAAATACGT
TATAAAGTTA CTCCTTTAAA CTTTAGTGGA AATTTAAAAT TTATCTCTGC TATTGATGCA
AATGTAGAAA ATCATACAAG AAAAACTAAT CCTTTAGTTG ATTATGGTCC TTTTGGTAAA
AGGCTTGCAA ATGATTATAT AGATTCCATA AAAGATGAAC TTTATTATGA AGGAACAACA
TTAAATAGTG AACTTTCAAT AGCTTGTGGA GCAGTAAATA AAATATCAGC AGAAAACTTT
ATAAGAAAGA ATTTTAAAAA TTATGAGCTT TTTGGGGTAT CTTATGAATT TTATGCTAAG
GAAAATAAAG AAATTATATT AGATAAGTTT ATTGCATATA GCACATCTTT AGATATGAAT
TGTAAAAAGT TACATGGCTT TATAAAAACT ATTTTAAGTG ATGCCAAGAA AAAAGGATAT
ATAGAAGCTG AGAGAGAGCA AAAGGAATAT GTTGAAGAAT TTTGGAGAAC AGCTGATGTA
ATTATTGAGG GAGATGATGC ACTTCAACAG GGAATAAGAT TTAATCTTTT TCACCTTATG
CAATCAGCTG GGAGAGATGG TAAAACCGGA ATGGGAGCTA AAGGGTTAAG TGGAGAAGGA
TATGAGGGAC ATTATTTTTG GGATACAGAA ATGTATGTAC TTCCTGTATT TGTATATACT
AAGCCAGATT TAGCTAAGAA ATTACTAGAT TATAGGTATT TTACTTTAGA TAAAGCTAGA
GAGAGAGCAA GGGTATTAGG ACATGATAAA GGAGCCTTAT ATCCATGGAG AACAATTAAT
GGAGAGGAAG CATCAACATA TTTTCCTTTA GGAACGGCTC AATATCATAT AAATGCAGAC
ATAGCATATG CCTTTAAGCT TTACGTAGAT GTTAATGATG ATTTTCACTA TTTAAAGGAT
AAAGCAGCTG AGGTTTTATG TGAAACTGCA AGGGTTTGGG CTGATGTAGG ATCATTTTCT
GAGTATGTAG GAAATAAATA TTGTATTTGT GCTGTAACTG GGCCAGATGA GTACAATGCT
ATAGTTGATA ATAATTTTTA CACTAATCTT ATGGCCCGTG AAAACCTTAG AGATGCAATA
TGGGCATTAA ATAAGATTAA AGAAAAAGAT CAATTAGCTT ATGATAATTT GGTTAAAAAG
ATTGATTTAA AAGATGAAGA AATAGAATAT TGGAAAAAAA TAATTGAAAA TATGTATTTC
CCGTATGATG AGAAAAGAGG GGTGTATCCT TTAGATGATG GCTTTATGAA GAGGAAGCAT
TGGGATGATT CAAAAATACC TAAAGAAAAG AGACATCTTC TTTATGAAAA TTATCATCCT
CTATTTATAT TTAGACAAAG GATGTCAAAG CAAGCTGATG CAATTTTAGC AATGTACCTT
CATAGTAATC TTTTTAGTAT AGAGGAATTA AGAAAAAACT ATGATTTCTA TCAAGAAGTA
ACATTGCATC ATTCCTCTTT ATCAACTTGT ATATTTGGTA TTTTAGCTAG TCAAATTGGA
TATGATGAGG AGGCTTACAA GTATTTTTCA CAGTCAGCAA GAATGGACTT AGATGATTAT
CACAATAACT TTTATGCTGG AATTCATGCT GCAAATATGG CAGGAACCTG GCAAGGCATT
GTAAATGGTT TTGCAGGACT TAGAACTAAT AAAGGAATTT TAGAATTAAA TCCTACAATA
CCTAAGGAGT GGAATGCATA TAGTTTTAAA ATTTTCTATA AGAAGAATCT ATTAGAAATA
AAAATTTCTA AAGATGAAAT TGAAATAAGA CTATTAGAGG GCGAGAATTT AGAATTATAT
GTATATGGAG AAAAAGTTTA TCTTAAAAAT TTAAGTGAAA TAATAAAAAT ACCAGCTAAA
TAA
 
Protein sequence
MEDYREPIYK IEPWNITEEE FLLKNNYRNE TTFSLANGYI GTRGTFEEEY DFDVETGLEG 
NFVNGFYESE HIRYGEWNFG FPTESQSLLN LPNAKIIKLF IEDEEFSMLT GEIEDYKRVL
HMKEGRITRD LIWVSPKGKK IKISISRFVS FNNKNLMEIR YKVTPLNFSG NLKFISAIDA
NVENHTRKTN PLVDYGPFGK RLANDYIDSI KDELYYEGTT LNSELSIACG AVNKISAENF
IRKNFKNYEL FGVSYEFYAK ENKEIILDKF IAYSTSLDMN CKKLHGFIKT ILSDAKKKGY
IEAEREQKEY VEEFWRTADV IIEGDDALQQ GIRFNLFHLM QSAGRDGKTG MGAKGLSGEG
YEGHYFWDTE MYVLPVFVYT KPDLAKKLLD YRYFTLDKAR ERARVLGHDK GALYPWRTIN
GEEASTYFPL GTAQYHINAD IAYAFKLYVD VNDDFHYLKD KAAEVLCETA RVWADVGSFS
EYVGNKYCIC AVTGPDEYNA IVDNNFYTNL MARENLRDAI WALNKIKEKD QLAYDNLVKK
IDLKDEEIEY WKKIIENMYF PYDEKRGVYP LDDGFMKRKH WDDSKIPKEK RHLLYENYHP
LFIFRQRMSK QADAILAMYL HSNLFSIEEL RKNYDFYQEV TLHHSSLSTC IFGILASQIG
YDEEAYKYFS QSARMDLDDY HNNFYAGIHA ANMAGTWQGI VNGFAGLRTN KGILELNPTI
PKEWNAYSFK IFYKKNLLEI KISKDEIEIR LLEGENLELY VYGEKVYLKN LSEIIKIPAK