Gene CPF_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1103 
Symbol 
ID4203528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1259269 
End bp1261191 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content31% 
IMG OID638081984 
Productglycosy hydrolase family protein 
Protein accessionYP_695549 
Protein GI110799785 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00087575 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTTA AATTAGTTGG ATTAAATGAG GAAATGTTAG AAGCTGTCAA TGGATTAGAA 
GAACTAATTG ATTTTCAGCT TTATAATGAG AAGGGGAGTA ATAATGAAAA TTTAGAAATT
ATAAATGTTT CTAAGCTTCC TGAAGAATCA GAAAATGTAA TAGAAGTCCT TAGAGGAAAA
AATAATGAAA TAAGATATAA GAAAAAACAT CACTTTTTTA GGGCATTTAG TTTATATCTT
CAATTTCTTA AAAAGGGAGA GGAGAGTTTT AGAAAAAGTG AAAAAAGTTA TATTGATTCA
GTAGGAGCAA TGATAGATGC CTCAAGAAAT GCTGTCTATA GAGTTTCTGA GGTTAAAAAG
ATATTAGGAT ATATGGCATT AATGGGACAT AATAGGTGTA TGCTTTATAC TGAAGATACT
TATGAGATTG AGGGTTATCC ATACTTTGGT TATATGAGAG GAAGATATAC CAAAGAGGAA
TTAAGGGAAA TTGATGATTA TGGATACTCT TTAGGTATAG AGGTTGTACC TTGTATTCAA
ACTTTAGCTC ACTTAAAACA AACTTTAAGA TGGCCTTATG GGGAAGGGGT GAAAGACACT
CAAGATGTAC TTTTAGTAGG AGAGGAAAAG ACTTATAAGT TTATAGAGGC AATGATTTCT
TCTTTAAGAG AGTGCTTTAG AAGTAAAAAT ATTCATATAG GTATGGATGA GGCTTTTGAT
TTAGGAAGAG GCCATTATTT AACTAAACAT GGACATGTTC CTCACCAAGA GCTCATGGTA
GAGCATTTAA ATAAGGTTAA TGAAATAGCA AAAAAATATG ATTTTAAGCC AATGATTTGG
GATGATATGT TTTTAAGATG TGGAGCTCCA GATGGTGGAT ATTACGATTT AGATATAGTT
ATAACTCCTG AAATAGCAAA TAATATTCCA GAGGAAGTTT CCTTAGTTTA TTGGGATTAT
TATAATTCTG ATGAAGAAAA ATATAAGAAG CTTTTAGATA TTAGAGATGA TTTTAATAAT
AATATTATCT TTGCAGGAGG ATGCTGGAGG TGGAGTGGAT TTGCTCCAAA CTATTCAAAG
ACCTTTGAAA CAACAAATGC TGCCTTAAAT CAATGTAAGG CTAAAGGAAT AAAAGAAGTC
TTTGCAACGG CTTGGGGAGA TGATGGATCA GAAACGCCAA TATATTCAAT AATAGTAGGT
CTTATATTAT TTGGAGAGCA TGGCTATTAT AACAAGGTAG AGAAAGAATG GATAGATGAA
AGATGCAAAT CTTTAACAGG GCTTAGCATG GAGGACTTTA CTTCTTTAGA AGAATTAGAT
CTTGTTCCAT CAGTTAAAAT TCCTAATATG GAGGTTTGTA ATCCTTCAAA ATATATAGCT
TACCAAGATT TGCTTTTAGG AGCCTTTGAT AAGCATTTAG AGGGGCTAGA CTTAGAAGAA
CATTATATTA ATCTTTCTAA AAAATATGAG GAAATAGGAG AGAAAAGTGA GAGGTTTAAA
TTAATGTTTA CTATGTATTC TAAACTTGCT GCTTACCTTT CAGTTAAAAG TGAAATAGGA
TTAGAAATAA GAAAAGCTTA TCTAGAAAAG GATAAGGATG CATTAAGGCT TATAGCATAT
AACTTTATTC CAGAAATACA AGAAAAGCTA AAGAGTTTTC ATAAGAGTTT TAGGGATCTA
TGGTATAAAG AATGTAAAGG ACAAGGCTTT GAAGTTATGG ATATTAGACT TGGTGGAGTT
ATGGCAAGAT GCGATTCTGC TATGTATAGA ATAAAAGCTT ATCTTAAAGG AAATATTGAT
AAAATAGAAG AGCTAGAAGA GGAAAGACTT TATTTTTCTG AACATTTTGG GGGAGATGAT
TGTAAGCTTA TATGTTGCAA TGAATATCAA AAAATAGCAA CACAAAATAT TTTAAGTTGG
TAA
 
Protein sequence
MRVKLVGLNE EMLEAVNGLE ELIDFQLYNE KGSNNENLEI INVSKLPEES ENVIEVLRGK 
NNEIRYKKKH HFFRAFSLYL QFLKKGEESF RKSEKSYIDS VGAMIDASRN AVYRVSEVKK
ILGYMALMGH NRCMLYTEDT YEIEGYPYFG YMRGRYTKEE LREIDDYGYS LGIEVVPCIQ
TLAHLKQTLR WPYGEGVKDT QDVLLVGEEK TYKFIEAMIS SLRECFRSKN IHIGMDEAFD
LGRGHYLTKH GHVPHQELMV EHLNKVNEIA KKYDFKPMIW DDMFLRCGAP DGGYYDLDIV
ITPEIANNIP EEVSLVYWDY YNSDEEKYKK LLDIRDDFNN NIIFAGGCWR WSGFAPNYSK
TFETTNAALN QCKAKGIKEV FATAWGDDGS ETPIYSIIVG LILFGEHGYY NKVEKEWIDE
RCKSLTGLSM EDFTSLEELD LVPSVKIPNM EVCNPSKYIA YQDLLLGAFD KHLEGLDLEE
HYINLSKKYE EIGEKSERFK LMFTMYSKLA AYLSVKSEIG LEIRKAYLEK DKDALRLIAY
NFIPEIQEKL KSFHKSFRDL WYKECKGQGF EVMDIRLGGV MARCDSAMYR IKAYLKGNID
KIEELEEERL YFSEHFGGDD CKLICCNEYQ KIATQNILSW