Gene CHU_2041 details

Gene Information

Locus tag: CHU_2041
Symbol:
ID: 4186702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 2374220
End bp: 2378977
Gene Length: 4758 bp
Protein Length: 1585 aa
Translation table: 11
GC content: 45%
IMG OID: 638072041
Product: polyfunctional acetylxylan esterase/b-xylosidase/a-L-arabinofuranosidase, CBM9 module, glycoside hydrolase family 43 protein and carbohydrate esterase family 6 protein
Protein accession: YP_678646
Protein GI: 110638437
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
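
The coordinates and lengths listed above agree with each other, and the check is simple arithmetic. Below is a minimal Python sketch of that check. It assumes the start and end positions are 1-based and inclusive, and that the 4758 bp gene length includes the stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2374220, 2378977)
print(length)                  # 4758
print(protein_length(length))  # 1585
```

Under these assumptions, the 4758 bp span corresponds to 1586 codons, of which the last is the stop codon, leaving the 1585 residues reported for the protein.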


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTAA CGATATTTTC CAAAGTAATT TTCTTTTCTT GCGTTTTTTT CTTTTGCATC 
CAATCATATG CTCAAGATCC TAATTTTCAT ATTTACCTGA CTTTCGGGCA ATCTAATATG
GAAGGAAACG GAGTAATAGA AGCGCAGGAT CAAACCGCTG TAAATAGCAG GTTTCAGGTA
ATGGGTGCCG TAAATTGTAC AGGTACAAAA TCTTATACGA CAGGAAAGTG GACAACAGCT
ACAGCACCTA TTGTAAGATG TAACACAGGA CTTGGGCCAC TTGATTATTT TGGGCGCACA
ATGGTGTCAA ACCTGCCGGC AAATATAAAA GTAGGTGTTG TACCTGTTGC GATAGGAGGT
TGTGATATTG CCTTGTTTGA TAAGGTCAAT TATGGATCGT ATGTTGCAAC AGCTCCAAGC
TGGATGATTG GTACAATAAA TCAGTATGGC GGAAATCCGT ATGCACGTCT GGTGGAAGTT
GCAAAACTGG CACAGAAGGA TGGAGTTATT AAGGGAATAT TATTTCATCA GGGAGAAACA
AACAACGGAC AGCAGGATTG GCCGGCTAAA GTTAAGGCTA TTTATGATAA TCTGATTAAA
GATCTTGGCT TAGATCCAGC TAAAACTCCA TTTTTGGCTG GCGAATTAGT AACAACCGCA
CAAGGCGGCG CATGTGGCGG ACATAATTCT ATTATTGCAA AATTACCGAA TGTGATTCCG
AATGCTCATG TCGTTTCGGC CGCTGGTTTA CCGCACAAGG GAGATAACTT ACACTTCACT
CCGGCATCTT ATCGTACGTT TGGAGAACGT TATGCGCAAC TGATGTTGAC ATTGCCTGCT
TATTCGAATG CACAAACGGC GGCCACAAAT CCAATCATCA ATGCAGATGT TCCCGATATA
GCTATCGTTA GAGTAGGGAA TAATTACTAC ATGAGCAGTA CAACCATGCA CATGAATCCC
GGTGTACCCA TAATGAAGTC TACCGATTTA GTTAATTGGG ATATTGTAAA TTACTGCTAC
ACCACATTGT CTACCTCTGA CAGTTACAGC CTTGCGAATG GAAAGAATGA ATATGGTCAT
GGTTCATGGG CAAGTAGCAT TCGATATTTT AAAGGAATGT ATTATGTTAC GACATTTGCT
AATACAGGCA GAACGTATAT CTATAAAACA GCTAATATTG AAACCGGTCC GTGGACTGTT
TCTACGTTAA ATGCTTCCTA CCACGATTGC AATCTTTTCT TTGATGACGA TGATAGAGTG
TATCTTATAT ATGGACAGGG AGATATTAAA ATTATTGAAT TAACTGCAGA TGCTTCAGCG
ATTAAATCCG GAGGTACGAA TAAAACATTA ATTACAAATG CTGGTGCTGT AGCTGGCCCT
ATTGGCTTGA ATGCAGAAGG GTCACAGGTA TTGAAGCACA ATGGGTATTA CTATATTAAT
AATATTTGCT GGCCCAGCGG CGGTATGCGT ACGCAGATTA TTCATCGTTC TTCAACACTA
ACAGGTACGT ATGAAAGTAA AGTGATCTTA AAAGATCAGG GAGTTGCACA AGGTTCTTTT
ATAGAAACGC CAGCTGGTAA ATGGTATGCC TATTTATTTA AAGATGGAGG AGCCAGAGGC
AGGGTTCCAT ATCTTGTTCC GATGACATGG ACAAATGACT GGCCTGTTCT GAGTGCAGTG
CCAGCTACAC TTGATATTCC TCAGGGAACA GGAGGTATGC ATAATATTGT TTCCTCCGAT
GAATTTTCAC AAGCGGCTCC TCTTAAACTT GCGTGGCAAT GGAACCACAA TCCACAAAAT
AACTATTGGT CATTAACACA AAAAAGCGGC TATCTGCGGC TCACCAATGA GCGTACAGAT
CCGAACGTGC TGATGACAAC GAATACACTA ACGCAGCGAA CATTTGGACC CCAGTGTTCG
GGCTATACGG TAATAGATGT TTCCGGCATG AAGGATGGTG ATTATGCTGG TTTAGTAGCT
TTACAAAAAC AGTATGGTTA CGTAGGTGTT AAAATGACTG GTACAACAAA GTCTATTGTA
ATGGTCAATG GAAATGATGT TACGGGAACA CCTGCTCAAG TTGCAAGTGT TCCTTTAAAT
CAAAACATCA TATATCTTCG GATTGATATG GATTACAGGA ATCAGACAGA CAAAGCTTAT
TTCTATTATA GTCTGAATGG TACCACCTGG CAATCCATTG GCAGTACACT TCAAATGTCT
TATACCATTC CGCAATTTAT TGGTTATCGG TATGGGTTAT TTACTTATGC GAGTGTTTCA
GCCGGTGGAT ATGCTGATTT TGATTTTTTT AGAATTGGAT CAACGATTAC GGAGGCATCG
ACGGTTATTA CCACTCCTTC GCCGGTAGTA TCGCTTACCG CACCTGTAAA CAATACAGTT
TATACGGAAG GTGATAATAT AACGATCAAT GCCACGGCAA CGATCACAAG CGGGAGCATT
TCCAAAGTAG AATTTTATAA CGGAACAACG TTGTTAGGTA CAGATGCAAG TTCACCATAC
AGCTATACAA TCACAGCTGC AGCAGCAGGT ACCTATCCGG TCACTGCCAA AGCAACGAGT
GCAGCCAATG CAGTAACAAC GAGCACGGCA ATAAACATTC AGGTAGCAAA ACCTATTTAC
CAGACCGGTT CTGCACCCAC AATCGATGGA ACCGTTGACG GCTTGTGGAG CAATTTTCCA
TCCACAGGTA TCACAAAAAA CAATACCGGT ACGATCAGCT CAGGTACAGA TCTGTCGGGT
AACTGGAAAG CGATGTGGGA TGCGTCTAAT CTGTATGTGC TTGTTCAGGT AACCGATGAT
GTGAAGCGCA ACGATGGTGG AACGGATGTG TACAACGACG ATGGCGTTGA AGTATACATT
GATCTGGGCA ATACCAAAGC AACGACATAC GGCACCAACG ACCAGCAGTA CACGTTCCGC
TGGAACGATG TTACAGCGGC CTACGAGATC AACGGACATC CGGTAACAGG AATAACCAAA
GGCATCAGCA ATACAGCAAC CGGTTATATT GTGGAGGTGA GCATCCCGTG GAGCACCATT
GGCGGCACTG CTTCATTAAA TTCATTCCAG GGCTTTGAAG TCATGATCAA TGATGACGAT
GACGGAGGAG CAAGAGAAGG TAAGCTTGCC TGGGTTGCGT CTACAGATGA TACGTGGAGC
AATCCGGCTT TAATGGGAAC AGTTGTATTA AAAGGATTGA ATTGTACGGT ACCGGCAGCA
GCGATAACAG CAAGCACGGC AACCACATTC TGCTCCGGAG GCAGTGTAGT ATTGAATGCA
GGTACAGGCA CCGGATACAG CTATGTATGG AAGAACGGAG CAGCAACAAT AGCAGGAGCG
ACAAATTCAG GTTATACAGC CACCGCATCC GGCAGTTATA CGGTAACAGT AACAAACCCG
GGCGGCTGTT CAGCAACCTC AGCAGGGACT ACGGTGACGG TAAATGCCTT ACCGGTTTTA
ACGCAGTATG CACAGGTAGA TGGCGGAACC TGGAACCAGG TATCAGGCGC AACGGTGTGT
GCTGGCTCTT CGGTTGTACT GGGTCCTCAG CCGACAGTAA ATACAGGCTG GAGCTGGACA
GGTCCGAACG GTTACAGTGC ATCGGCCAGA GAGCTTAGGC TTACATCAGT ACAAACAAAT
CAGGGCGGTG TTTATACGGC AAGTTATACA GATGGAAATA CGTGTAAATC AACTTCTGTA
TTTACGTTAA CGGTAACTGC ACTACCGGCC GCAGCGATTA CGACAAGTAC ACCGACAACA
TTCTGCGCAG GCGGCAGCAC AACACTGACA GCAGGTTCAG GTGCATCCTA CAAATGGATG
AACGGCACGG TCGCAATCAC AGGAGCAACC GCACAGACCT ATACCGCAAC AGCCGCCGGA
AGCTATACGG TTGAAGTAAC GAATGCGGGT AACTGCAAAG CTACTTCAGC AGCAACAGTA
GTAACAGTAA CTGCACTGCC AACTGCTACA ATCACAGCAA CTGGTTCAAC AACGATTCCT
CAGGGCGGAA GTGTAGCATT ACAGGCGAAT GCAGGTTCAG CTTTGACCTA CAAATGGTTC
AACGGCACGG TCGCAATCAC AGGAGCAACC GCACAGACCT ATACCGCAAC GACCGCGGGA
AGCTATACGG TTGAAGTAAC AAATGCGGGT AACTGCAAAG CAACTTCAGC AGCAGCAACG
GTAAGCGTGG TTGCAAATCA GCCATCTGTT ATTACAATTA CTTCACCGGC ACCGAATGCT
GCAGTAACAG GAGCGATCGA CATTTCGGTG AATATCACAG ATGCGGATGG TAGTATAACC
CTTGTAGAGT TTTTAGCAGG CGATGATGTA ATCGGCACAG CAGCAGCAGC GCCGTATACG
TACACATGGG ACACTCCAAC GGCAGGATCT CATACAATTA CGGTTCGGGT AACAGACAGT
AACGGAGGCG TCACAACTTC TGGACCGGTA ACAGTTACAT CGGAATCCAT CACAACAGGC
GTGCAGGTAT TGAATACATT AAATGCAGCT GTATATCCGA ATCCATCAAA CGGCATCGTA
TTTATTGATA CAGATGCAGA CTTATCAGAT GCAAGCTTTA CACTGATAGA TGTGTTGGGT
AAAGAAGGAA CTGTTTTTTC AACAGCAACC GGCAACGGAG CGATGATAGA TGTAAGCAGT
CTGGCGGGTG GCACTTATGT GTTGATTATC AAACAGGATC ATTCAATTCT GAGAAAGAAA
ATTACAGTGA TAAAATAA
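
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be removed before any computation such as GC content. Below is a minimal Python sketch, assuming the sequence has been pasted in as plain text. The helper names are hypothetical, and the way the database itself computes or rounds its GC figure is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAACGTAA CGATATTTTC CAAAGTAATT TTCTTTTCTT GCGTTTTTTT CTTTTGCATC"
seq = clean_sequence(first_line)
print(len(seq))              # 60
print(seq.startswith("ATG"))  # True
```

Passing the full cleaned sequence to gc_fraction gives a value that can be compared with the GC content field listed under Gene Information.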
 
Protein sequence
MNVTIFSKVI FFSCVFFFCI QSYAQDPNFH IYLTFGQSNM EGNGVIEAQD QTAVNSRFQV 
MGAVNCTGTK SYTTGKWTTA TAPIVRCNTG LGPLDYFGRT MVSNLPANIK VGVVPVAIGG
CDIALFDKVN YGSYVATAPS WMIGTINQYG GNPYARLVEV AKLAQKDGVI KGILFHQGET
NNGQQDWPAK VKAIYDNLIK DLGLDPAKTP FLAGELVTTA QGGACGGHNS IIAKLPNVIP
NAHVVSAAGL PHKGDNLHFT PASYRTFGER YAQLMLTLPA YSNAQTAATN PIINADVPDI
AIVRVGNNYY MSSTTMHMNP GVPIMKSTDL VNWDIVNYCY TTLSTSDSYS LANGKNEYGH
GSWASSIRYF KGMYYVTTFA NTGRTYIYKT ANIETGPWTV STLNASYHDC NLFFDDDDRV
YLIYGQGDIK IIELTADASA IKSGGTNKTL ITNAGAVAGP IGLNAEGSQV LKHNGYYYIN
NICWPSGGMR TQIIHRSSTL TGTYESKVIL KDQGVAQGSF IETPAGKWYA YLFKDGGARG
RVPYLVPMTW TNDWPVLSAV PATLDIPQGT GGMHNIVSSD EFSQAAPLKL AWQWNHNPQN
NYWSLTQKSG YLRLTNERTD PNVLMTTNTL TQRTFGPQCS GYTVIDVSGM KDGDYAGLVA
LQKQYGYVGV KMTGTTKSIV MVNGNDVTGT PAQVASVPLN QNIIYLRIDM DYRNQTDKAY
FYYSLNGTTW QSIGSTLQMS YTIPQFIGYR YGLFTYASVS AGGYADFDFF RIGSTITEAS
TVITTPSPVV SLTAPVNNTV YTEGDNITIN ATATITSGSI SKVEFYNGTT LLGTDASSPY
SYTITAAAAG TYPVTAKATS AANAVTTSTA INIQVAKPIY QTGSAPTIDG TVDGLWSNFP
STGITKNNTG TISSGTDLSG NWKAMWDASN LYVLVQVTDD VKRNDGGTDV YNDDGVEVYI
DLGNTKATTY GTNDQQYTFR WNDVTAAYEI NGHPVTGITK GISNTATGYI VEVSIPWSTI
GGTASLNSFQ GFEVMINDDD DGGAREGKLA WVASTDDTWS NPALMGTVVL KGLNCTVPAA
AITASTATTF CSGGSVVLNA GTGTGYSYVW KNGAATIAGA TNSGYTATAS GSYTVTVTNP
GGCSATSAGT TVTVNALPVL TQYAQVDGGT WNQVSGATVC AGSSVVLGPQ PTVNTGWSWT
GPNGYSASAR ELRLTSVQTN QGGVYTASYT DGNTCKSTSV FTLTVTALPA AAITTSTPTT
FCAGGSTTLT AGSGASYKWM NGTVAITGAT AQTYTATAAG SYTVEVTNAG NCKATSAATV
VTVTALPTAT ITATGSTTIP QGGSVALQAN AGSALTYKWF NGTVAITGAT AQTYTATTAG
SYTVEVTNAG NCKATSAAAT VSVVANQPSV ITITSPAPNA AVTGAIDISV NITDADGSIT
LVEFLAGDDV IGTAAAAPYT YTWDTPTAGS HTITVRVTDS NGGVTTSGPV TVTSESITTG
VQVLNTLNAA VYPNPSNGIV FIDTDADLSD ASFTLIDVLG KEGTVFSTAT GNGAMIDVSS
LAGGTYVLII KQDHSILRKK ITVIK
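
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Below is a minimal Python sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the start codons it allows. The codon table is built by hand here and does not come from any library. Unknown codons are reported as X, which is a choice made for this sketch.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G), shared by table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGAACGTAA CGATATTTTC CAAAGTAATT"))  # MNVTIFSKVI
print(translate("ATTACAGTGA TAAAATAA"))               # ITVIK
```

The two outputs match the first ten and last five residues of the protein sequence listed above.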