Gene CHU_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2044 
Symbol 
ID4186705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2386914 
End bp2391338 
Gene Length4425 bp 
Protein Length1474 aa 
Translation table11 
GC content45% 
IMG OID638072044 
Productbeta-xylosidase/alpha-L-arabinofuranosidase- like protein 
Protein accessionYP_678649 
Protein GI110638440 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAATT TTCTACGAAG CTCAATTAGT CAGCAGAATT TTAAATCACT TATTGATGTA 
TTCATAAAAG CCAGAGTGCT CCTTTGTCTT TCAATACTTC TATTTTATGG AAACAATGTG
ATTGCTCAAC TTTCAAATCA ACCAGGTACG AATGGCATCC AAACCTATGT GAATCCCATT
CTTCCCGGTG ATCATCCGGA TCAGACATTG ATGCGCGTGG GTGCTAATTT TTATTCTACC
GGTTCTAATT TTCATTTTAC ACCGTATCTG CCGATTCTTC ATTCAACAGA TTTGATGCAC
TGGGAAGTAA TTGCACGTGT AATTCCGCCA ACATCCAGCA TTCCTAATAA TGATACACCT
TCCGGAGGTA CCTGGCAAGG GGCCTTAGCT CAATTTAATA ATAAGTTTTG GGTATATTTT
TCGAATAACG CAGGTGGCGG ACAATATTTC TGTAACGCAA CAAACATGGC TGGTCCATGG
TCTGCACCGG TAAAAGTAAA TACAAATACA GGCGTGTATG GCTATGACAA CTCTATTTTT
GTGGATGATG ACGGAACTCC TTATATGCTG TTGAAAAACG GACAGGATCT GAATGGTCTT
CAAAAATTAG GTATGGATGG ACAACCATCC GGACAAGCTT TGAACATGAA CTGGATCAAT
GCAAATGGCC ATCCATACAG CTGGGCAGAA GGCCCGGTGA TGTGTAAACG CAATGGACGC
TATTATATGT TTGTAGCTGG TAACGTAACA GGCGGACAAT ATGTATTGAG TTCTGCAAAA
CTTTCGAATG TAGAATCCGA TTGGACGCGA CATGGTAATT TCTGGCAAAC TTCAACAGGT
TCCGGTGGTT TTACAGGTCC CAACCATGTT ACACAACCCA TAAAATTAGA CGACGGAACA
TGGTGGTGCC TTTCGCATGC CTACGATAAC GGAGGCTGGG AAGGACAAGG CAGACAAAGT
CATCTGCACC AGGTTATATG GGATGGAAAT GGCGTGCCTA AAGGCGTTCC GGTAAGTGTT
AATCCGGTAC AGGGGCCAAA TCTGCCAAGC AGTAATATTG ATTATGAATT CATCCGGTCT
GATTATTTTG AATCTACAAC ATTAAGTTTG AATTGGCATT TTTTAAGTAA GGTCTATGCC
AATGCATCTA AATATAGTTT AAGCGAACGC CCGGGCTATA TGCGTCTAAA ACCGGGAAGC
GGCTACACCC ATCTATTACA AAAAGATAAA GGGAAATATT ATTCATTAAC TACAAAGGTA
GATCTGAATG CTACTGCCAA TGGGCAACAA GCAGGTATCC GAATGACCAG CGGTAATGAC
GATGTCACCT TTACATTATA TACAGGCTAT AACAACGGTA AAAAAATTGG CATGGCGTTT
CAAGGCGTAA CAACTGAAGT TAACAATGCC ATTGGTACTA CGGTATGGCT GCGCGTAGAG
CGTGCCCTGC ATATACTGAC ATGTTACTAC AGCGCAGATG GTTTAGTGTG GACACAGCTA
GGAACAAAGG ATGTATCTAA TATGGATAAA TCACAAACCG ATTATAATAA ATGGGTGGGT
ACTTCTGTAG GTCTTTATGC ATCTGCTGCA TCTGCTGATT TCGATCAGTT TTCATACCGG
TATGGTTTTG TACCGGTTCG TGTAGAAGGA AGAAACAACT GGTATGGTGT AACATTTGCA
ACCAGAACAC CTGGTCGCGT AGTAACAAAT AGTACATCCG GAGATTGGCT TATGCTGGCT
GGCGTTGACT TAGGTTCGGG CAGCAGAGTT ACTACAGGTA TTGAAATAAA TGCATCTTCA
GCAAATAGCA ATGGAAGCCT TGAAGTATGG CTCGATAACA TTGGTGGTAC AGGAACAAAA
GTATCCACAA TAACGATTCC GAATACAGGA GGTAATGATG TATGGACGAA TGTAACGGGA
AGTTTTAATT CTTCGGGTCA ACATGATGTG TATTTAAGAT GGGTTGGCGG TGCAAATAGT
TTCAGTGTGA ATACCATCCG TTTTTTATCC GGAGCAGCGG TGCCCGTTCC GGTAGTAACG
CTTACAGCAC CTGTAAATAA TACAGTGTAT ACGGAAGGTG ATAATATAAC GATCAATGCC
ACGGCAACGA TCACAAGCGG GAGCATTTCC AAAGTAGAAT TTTATAACGG AACAACGTTG
TTGGGTACAG ATGCAAGTTC ACCATACAGC TATACAATCA CAGCAGCAGC AGCAGGAACA
TATCCGGTCA CTGCCAAAGC AACGAGTGCA GCCAATGCAG TAACAACGAG CACGGCAATA
AACATTCAGG TAGCAAAACC TATTTACCAG ACCGGTTCTG CACCCACAAT CGATGGAACC
GTTGACGGCT TGTGGAGCAA TTTTCCATCC ACAGGTATCA CAAAAAACAA TACCGGTACG
ATCAGCTCAG GTACAGATCT GTCGGGTAAC TGGAAAGCGA TGTGGGATGC GTCTAATCTG
TATGTGCTTG TTCAGGTAAC CGATGATGTG AAGCGCAACG ATGGTGGAAC GGATGTGTAC
AACGACGATG GCGTTGAAGT ATACATTGAT CTGGGCAATA CCAAAGCAAC GACATACGGC
ACCAACGACC AGCAGTACAC GTTCCGCTGG AACGATGTTA CAGCGGCCTA CGAGATCAAC
GGACATCCGG TAACAGGAAT AACCAAAGGC ATCAGCAATA CAGCAACCGG TTATATTGTG
GAGGTGAGCA TCCCTTGGTC TACCATTGGC GGCACGGCTT CATTAAATTC ATTCCAGGGC
TTTGAAGTCA TGATCAATGA TGACGATGAC GGAGGAGCAA GAGAAGGTAA GCTTGCCTGG
GTTGCGTCTA CAGATGATAC GTGGAGCAAT CCGGCTTTAA TGGGAACAGT TGTATTAAAA
GGATTGAATT GTACGGTACC GGCAGCAGCG ATAACGGCAA GCACGGCAAC CACATTCTGC
TCCGGAGGCA GTGTAGTATT GAATGCAGGT ACAGGCACCG GATACAGCTA TGTATGGAAG
AACGGTACAG CAACAATAGC AGGAGCGACA AATTCAGGTT ATACAGCCAC CGCATCGGGC
AGTTATACGG TAACAGTAAC AAACCCGGGC GGCTGTTCAG CAACCTCAGC AGGGACTACG
GTGACGGTAA ATGCCTTACC GGTTTTAACG CAGTATGCAC AGGTAGATGG CGGAACCTGG
AACCAGGTAT CAGGCGCAAC GGTGTGTGCT GGCTCTTCGG TTGTACTGGG GCCTCAGCCG
ACAGTAAATA CAGGCTGGAG CTGGACAGGT CCGAACGGTT ACAGTGCATC GACCAGAGAG
ATTACGTTAA CTGGAGTTAC ACCAACACAA GGAGGTATTT ATACGGCAAG TTATACAGAT
GGAAATACGT GTAAATCAAC TTCTGTATTT ACGTTAACGG TAACTGCACT GCCGGCCGCA
GCGATTACGA CAAGTACACC GACAACATTC TGCGCAGGCG GCAGCACAAC ACTGACAGCA
AGTTCAGGTG CATCCTACAA ATGGCTGAAC GGCACGGTCG CAATCACAGG AGCAACTGCA
CAGACCTATA CCGCAACAGC CGCGGGAAGC TATACCGTTG AGGTAACGAA TGCGGGTAAC
TGCAAAGCTA CTTCAGCAGC AGCAGTAGTA ACAGTAACTG CACTGCCAAC TGCTACAATC
ACAGCAACTG GTTCAACAAC GATTCCTCAG GGCGGAAGTG TAGTATTACA GGCGAATGCA
GGTTCAGCTT TGACCTACAA ATGGTTCAAC GGCACGGTCG CAATCACAGG AGCAACAGCA
CAGACCTATA CCGCAACAGC GGCGGGAAGC TATACGGTTG AAGTAACAAA TGCGGGTAAC
TGCAAAGCAA CTTCAGCAGC AGCAACGGTA AGCGTGGTTG CAAATCAGCC ATCGGTTATT
ACAATTACTT CACCGGCACC GAATGCTGCA GTAACAGGAG CGATTGATAT TTCGGTGAAT
ATCACAGATG CGGATGGGAA TATAACCCTT GTAGAGTTTT TAGCAGGCGA TGATGTAATC
GGCACAGCAG CAGCAGCGCC GTATACGTAC ACATGGGACA CTCCAACGGC AGGATCTCAT
ACGATTACGG TTCGAGTAAC AGACAGTAAC GGAGGCGTCA CAACTTCGGC ACCGGTAACA
GTTACATCGG AATCCATCAC AACAGGCGTG CAGGCATTGA ATACATTGAA TGCAGCTGTA
TATCCGAATC CATCAAACGG CATCGTATTT ATTGATACAG ATGCAGACTT ATCAGATGCA
AGCTTTACAC TGATAGATGT GTTGGGTAAA GAAGGAACTG TTTCTTCAAC AGCAACCGGA
AACGGAGCGA TGATAGATGT GAGCAGTCTG GCGGGTGGCA CTTATGTGCT GATTATCAAA
CAGGATCATT CAATTCTGAG AAAGAAAATT ACAGTAATCA AGTAG
 
Protein sequence
MGNFLRSSIS QQNFKSLIDV FIKARVLLCL SILLFYGNNV IAQLSNQPGT NGIQTYVNPI 
LPGDHPDQTL MRVGANFYST GSNFHFTPYL PILHSTDLMH WEVIARVIPP TSSIPNNDTP
SGGTWQGALA QFNNKFWVYF SNNAGGGQYF CNATNMAGPW SAPVKVNTNT GVYGYDNSIF
VDDDGTPYML LKNGQDLNGL QKLGMDGQPS GQALNMNWIN ANGHPYSWAE GPVMCKRNGR
YYMFVAGNVT GGQYVLSSAK LSNVESDWTR HGNFWQTSTG SGGFTGPNHV TQPIKLDDGT
WWCLSHAYDN GGWEGQGRQS HLHQVIWDGN GVPKGVPVSV NPVQGPNLPS SNIDYEFIRS
DYFESTTLSL NWHFLSKVYA NASKYSLSER PGYMRLKPGS GYTHLLQKDK GKYYSLTTKV
DLNATANGQQ AGIRMTSGND DVTFTLYTGY NNGKKIGMAF QGVTTEVNNA IGTTVWLRVE
RALHILTCYY SADGLVWTQL GTKDVSNMDK SQTDYNKWVG TSVGLYASAA SADFDQFSYR
YGFVPVRVEG RNNWYGVTFA TRTPGRVVTN STSGDWLMLA GVDLGSGSRV TTGIEINASS
ANSNGSLEVW LDNIGGTGTK VSTITIPNTG GNDVWTNVTG SFNSSGQHDV YLRWVGGANS
FSVNTIRFLS GAAVPVPVVT LTAPVNNTVY TEGDNITINA TATITSGSIS KVEFYNGTTL
LGTDASSPYS YTITAAAAGT YPVTAKATSA ANAVTTSTAI NIQVAKPIYQ TGSAPTIDGT
VDGLWSNFPS TGITKNNTGT ISSGTDLSGN WKAMWDASNL YVLVQVTDDV KRNDGGTDVY
NDDGVEVYID LGNTKATTYG TNDQQYTFRW NDVTAAYEIN GHPVTGITKG ISNTATGYIV
EVSIPWSTIG GTASLNSFQG FEVMINDDDD GGAREGKLAW VASTDDTWSN PALMGTVVLK
GLNCTVPAAA ITASTATTFC SGGSVVLNAG TGTGYSYVWK NGTATIAGAT NSGYTATASG
SYTVTVTNPG GCSATSAGTT VTVNALPVLT QYAQVDGGTW NQVSGATVCA GSSVVLGPQP
TVNTGWSWTG PNGYSASTRE ITLTGVTPTQ GGIYTASYTD GNTCKSTSVF TLTVTALPAA
AITTSTPTTF CAGGSTTLTA SSGASYKWLN GTVAITGATA QTYTATAAGS YTVEVTNAGN
CKATSAAAVV TVTALPTATI TATGSTTIPQ GGSVVLQANA GSALTYKWFN GTVAITGATA
QTYTATAAGS YTVEVTNAGN CKATSAAATV SVVANQPSVI TITSPAPNAA VTGAIDISVN
ITDADGNITL VEFLAGDDVI GTAAAAPYTY TWDTPTAGSH TITVRVTDSN GGVTTSAPVT
VTSESITTGV QALNTLNAAV YPNPSNGIVF IDTDADLSDA SFTLIDVLGK EGTVSSTATG
NGAMIDVSSL AGGTYVLIIK QDHSILRKKI TVIK