Gene CHU_2852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2852 
Symbol 
ID4183709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp3254720 
End bp3258178 
Gene Length3459 bp 
Protein Length1152 aa 
Translation table11 
GC content46% 
IMG OID638072843 
Productbeta-glycosidase-like protein 
Protein accessionYP_679442 
Protein GI110639233 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.359276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0739949 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATAACA TACGCAGCCC CAGCACTGTT TTCAATAAAT GGAATATGAT CCTGAACCCA 
ACTAACCGAA GCTCATTTAT GAAAAAAAGA TTACTCTCCG CTTTTATTTT TGTTTCGCTG
GTGTTTACAA CTGTCTGTCA TCAAACAGTT GCTCAGACAC AACCATTCCC GGCAAACAAA
ACCTACTCGT ATGGTACCAT GCCTGGCAAT AAAAATAGCC AGGATGCTAT CAATAACTAC
AATACCTGGA AAACGAATTA TGTTGAAGCC TGTTCCAACG GAAGGTTCAG AGTAAAATTT
GACAACAGAA GCCAGACGGT ATCAGAAGGT ATCGCTTACG GGATGTTATT AGCTGCTTAT
TCAGGCGACA GAACATTATT TGACGGTCTT TGGAATTATT ATAAAGATAA CCGTAACGGA
AACGGTGTAA TGAACTGGAA AATCAATGGC TGTTCGGGTA CTGCAGGTGC TAACGGTGCT
ACTGACTCTG AAGTGGATGC GGCTATGGCC CTTATCGTTG CAGACTATCA GTGGAGTACT
TCAGGCTCCA TCAATTATAA AAATGATGCA AAGGCATTGA TCACTGCCAT CAAAAATTTT
GAGGTAGAAA GCGGATCGTA TGTGTTAAAG CCAGGCGACC AGTTTGGCGG CAGTAGCCTC
ACAAACCCTT CGTACTTTGC GCCGGGTTAT TTCAGAACGT TTGCAACGTA TATGAATGAT
ACCTTCTGGA ACAATGTAGC CGATAAATGT TATGTCGTTA TTAATAATAA CCTGTCTGTA
AATAATGCCG CAGGCGGTTT AGTGTCTGAC TGGTGTACAG CATCCGGCAG TTATTCAAGT
GCAGCAGGCG GTTATGCAAA CGGCGGAAGA AATTACTCAT ACGATGCAGC CCGCACACCA
TGGAGAATTG CAGTAGATTA TGTTTGGTAT GGCAATGCAA GTGCAAAAAC GTATTCTAAA
AAATCTTCCG ATTTCGTTCG CGTAAACCTG GGCGGTTCTC AAAATGTAAA AGATGGTTAC
AGCCAGAACG GTTCTGCTGT AAGTAACTAC CACAACTCAA CATTTGTTGG CGCTTTTGCT
GCAGCAGCGA TGGCAGGCGA AAATCAATCG CACCTTGACA ATTCCTATTC CGATCTGAAA
GGCATCAACG ATGCGAATTC TTATTTTAAC CAGACATTAA AAACACTTTA CCTGTTTTTA
TTAACAGGTA ACTTTTATTT ACCGGGCTCA GGCACTGTAG TGCCTCCTGT AAACGCTGCA
CCTACGGTTT CGCTAACGGC TCCTTCAAAC AACGCTGCCT TCAATGCTCC GGCTTCGGTA
ACGCTTACAG CAAATGCAGC CGATGCAGAC GGTACTATTG CGAAGGTTGA ATTTTTCAAT
GGTTCTACCT TATTGAATAC AGATGCAAGC GCACCGTATT CCTTCAACTG GACGAATGTT
GCAGCGGGCA ATTACACCAT CACAGCAAAA GCAACGGATA ACGCCGGTGC AGTAACAACT
TCGGCTGCGG TATCTATCAC GGTAACAGCT GCGGCTAACG CTGCTCCTAC GGTTTCATTG
ACAGCTCCTT CAAACAACGC TGCCTTCAAT GTTCCGGCTT CGGTAACGCT TACAGCAAAT
GCAGCCGATG CAGACGGTAC TATTGCGAAG GTTGAATTTT TCAATGGTTC TACCTTATTG
AATACAGATG CAAGCGCACC GTATTCCTTC ACCTGGACGG GCGTTGCAGC GGGCAGCTAC
ACCATCACAG CAAAAGCAAC GGATAACGCC GGTGCAGTAA CAACTTCGGC TGCGGTATCT
ATCACGGTAA CAGCTGCGGC TAACGCTGCA CCTACGGTTT CGTTAACGGC TCCTTCAAAC
AACGCTGCCT TCACTGCTCC GGCTTCGGTA ACACTTACAG CAAATGCAGC GGATGCAGAT
GGTACTATTG CCAAAGTCGA ATTCTTCAAC GGTTCAACGT TGTTAAGTAC AGATGCAAGC
GCACCGTATT CCTTCAACTG GACGGGTGTT GCAGCGGGTA ATTACACGAT CACAGCAAAA
GCAACGGATA ACACCGGTGC AGTAACAACT TCGGCAATCG TGTCTATTAC GGTAACAGCT
GCGGCTAACG TTGCTCCTAC GGTTTCATTA ACAGCTCCTT CAAACAATGC TGCCTTCACT
GCTCCGGCTT CGGTAACGCT TACAGCAAAT GCAGCCGATG CAGATGGTAC TATTGCCAAA
GTCGAATTCT TCAACGGTTC AACGTTGTTA AGTACAGATG CAAGCGCACC GTATTCCTTC
ACCTGGACAA ATGTTGCAGC TGGGAACTAT ACGATCACAG CAAAAGCAAC GGATAACGCC
GGTGCAGTAA CAACTTCGGC AACCGTGTCT ATTACGGTAA CAGCTGCGGC TAACGCTGCA
CCTACGGTTT CATTAACAGC TCCTACAAAC AACGCTGCCT TCACTGCTCC GGCTTCGGTA
ACGCTTACAG CAAATGCAGC GGATGCAGAT GGTACTATTG CCAAAGTCGA ATTCTTCAAC
GGTTCAACAT TGTTAACAGG TTCTGTAAAT ACTTCTGCTC CCTACACGTT CACCTGGACG
GGCGTTGCAG CGGGCAGCTA CACCATCACA GCAAGAGCAA CCGATAATAC CGGTGCAGTA
ACAACTTCAG CCGCGGTGTC TATCACGGTA ACGGCTGCTC CGATTGAAAA CCCTGGTAAC
GACTGTATAA CCGAAGCGGT ACCAGTAGCT GCGCAATGGG TTGTCAGAAA CAGCTGGACA
GATCAGAATA TGGGATCAAA AGCAGTTTCT ACTGCAGACG CATTAAATAT TAAGCACCGC
CAATGGGGTA ATCCGGAACT ATGGGCGATT GAAACAGGCA AAGCAATCAG CGTTGTAAGC
GGACAATCTT ACACTGTGTC TTTTGACTTT AAAAACGATG CTCAAACGCC TGTTACCAGT
CTTGAAATTG GTTTTGCAAC AGCTGAAGCA TGGAATGGTG CAACGCTTGA CCAACCTGCG
GTTACTGTTT CAGGAAGTAT TCCTGCTTCT TTCACAACTA AAACAGTAAC CATTACAGCT
GCAGCAACAG GTACTATTTA CCTGGCATAT AAATTAAAAT TAAACGGGCA GCCGAATAAT
GAAGTAAATG TATTTATCAA AAACATTTCC GTTTGTTCTT CCGCTGCTGC TTCTTCTTCC
GCTGCACGTC CGGCTGCCCC GGCAGCTTCT AATGAAGTGA ATGATTTGTT AATGGGTGCA
AACCCGTTTG CTGATCAGAC GACAGTAGAA ATTCCATACG CATCAACTAC ATCGGTTCAT
CTAATCATGA GGGATATGAA CGGTCTTACG GTATGGGAAT CCTATTCGCT GCAAACAAAT
CAGAAAATCT ACCTTGGCTC TGGCCTGCCG ATCGGTACGT ATCTGGTAAC AGTATCTTAT
GATGGTAACA GCAAAACATT CCGGTTATTG AAATACTAA
 
Protein sequence
MNNIRSPSTV FNKWNMILNP TNRSSFMKKR LLSAFIFVSL VFTTVCHQTV AQTQPFPANK 
TYSYGTMPGN KNSQDAINNY NTWKTNYVEA CSNGRFRVKF DNRSQTVSEG IAYGMLLAAY
SGDRTLFDGL WNYYKDNRNG NGVMNWKING CSGTAGANGA TDSEVDAAMA LIVADYQWST
SGSINYKNDA KALITAIKNF EVESGSYVLK PGDQFGGSSL TNPSYFAPGY FRTFATYMND
TFWNNVADKC YVVINNNLSV NNAAGGLVSD WCTASGSYSS AAGGYANGGR NYSYDAARTP
WRIAVDYVWY GNASAKTYSK KSSDFVRVNL GGSQNVKDGY SQNGSAVSNY HNSTFVGAFA
AAAMAGENQS HLDNSYSDLK GINDANSYFN QTLKTLYLFL LTGNFYLPGS GTVVPPVNAA
PTVSLTAPSN NAAFNAPASV TLTANAADAD GTIAKVEFFN GSTLLNTDAS APYSFNWTNV
AAGNYTITAK ATDNAGAVTT SAAVSITVTA AANAAPTVSL TAPSNNAAFN VPASVTLTAN
AADADGTIAK VEFFNGSTLL NTDASAPYSF TWTGVAAGSY TITAKATDNA GAVTTSAAVS
ITVTAAANAA PTVSLTAPSN NAAFTAPASV TLTANAADAD GTIAKVEFFN GSTLLSTDAS
APYSFNWTGV AAGNYTITAK ATDNTGAVTT SAIVSITVTA AANVAPTVSL TAPSNNAAFT
APASVTLTAN AADADGTIAK VEFFNGSTLL STDASAPYSF TWTNVAAGNY TITAKATDNA
GAVTTSATVS ITVTAAANAA PTVSLTAPTN NAAFTAPASV TLTANAADAD GTIAKVEFFN
GSTLLTGSVN TSAPYTFTWT GVAAGSYTIT ARATDNTGAV TTSAAVSITV TAAPIENPGN
DCITEAVPVA AQWVVRNSWT DQNMGSKAVS TADALNIKHR QWGNPELWAI ETGKAISVVS
GQSYTVSFDF KNDAQTPVTS LEIGFATAEA WNGATLDQPA VTVSGSIPAS FTTKTVTITA
AATGTIYLAY KLKLNGQPNN EVNVFIKNIS VCSSAAASSS AARPAAPAAS NEVNDLLMGA
NPFADQTTVE IPYASTTSVH LIMRDMNGLT VWESYSLQTN QKIYLGSGLP IGTYLVTVSY
DGNSKTFRLL KY