Gene CHU_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1842 
Symbolcel 
ID4185913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2160209 
End bp2161864 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content41% 
IMG OID638071841 
Productretaining beta-glycosidase 
Protein accessionYP_678451 
Protein GI110638242 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00229195 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTATCA GAACAATCTT TATTTTAATT GCCGCATGTT CAATCGCGAT TGCCTGCCGT 
AAAAAACATG ACGATGAATC CACATCCATT GATAGCAGGC AATTACATGC CAGTGGAATT
GATATTATTG ATGGCAGCGG TAAAAAAGTA TACTTGAGAG GCGTAGCATT TGGAAATGAA
GTATGGTCTA ATGCACCTAC CATTCCGACA ACGCATCATT CAGAAGAAGA TTATAAGCGC
GTGCGCGATA TGGGTATGAA TGCCATACGT TTTTACCTGA ATTATCAGAT TTTTGAAGAT
GATGCTACTC CGTATGTATA TAAATCTGCA GCGTGGGACT GGATCGATCA GAATATTGCC
TGGGCAAAAA AACATGACAT TTATTTAATT CTCAATATGC ATGTGCCCCA AGGCGGCTTT
CAATCTAACG GAGATGGTGA TGCGTTATGG AACAATCCCG AAAACCAGAA CCGGTTAAAG
GCGTTGTGGT TTAATATTGC CAAACGGTAT GCAAATGAAC CAACCATTGC CGGACTTGAT
CTGCTGAATG AGCCCGTAGT AACAACATCC ATTGATCAAT GGAAAAACTT TTCACAGTCA
ATTATTGATA CCATCCGTAC GGTAAACACC AACAGCATGA TTATTGTGGA GCGGGTGAAT
GCGATAGATG ATAACTGGTC AAATAATTCG GATATGAATT TTTTTGACCT GAATGACAAC
AATCTTGCAT ATGAATTTCA TTTTTATTTG CCGATGGAAT TCACTCATCA GGGGGCAAGC
TGGATCGGAG GAGGCAACAC ATTTCCGATC GGACAAACCT ATCCGGATGC CAACAGGGTT
TTTGTTAAAG GCAATTCTTT TTTTTACACC GCATCTTTTG CAAATGCCCG CATACCAACA
GGTACCTACG ATTGGATGGA ATATGCTGAA TCACCTAAGT ATAAAATACA GGATGAAAAA
ATAAAACTGG GCAAACCAAC AGTTGTAAAC CGTGCCAATA CAGGTAAGAT ATGGGTAGAT
GATATTGTAG TGAAGGAGTA TGATGCATCG GATAATTTTG TACGTAATGT ATTTGAAATA
GATCTGAATA CATTTGACGG CTGGTATTCA TGGAGTGAAA ACGGATCTGG TACTGCAGGC
GCTGACGCTG CTACAGGGCA TTCAAATTCA AACTCGCTAT ACATGCAGGG GACAACCGGC
GATGCCAATA TAAGCAGTAC TGCCTATCAG TTTATTCCCA AACAAGGTTA TTCCTACACC
ATCAGCGGCT GGGTAAAAGG AGAAGATGTG ACAGCTGGTT CTGCTGCCAT GTTCCGCCTG
GATTTTGAAC AGATTGCTGA TGGAGACAAT GTATATTCAT TGGGAAAAGA ATATTTAGCG
GCACAGGTAG ATCAATATTA TAAATGGACA CACATCAAAA ACAAGCCCTT ATTTTTAGGG
GAGTTCGGAG TGATTCAGTT TGGTTATGAA AATAATCTAG GCGGCTTGCA GTGGACGGGT
GATATGATTG ATATCCTTAA AGAACGGGAT GCCCATTTTA CATACCATGC CTATCATGAA
GATTCCTTCG GTATTTATAA AGGCTACGGT ACTCCTGTTG ATCCTTCAAC GGGCAATCAG
GCGCTCATTA ATTTATTCAA ATCAAAACTC CCTTAG
 
Protein sequence
MTIRTIFILI AACSIAIACR KKHDDESTSI DSRQLHASGI DIIDGSGKKV YLRGVAFGNE 
VWSNAPTIPT THHSEEDYKR VRDMGMNAIR FYLNYQIFED DATPYVYKSA AWDWIDQNIA
WAKKHDIYLI LNMHVPQGGF QSNGDGDALW NNPENQNRLK ALWFNIAKRY ANEPTIAGLD
LLNEPVVTTS IDQWKNFSQS IIDTIRTVNT NSMIIVERVN AIDDNWSNNS DMNFFDLNDN
NLAYEFHFYL PMEFTHQGAS WIGGGNTFPI GQTYPDANRV FVKGNSFFYT ASFANARIPT
GTYDWMEYAE SPKYKIQDEK IKLGKPTVVN RANTGKIWVD DIVVKEYDAS DNFVRNVFEI
DLNTFDGWYS WSENGSGTAG ADAATGHSNS NSLYMQGTTG DANISSTAYQ FIPKQGYSYT
ISGWVKGEDV TAGSAAMFRL DFEQIADGDN VYSLGKEYLA AQVDQYYKWT HIKNKPLFLG
EFGVIQFGYE NNLGGLQWTG DMIDILKERD AHFTYHAYHE DSFGIYKGYG TPVDPSTGNQ
ALINLFKSKL P