Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_1842 |
Symbol | cel |
ID | 4185913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 2160209 |
End bp | 2161864 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638071841 |
Product | retaining beta-glycosidase |
Protein accession | YP_678451 |
Protein GI | 110638242 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00229195 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTATCA GAACAATCTT TATTTTAATT GCCGCATGTT CAATCGCGAT TGCCTGCCGT AAAAAACATG ACGATGAATC CACATCCATT GATAGCAGGC AATTACATGC CAGTGGAATT GATATTATTG ATGGCAGCGG TAAAAAAGTA TACTTGAGAG GCGTAGCATT TGGAAATGAA GTATGGTCTA ATGCACCTAC CATTCCGACA ACGCATCATT CAGAAGAAGA TTATAAGCGC GTGCGCGATA TGGGTATGAA TGCCATACGT TTTTACCTGA ATTATCAGAT TTTTGAAGAT GATGCTACTC CGTATGTATA TAAATCTGCA GCGTGGGACT GGATCGATCA GAATATTGCC TGGGCAAAAA AACATGACAT TTATTTAATT CTCAATATGC ATGTGCCCCA AGGCGGCTTT CAATCTAACG GAGATGGTGA TGCGTTATGG AACAATCCCG AAAACCAGAA CCGGTTAAAG GCGTTGTGGT TTAATATTGC CAAACGGTAT GCAAATGAAC CAACCATTGC CGGACTTGAT CTGCTGAATG AGCCCGTAGT AACAACATCC ATTGATCAAT GGAAAAACTT TTCACAGTCA ATTATTGATA CCATCCGTAC GGTAAACACC AACAGCATGA TTATTGTGGA GCGGGTGAAT GCGATAGATG ATAACTGGTC AAATAATTCG GATATGAATT TTTTTGACCT GAATGACAAC AATCTTGCAT ATGAATTTCA TTTTTATTTG CCGATGGAAT TCACTCATCA GGGGGCAAGC TGGATCGGAG GAGGCAACAC ATTTCCGATC GGACAAACCT ATCCGGATGC CAACAGGGTT TTTGTTAAAG GCAATTCTTT TTTTTACACC GCATCTTTTG CAAATGCCCG CATACCAACA GGTACCTACG ATTGGATGGA ATATGCTGAA TCACCTAAGT ATAAAATACA GGATGAAAAA ATAAAACTGG GCAAACCAAC AGTTGTAAAC CGTGCCAATA CAGGTAAGAT ATGGGTAGAT GATATTGTAG TGAAGGAGTA TGATGCATCG GATAATTTTG TACGTAATGT ATTTGAAATA GATCTGAATA CATTTGACGG CTGGTATTCA TGGAGTGAAA ACGGATCTGG TACTGCAGGC GCTGACGCTG CTACAGGGCA TTCAAATTCA AACTCGCTAT ACATGCAGGG GACAACCGGC GATGCCAATA TAAGCAGTAC TGCCTATCAG TTTATTCCCA AACAAGGTTA TTCCTACACC ATCAGCGGCT GGGTAAAAGG AGAAGATGTG ACAGCTGGTT CTGCTGCCAT GTTCCGCCTG GATTTTGAAC AGATTGCTGA TGGAGACAAT GTATATTCAT TGGGAAAAGA ATATTTAGCG GCACAGGTAG ATCAATATTA TAAATGGACA CACATCAAAA ACAAGCCCTT ATTTTTAGGG GAGTTCGGAG TGATTCAGTT TGGTTATGAA AATAATCTAG GCGGCTTGCA GTGGACGGGT GATATGATTG ATATCCTTAA AGAACGGGAT GCCCATTTTA CATACCATGC CTATCATGAA GATTCCTTCG GTATTTATAA AGGCTACGGT ACTCCTGTTG ATCCTTCAAC GGGCAATCAG GCGCTCATTA ATTTATTCAA ATCAAAACTC CCTTAG
|
Protein sequence | MTIRTIFILI AACSIAIACR KKHDDESTSI DSRQLHASGI DIIDGSGKKV YLRGVAFGNE VWSNAPTIPT THHSEEDYKR VRDMGMNAIR FYLNYQIFED DATPYVYKSA AWDWIDQNIA WAKKHDIYLI LNMHVPQGGF QSNGDGDALW NNPENQNRLK ALWFNIAKRY ANEPTIAGLD LLNEPVVTTS IDQWKNFSQS IIDTIRTVNT NSMIIVERVN AIDDNWSNNS DMNFFDLNDN NLAYEFHFYL PMEFTHQGAS WIGGGNTFPI GQTYPDANRV FVKGNSFFYT ASFANARIPT GTYDWMEYAE SPKYKIQDEK IKLGKPTVVN RANTGKIWVD DIVVKEYDAS DNFVRNVFEI DLNTFDGWYS WSENGSGTAG ADAATGHSNS NSLYMQGTTG DANISSTAYQ FIPKQGYSYT ISGWVKGEDV TAGSAAMFRL DFEQIADGDN VYSLGKEYLA AQVDQYYKWT HIKNKPLFLG EFGVIQFGYE NNLGGLQWTG DMIDILKERD AHFTYHAYHE DSFGIYKGYG TPVDPSTGNQ ALINLFKSKL P
|
| |