Gene Information |
Locus tag | Ccel_0643 |
Symbol | |
ID | 7309508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 739911 |
End bp | 741539 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643607584 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_002505004 |
Protein GI | 220928095 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
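
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 739911
end_bp = 741539
gene_length_bp = 1629
protein_length_aa = 542


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the record is internally consistent: the span gives 1629 bp, and 1629 / 3 - 1 gives 542 residues.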
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.649155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCTAA AAAAATGTAT AATACCACTG CTTATATTTG TGTTCCTTTT TCAAACAAGC ACTTCCGCTT ATGAAAGCAT AAGCTTTTTA TATGCAGGAA ACACAACCAC CTATATAAAT AATGTTAATC GTTCAGGTAC AAATCTTACA ACGGTTTCTC CCGATTATTT TGAAATAAAC AGTAACGGGA CTCTAAAAAA AACAATAAAG GTAGATCCTC TTTTTGTGGA GACTATGCAC GCACGGGGAA TAAAAGTAGT ACCCTACCTT AGTAATAACT GGGACAGAAC TCTCGGCAGG GCTGCATTGG CCAACCAAAA CTCATTTGTC GCCAGTTTAA GTGCCGAAAT TGTACGACTT GGTTGTGACG GTATAAATAT AGACATTCAA AACCTTACCG AAGCTGACCG CAACGCATTT ACAGCTTTTA TCAGACTTTT ACGTTCTTAT TTGCCAAAAA CAAAAATACT ATCTGTATGT GTCGCAGCTA ATCCGTGGGG ATCAACTATT GGCTGGCAGG GTTCTTATGA TTATGCCGCC TTAGGTACTA TAAGCGACCA GCTATTCATA ATGGCTTATG ATGAGCATTA CACCGGAAGT GCACCGGGAG CGGTAGCCAG CTTTTCTTTT GTTGAAAAAA GTGTAAACTA TGCCTTGAAA TATGTTCCAT CTACAAAAAT AGTTTTAGGA GTACCATTTT ACGGCAGATA TTGGAAACAA GGAGCAGCAA GCGGCGGCTA CGGAATAACC GTATCTGACG TTGAACGTCT GGTAGCAACC TGTAAATCCA AAACCTGGTA TGACAGTACA TCCCAGTGTG CCCGTGCAAC CGTAACGGTA ACCTCGACAG ATAATGCTGT TATCTGGGGA AGCAGCAGAC TTTCAGCCGG CACTTACGAT ATTTGGTATG AAAATGAAAC TTCACTTGAG AAAAAGCTAT CCCTTGTCTC AAAAAATAAT CTGCTTGGTG CGGGGAGTTG GGCATTGGGG CAGGAGCCTC AGCGTTTCTG GAATAATTAT AGCCAATGGC TGATAGGTAA GCCCTTTATT GATATATCAA ATCACTGGGC ACAAAGCTAC ATCATAGATC TGTTTCAAAA AGGCATCGTC AGTGGTATGC CCGGCAAGCG TTTTGTACCT GACGGCAGTC TCACGAGGGC CGAAGCCGCA GCATTACTTG TAAAAACCCT TGGTTTGCAG AATGAAACCG CAACTGCATC TTTCGCCGAC ACTAAAGATC ACTGGGCATC AAAACAAATT GCCATTGTTA AAGAAAAGGG CATTTTCAGC GGGTATTCGG GAAACATGTT TTACCCTGAA AGAAAGATTA CAAGGGAAGA ATTTGCAGTG GTATGTGACA AAATACTATT CAGCCCTGAT ACTGTAGATT TCTCTCAAAG AATTTTCAGT GATGTAAGTC CTGAAAGCAA CCCATGGTCA AATAAATCTA TCATTGTTCT TTCAATGAAT AATATTCTAT CAGGATACCC GGATGGTACT TTCAGACCGA AAAAAACAAT TACAAGAGCG GAAGCAACCA GAGTAATAGC CGCTTTACTG GAATATCCCG GAGGGTTTAC AATTTCGCCG ACTCATATTC AGAGTCCGTC CCCTGTACCG CCAAGATAA
|
Protein sequence | MRLKKCIIPL LIFVFLFQTS TSAYESISFL YAGNTTTYIN NVNRSGTNLT TVSPDYFEIN SNGTLKKTIK VDPLFVETMH ARGIKVVPYL SNNWDRTLGR AALANQNSFV ASLSAEIVRL GCDGINIDIQ NLTEADRNAF TAFIRLLRSY LPKTKILSVC VAANPWGSTI GWQGSYDYAA LGTISDQLFI MAYDEHYTGS APGAVASFSF VEKSVNYALK YVPSTKIVLG VPFYGRYWKQ GAASGGYGIT VSDVERLVAT CKSKTWYDST SQCARATVTV TSTDNAVIWG SSRLSAGTYD IWYENETSLE KKLSLVSKNN LLGAGSWALG QEPQRFWNNY SQWLIGKPFI DISNHWAQSY IIDLFQKGIV SGMPGKRFVP DGSLTRAEAA ALLVKTLGLQ NETATASFAD TKDHWASKQI AIVKEKGIFS GYSGNMFYPE RKITREEFAV VCDKILFSPD TVDFSQRIFS DVSPESNPWS NKSIIVLSMN NILSGYPDGT FRPKKTITRA EATRVIAALL EYPGGFTISP THIQSPSPVP PR
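
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full record, so it does not reproduce the 42% figure listed in the table.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record above.
fragment = clean_sequence("ATGCGGCTAA AAAAATGTAT")

print(len(fragment))         # 20
print(fragment[:3])          # ATG
print(gc_percent(fragment))  # 30.0
```

To check the full record, pass the complete `Gene sequence` value to `clean_sequence` and compare the result of `len` and `gc_percent` with the `Gene Length` and `GC content` fields.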