Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1249 |
Symbol | |
ID | 7310044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1546692 |
End bp | 1548752 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643608170 |
Product | glycoside hydrolase family 9 |
Protein accession | YP_002505585 |
Protein GI | 220928676 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000227885 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TAAGTGTACT GGTTGTGCTT GCTGTTTTAC TGGTGAGCAT TATTCCATCT GCGGTTTCAG CTGAGGAAAA CAACTTTAAC TATGTTGATG CATTTGCCAA GTCAATTCTT TTCTATGAGG CAAATTGGTG TGGTCCTGAT GCAGGAAACA ACAGGATTAA ATGGCGCGGC CCTTGCCATT GTGATGACGG AAAGGATGTA GGACTGGACT TGACGGGAGG GTTTCATGAC TGCGGAGACC ATGTAAAGTT CGGGTTACCG CAATGTGCTT CTGCTTCAAC CCTTGCATGG GCTTACTATG AGTTCGAGGA TGTTTTTATT GACAAGGGAC AGGATGGCTA TATGCTTAAT ATTTTAAAGC ATTTTTGCGA TTACTTTATG AAATGCTTCC CGAACAAGAC TACGTTTTAC TATCAGGTAG GCGACGGTGA TGTAGATCAC CAATATTGGG GGCCGCCTGA GCTTCAGACA TATGACAGGC CAGCCTATTA TGTTGCAACA CCATCAAATC CCGGTTCTGA TGTAGCAGGT GATGCGGCTG CTGCACTGGC ACTTATGTAT CTGAACTATA AGGATATAGA TTCAACTTAT GCAGAAAAAT GTCTTACTTA TGCAAAAGAC CTTTATGACT TCGGTATGAC CTATAGAGGA AACAGTAAAG GTCAAAGCTA TTATCTGCCT AGAACTTATC TGGATGAACT TATGTGGGGA TCAATCTGGC TTTATGTTGC TACAAATGAT AATAAATATA TGGATAATGT AGAAAAGCTA ATGGTTGAGA AAGGGATAAC CGGAGGCAAC TCCTTTAATG ACAATTGGAC CCAGTGCTGG GATTATGTAT TAACAGGAGT GTTTACAAAG CTTGCAACAC TTTCGACTAA TCCTTTATAC AAATCAATAG CCGATGACCA TATTGATTAC TGGCAGAATA GATTAAAAAC TACTCCTGCA GGATTGAAAT ACCTTGATAG CTGGGGTGTT TGCAAATACC CTGCAGCAGA GAGTATGGTT CAGCTTGTTT ATTATAAATA TTTTGGTAAT GAGAAGTGTC TGGACTTTGC AAAGGGACAA ATAGATTACA TTCTTGGAGA CAATCCCAAC AATATGTCTT ATGTAGTTGG TTTCGGAGAT AATTATCCCA AATACCCTCA TCATCGTGCC GCAAGCGGAG TTTTGGAAGG CCCGCCTGCA GATGAAAAGA AGGAATTGCC GGAAAGGCAT ATTCTATACG GTGCTCTTGT AGGGGGAGCA GACATGAATG ATGAATATCA TGACAATGTT AATGAGTATG TTTATTCAGA AACCGGATTG GACTATAATG CAGGCTTAGT AGGAGCAATG GCGGGTATGT CAAAATACTT CGGTAAAGAC CAGTTGCCTG AACCTACTCC GGGTATTGAG GGTGAGCCGA CCCAATACTA TTCGGAAGCT AAACTATACA AATCAAATAC AGAGGGTGTT ACCGTTGACC TTAATTTGTA TAATATTGTT ACGGCACCTC CTCAATATGA AAAAGACTTA TCCTGTAAGT TCTTCGTAGA CTTATCAGAA TTTGCTGCTG AAGGAATTAA TCCATCTAAG TTTACAACTA AGGTATATTA TTCTCCGGCC GATGCACAAA TATCTGGAAT TCAGCCATAT GATAAGGAGA AAAATATCTA TTACGTAGAA ATTACTTTCC CAAATAGTCA GTTGTATGCA AGAACTTATG TTCAATTCTG TATTTACAAT TACGAAAGTA AACTATGGGA TTCCAAAAAT GACCTTTCTA CCGCAGGTTT AACCGATGAA TATGCGAAAA TAGAAAATAT TCCAATATAC AAAAATGGCG TTAAAGTTTA CGGGAATGAC CCTGCGGGAA GTACAACAAT TTTGTATGGA GATTTAAATA ACGATAAAGA AATAAATGCA ATTGATTTTG CATTATTAAA AAAATACCTG CTGAATGGAG ATGCAGAGGG TATAATTCTT AATAATGCCG ACATTAATAA GGATGGAGCT GTAAATGCTC TGGATTTTGC AAACTTAAAA CTGTACTTAC TTGGTAAATA A
|
Protein sequence | MKKVSVLVVL AVLLVSIIPS AVSAEENNFN YVDAFAKSIL FYEANWCGPD AGNNRIKWRG PCHCDDGKDV GLDLTGGFHD CGDHVKFGLP QCASASTLAW AYYEFEDVFI DKGQDGYMLN ILKHFCDYFM KCFPNKTTFY YQVGDGDVDH QYWGPPELQT YDRPAYYVAT PSNPGSDVAG DAAAALALMY LNYKDIDSTY AEKCLTYAKD LYDFGMTYRG NSKGQSYYLP RTYLDELMWG SIWLYVATND NKYMDNVEKL MVEKGITGGN SFNDNWTQCW DYVLTGVFTK LATLSTNPLY KSIADDHIDY WQNRLKTTPA GLKYLDSWGV CKYPAAESMV QLVYYKYFGN EKCLDFAKGQ IDYILGDNPN NMSYVVGFGD NYPKYPHHRA ASGVLEGPPA DEKKELPERH ILYGALVGGA DMNDEYHDNV NEYVYSETGL DYNAGLVGAM AGMSKYFGKD QLPEPTPGIE GEPTQYYSEA KLYKSNTEGV TVDLNLYNIV TAPPQYEKDL SCKFFVDLSE FAAEGINPSK FTTKVYYSPA DAQISGIQPY DKEKNIYYVE ITFPNSQLYA RTYVQFCIYN YESKLWDSKN DLSTAGLTDE YAKIENIPIY KNGVKVYGND PAGSTTILYG DLNNDKEINA IDFALLKKYL LNGDAEGIIL NNADINKDGA VNALDFANLK LYLLGK
|
| |