Gene Information |
Locus tag | Ccel_0698 |
Symbol | |
ID | 7309557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 804691 |
End bp | 807528 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643607637 |
Product | protein of unknown function UPF0182 |
Protein accession | YP_002505057 |
Protein GI | 220928148 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
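The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming that the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon. The record states neither convention, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 804691
end_bp = 807528

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (2838 bp) and Protein Length (945 aa) fields.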
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00837363 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAG TATATGATTA TTACAAAGAA AAAGATAAGA AGCAGAAAAA TGGCAAAGGT TACAAAGGTA AAATTGCGGC CTTGATTACT TTGGCAATTA TAGTGGTTTT TGCAATAATA GGAAGCTCTA TATACATGGA GCTGATTCAG CTGAAGGAGT TAAACCCAAA TGCTGATTAC ACTACCGTTT ATACCAAAAA TCTATTATAC AAGACAATAT TCTTTATAAT CAGTTTTGTA ATTATCACCT TATTTGTATT TATTACAAAC AAGGTAACGG GAAATAACCT TAAAAAGTAC TTTAAAAATA ATAACCTAGA GCAGAGGCGA TTGGTAAACG CTCCTGTTGC ATTTGTAATA GGTATACTAG GAGCTTTTAT TACCAAGGAA TTCTTTTTTA ACAAAGCACT CCTGTTTTTA AATTCAGCTA ATTTTGGTAT AAAAGACCCT CAATTCGGAC AGGACATTGG GTACTACATG TTTCAGAGAC CTTTTTATAT GTCTCTGTTT AATTTTATAT CGAATTTGTG GCTGTTCCTG GTATTTTACA CGGCTGTATA CTATTTAATT GTATTGATGA CAGCTTTAAA CAACACCTTA TCCACTAAAG ATCTCAAGGA TAAGACCATT TTAAGACATA ATTTGATAAA TATAGCAGTC TTCTTTGTAC TAAAAACTAT TTCCTTCAAA TTTCAAAGAG AATCGCTGCT TTATACAAAC TTTACAAATA AGGACATAAC CGGAGCGGGC TACGTTGATA CAAATATTTG GATAAAGTAC TATACAGTAG CTCCTGTAAT TGTACTTGTC ATTGTTTTAT TGGCGGGTTT CTTTATGTGG AAGGGCAAGC TAAAGAAGTC GGCTATTGTA ATTGCTGCTT TCCCTACAAT TTTTATTTTA CTTACATTGG TTTCATCCAT TATACAAAAT GCAATTGTCG GACCCAACGA AATTGAATTC GAAGACAAAT ATCTTAAAAA CAATATGACT GAAACAAGGG CAGCTTTCGG ACTTGACAAA ATTCAACCCT ACGATTTCAA CAAAATAGAG GAACTAACTC CTGAAATAAT AAACAACAAC AGAAATACCG TGGATAATAT ACGTGTAGTC GACTATATAC CTACATTAAA CAGTAACAAA CAGCTCCAGA GCAATACAAA CTTTTACACA TTTCATAACG GAGATATCTT AAACTACACA GTAAACGGTA AAGAAATACC TGTACTTATT TCTGCCAGAG AAATAAACAC TGACTATCTG CGGAACCAAA ACTTTGTTAA TAAAACCTTC AAATATACAC ATGGTTATGG TGTCGTTGTA AATCCCATAA ATAAATTAAC GGCGCAGGGA CAGGTTGAGT TTTTAATCAG CGGCCTGAAA ATGGACACCG TCGACCAGGT TAACCTGAAG GTTACCGAGC CAAGGATTTA CTACGGACAG TTAACAAACA ACTACGTCAT AGTAAATCCC AAAAGTGCTA AGAAGCTAAC TGAAATAGAT TATGATGGAA CGACTGTCAG TTATTTTGAT GAAACAGGAA ACAAGTACGA GAAAATTAAA ATGAACCTTT TGAACAGGAT ACTGTTTTCG ATTAAATATG CAGATACCAA CCTTTTGGTA TCCAGCAATA TATCATCTGA CTCTAAAATT CTATTGAATA GAAATGTTGT AGAAAGGGCC CAGAAAGGCA TACCTTTTTT AAAAGTTGAT TCCGATCCTG CATTGAATAT AACTGCCGAC GGAAAGCTTG TATGGGTTCT GGATGCTTAT AGCATATCAA ACAACTATCC GTATTCACAG 
TACTATTATA CACAGTCGGA GGATAATGAT CTTCAATCTC TTAACGGAAT AAATTACATA AGGAATTCCG TTAAAGTAAC TGTTGATGCC TATGACGGTA CAGTGAAGTA TTACATTATA GATAATGAAG ACCCTATAAT AAAAGCTTAT CAAAGTGTGT ATCCGGGACT TTTTACAAAG GATGAATTTC CGGCAGACCT TGCGTCTCAT ATAAGGTATC CTGAAACCCT TTTCAAGCTT CAGACTGAGG TACTGAAGAA ATACCATCTT GACCCCAAAA AGGAAGATAA TATATCAACC TTTTATACAG GACAGGATGA ATGGAATATT GCAAAATATC CGGATCAAAA CAACGAAAGC GGAGCAAAGG ATATTGACGC CTATTATAAC ATGGTAAAAC TCCCGGGAGA TATCGGTAAA AAAGAGGAAC TCATTCTTAT GAGGCCATTC ACACCTTCGG GAGAGAAGCA TAACATGGTA TCATGGCTGG CAGTTCGTAA TGATATGGAA AACTATGGGA AAATGATATT ATTTAACTTC CCGAAAAATA CAAATATTCT CGGTCCGGAC CAGTTTGAGG TAAATATTAA CCAGATTAGT GAAATATCCG AGGATATGAC ACTGTGGGGA CAGGGTGGCT CAAAGGTATT TAAGGGAAGT CTTCTGGTAA TTCCTATAGA AAACAGTATT CTGTATGTTG AACCAATATA TATACAGTCA AACAGTGCAT CATCTATCCC GCAGGTTAAA AGGGTCGTAG TGGGTTACCA ACAGGGAGCG GATTTCAAGC ACGGTATAGG AGATAATCTT GATAGTGCTA TAGACGATCT GTTTAAAGGC AGTGTAAAGC CGTCTGGTGA AAAAACTCAG ACCATTAACC CAGAAAATCA GAATGATGAA GGTCATGAAG CTGTTACGCC ACCGGATAAA TCACAAAGTA TTGACCAGAA GAAACTTGAC GAATTGCAGC AAAAGCTTGA TCAACTCATG AAACAGTCAC AGGAGATTAA TGACCTTTTA AAAAGCTTGA GAAAATAA
|
Protein sequence | MEEVYDYYKE KDKKQKNGKG YKGKIAALIT LAIIVVFAII GSSIYMELIQ LKELNPNADY TTVYTKNLLY KTIFFIISFV IITLFVFITN KVTGNNLKKY FKNNNLEQRR LVNAPVAFVI GILGAFITKE FFFNKALLFL NSANFGIKDP QFGQDIGYYM FQRPFYMSLF NFISNLWLFL VFYTAVYYLI VLMTALNNTL STKDLKDKTI LRHNLINIAV FFVLKTISFK FQRESLLYTN FTNKDITGAG YVDTNIWIKY YTVAPVIVLV IVLLAGFFMW KGKLKKSAIV IAAFPTIFIL LTLVSSIIQN AIVGPNEIEF EDKYLKNNMT ETRAAFGLDK IQPYDFNKIE ELTPEIINNN RNTVDNIRVV DYIPTLNSNK QLQSNTNFYT FHNGDILNYT VNGKEIPVLI SAREINTDYL RNQNFVNKTF KYTHGYGVVV NPINKLTAQG QVEFLISGLK MDTVDQVNLK VTEPRIYYGQ LTNNYVIVNP KSAKKLTEID YDGTTVSYFD ETGNKYEKIK MNLLNRILFS IKYADTNLLV SSNISSDSKI LLNRNVVERA QKGIPFLKVD SDPALNITAD GKLVWVLDAY SISNNYPYSQ YYYTQSEDND LQSLNGINYI RNSVKVTVDA YDGTVKYYII DNEDPIIKAY QSVYPGLFTK DEFPADLASH IRYPETLFKL QTEVLKKYHL DPKKEDNIST FYTGQDEWNI AKYPDQNNES GAKDIDAYYN MVKLPGDIGK KEELILMRPF TPSGEKHNMV SWLAVRNDME NYGKMILFNF PKNTNILGPD QFEVNINQIS EISEDMTLWG QGGSKVFKGS LLVIPIENSI LYVEPIYIQS NSASSIPQVK RVVVGYQQGA DFKHGIGDNL DSAIDDLFKG SVKPSGEKTQ TINPENQNDE GHEAVTPPDK SQSIDQKKLD ELQQKLDQLM KQSQEINDLL KSLRK
|
| |
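The gene and protein sequences can be spot-checked against each other by translating the first and last few codons. The sketch below is a minimal, hypothetical helper and not part of the source record. It builds the codon table from the standard NCBI ordering, whose amino acid assignments translation table 11 shares with table 1. It ignores alternative start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# First 30 and last 18 bases of the gene sequence, copied from the record.
head = translate("ATGGAAGAAG TATATGATTA TTACAAAGAA")
tail = translate("AAAAGCTTGA GAAAATAA")

print(head)  # compare with the first 10 residues of the protein sequence
print(tail)  # compare with the last 5 residues, followed by the stop
```

Here `head` matches the opening residues `MEEVYDYYKE`, and `tail` matches the closing residues `KSLRK` followed by a stop codon.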