Gene Information |
Locus tag | Ccel_1016 |
Symbol | |
ID | 7309841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1264277 |
End bp | 1266034 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643607943 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002505358 |
Protein GI | 220928449 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
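The coordinate and length fields above are tied together by simple arithmetic, which the sketch below checks. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with its numbers. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1264277
END_BP = 1266034


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Length in aa for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1758 585"
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.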
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT TTATATCCAA AGCTATTATT TGTGCCACGG TTACAGCCCT GTTATTAACT GGCTGCGGAT CGGGGACTAA TACTGAAAGC ACTGCAAGTT CTTCTGCCGG AAACTCAAGT GTTCAGGCTC AAAAGTCTGA TATTTCATTT CCATTAAAGG AGAAAGCTAC ATTAACAGCA TTCGTAATGA CTCCTTACTC TGGTGAAAAC GGTGACTATA CCAACAACTA CGTTACCAAC TACCTAGAGG AAAAGCAGAA TATTAAAATT GATTTCAAGT ACTCCGTAAC CGGCGATGAC GGTAAAACCA AGCTGAACTT GCTTATGGCC AGCGGAGAAA AATTGCCTGA CATATTTTTA TCAACGAAAT GGTCCAAAGC CGAAACTATG CTTTATGGTA AGCAAGGACT GATTATACCG TTGAATGATT ACTTAAAAGA TGCACCTAAC TGGAATGAAT TAAATAGGGT TAGTCCGTTA AGATTGGGAG ATATCACAAT GCCTGACGGA AACATATATT GCTATGGAGA CGATAATGAG TGCTTCCATT GTATGTTCCA GTCAAGAATG TGGATTTACA AACCTTGGGT TGATAAACTA ATGGGCGGAA AAATGCCTTC CACTACAGAT GAACTGTATA CGTTTTTGAA GGCTGTCAAA GAAAAAGATC CTAACGGCAA CGGAAAGGCT GACGAGATTC CCTTCACCGG TAATATTGCT GCCGGAGGTT GGGCAACTGA TCCGACAACC TTTATAACAA ATGCGTTTAT TCAGAATAAT AACATATTGT CAAATACAAA CCCTGTAGTA GGGGCAGGAT TTGTTGTAAA TAACGGTAAA GTTGAATATC AGTTTACAAA AGACGAATAC AAGGATGCCT TAGTATACTT AAACAAGCTT TATAAGGAAG GCTTACTGGA TTCACAGACC TTTACACAGA ATGCGGATCA GCAGAAAGCT ACTGTACAAG GAACTCCTCA ACTGGCTGCC ATGGCACCGG GAGGTTGGTG GCCGTGTAAC ACAGATGAAC TTTTGAAGGA GCAGGAAGGT TCATATCAAG ATTGGGTGGT TTTAGAGCCT ATAAAAGGAC CAAACGGAGT ACAGCTATCT GCCTACTATC CAACAAACTA TTTCCAGAGC AACTATGGTC TAGTATCTGC TGACTGTAAA AATCCTGAAC TAGCCGTTAA ATTCTTTGAT TTGCTTGCAT CACAAGAAAT GACTCTTATT ACACAAAATG GACCACAGGG TATAGCATGG GATTATGTAA CAGAAGGTAC TTCAATTGCA GGCGGAGAAG CTAAATGGAA GAAAATACCT GCCAAGAAGT TAAGAAGCAG CCAGATTCCG GATTATTCCG ATGAAGGCTT GGATTTTGTA AAATATGTTT GGGATCCAGA TGCAGTTATG ACTCATAATA CAAATGAGTT CAGACTTAGC CAGTACTGTG CTAATCCTGA GACCAGCGTT GAGGCATTGT TGTATCAATG CGGTAAAGCA TATTCAAAAT ACAAACCTGA CGACGCTACA ATGCTTCCAA ACCTCGCCTA CTCGGAAGAG GATGCTAAGA AAATTGCCGA CTACACAGTT TCAATAGGTA AATTTGTAAA TCAGGCTACT GTTCAGTTTA TTACAGGTGA CCTGGATATT AATACATGGC AAAATTATGT AGACAAGATT AATAGTATGG ATTTGAAGGG ATACCTAGCT ATTCAACAAA ATGCATACGA CCAATATGCA AAAAGTTTAA ACAAATAG
|
Protein sequence | MKKFISKAII CATVTALLLT GCGSGTNTES TASSSAGNSS VQAQKSDISF PLKEKATLTA FVMTPYSGEN GDYTNNYVTN YLEEKQNIKI DFKYSVTGDD GKTKLNLLMA SGEKLPDIFL STKWSKAETM LYGKQGLIIP LNDYLKDAPN WNELNRVSPL RLGDITMPDG NIYCYGDDNE CFHCMFQSRM WIYKPWVDKL MGGKMPSTTD ELYTFLKAVK EKDPNGNGKA DEIPFTGNIA AGGWATDPTT FITNAFIQNN NILSNTNPVV GAGFVVNNGK VEYQFTKDEY KDALVYLNKL YKEGLLDSQT FTQNADQQKA TVQGTPQLAA MAPGGWWPCN TDELLKEQEG SYQDWVVLEP IKGPNGVQLS AYYPTNYFQS NYGLVSADCK NPELAVKFFD LLASQEMTLI TQNGPQGIAW DYVTEGTSIA GGEAKWKKIP AKKLRSSQIP DYSDEGLDFV KYVWDPDAVM THNTNEFRLS QYCANPETSV EALLYQCGKA YSKYKPDDAT MLPNLAYSEE DAKKIADYTV SIGKFVNQAT VQFITGDLDI NTWQNYVDKI NSMDLKGYLA IQQNAYDQYA KSLNK
|
| |
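The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be stripped before any analysis. The following is a minimal sketch, using only the Python standard library, of how the GC content could be recomputed and how the gene sequence could be translated. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for elongation; table 11 also allows alternative start codons, which the sketch ignores because this gene starts with ATG. All function names are hypothetical helpers, not part of any existing tool.

```python
# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons with unexpected characters are reported as "X".
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied with their original spacing.
    prefix = "ATGAAGAAGT TTATATCCAA AGCTATTATT"
    print(translate(prefix))  # prints "MKKFISKAII"
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its spacing is removed, and `gc_content` on the same input can be compared with the rounded GC content field.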