Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0054 |
Symbol | |
ID | 7308973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 62580 |
End bp | 63638 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643606983 |
Product | periplasmic sugar-binding protein |
Protein accession | YP_002504422 |
Protein GI | 220927513 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000350797 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGGATTA AGCAACTAAA AATATTTCTA CTACTTGCAA GTCTGATTAC TATGGTTCTT TTAGCCGGTT GCTCTTACAA AAACCCAATC GATAAACCTG ATAAAATTAC TATTGGTCTC TCCATGGCAA CCCTCCAAGA GGAACGTTGG CACAGAGACA TAGAAGCTCT CAGGGCCAAG GCCCAAGCCA AAGGAGCAGA AATTCTTTTC CGAAATGCAA ACAATAATAT AAATGACCAA ATTTCCCAGG TTAAAAGTCT GTTGTCAAAG GATATTGACA TTCTTGTAAT CGTTCCACAG GACGCCGAAA AGTCTCAGCA GGCTGTTCAG CTTGCCAGAA ATAAGGGAAT AAGGGTAATC TGCTATGACA GACTGATTAA AAATTCAAAC ACTGATTTCT ATGTATCTTT TGATAACATA AGAGTTGGAG AATATATGGC ATCCCTGATG GTATCCAAAG TACCTAAAGG GAACTACATA TTAATTAATG GAGCTAAAAC AGATTACAAT AGCTTTATGT ATAACAAGGG TTTTAAAAAT ATTTTAGGCA AATACTTATA CGAAGGGTCC ATAAAAATCG TTGACGAAGT GTGGGCAAAT GACTGGAAAC CGGAGGATGC CTTTAAGTGT GTGGACAAGG CTCTTCGGGA CGGAAAAAAG ATTGATGCTA TTATTGCCGC CAATGACAGT CTTGCCGGCG CAGCAATCAA GGCTCTTTCT CAAAGGCGAT TGGCAGGAAA AGTTCCTGTT GCCGGGCATG ACGCCGATAT TTCAGGTTGT CAGAGAGTAG CTGAAGGTAC TCAGTTGCTG ACCGTTTACA AACCTATAGA TCAATTAGCC GAAAAGGCAA TTGACGTTGT ACTGGGACTT TTGAATAATG ATTATTATGC CTGCAATAAA TTTATTAATG ACGGTGAAAG TGATATTCCC TATGAGATGG TGGAACCTGT CGTAGTCACA AAGGATACCC TTGTTGACAC TGTTATAAGT GCAGGCTTTC ATAAACTTGA AGATGTATAC CGTAATGTGC CTGAAAGTAA ATGGCCTCGA AAAAAATAG
|
Protein sequence | MWIKQLKIFL LLASLITMVL LAGCSYKNPI DKPDKITIGL SMATLQEERW HRDIEALRAK AQAKGAEILF RNANNNINDQ ISQVKSLLSK DIDILVIVPQ DAEKSQQAVQ LARNKGIRVI CYDRLIKNSN TDFYVSFDNI RVGEYMASLM VSKVPKGNYI LINGAKTDYN SFMYNKGFKN ILGKYLYEGS IKIVDEVWAN DWKPEDAFKC VDKALRDGKK IDAIIAANDS LAGAAIKALS QRRLAGKVPV AGHDADISGC QRVAEGTQLL TVYKPIDQLA EKAIDVVLGL LNNDYYACNK FINDGESDIP YEMVEPVVVT KDTLVDTVIS AGFHKLEDVY RNVPESKWPR KK
|
| |