Gene Information |
Locus tag | Ccel_1252 |
Symbol | |
ID | 7312204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1552551 |
End bp | 1554266 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643608173 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002505588 |
Protein GI | 220928679 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
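The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Ccel_1252.
gene_len = feature_length(1552551, 1554266)
print(gene_len)                           # 1716
print(expected_protein_length(gene_len))  # 571
```

Both results agree with the Gene Length (1716 bp) and Protein Length (571 aa) fields.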
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.670255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTCA ATTACTTAAA AAAAGCTGTA TCATTAGCAG TAGTTTTATC TTTAACAGCT TCAATCGTTG TAGCTTGCGG TAGTCAAAAC AATACATCCG ACAGCACAGC ATCATCATCT CAAAGTACTA CTTCTTCAGT AGCAAATAAG GATCCTTTCG GAAAATATAG TCCTGAGATT GATATTTCTT TTGTTCGTGG AATAGACGAT GACCTTGCAG CAAACATATT GCCAAAGACA CCGGGGGAAA CTCTTGAAGA CAATCGTTGG ACGAAGTTAT ATAAAGATGA ATTAGGTATA AATGTAAAAT ATGCTTGGAC AGTAAAAGGA AATGAAACAT CAGATGCTTA TACACAGAAA ATCAATGTAA CTCTGGCATC TGGAGAACTC CCTGATGTAG TTCTTGTAAA TCCTTCTCAG CTTAAACAAC TGGTTGATTC AGATATGATA GAGGATATGA CCCAATATTA TAATGATTAT GCTTCAGAGG ATTTCAAGAA ACTTATGACA GAAGAAGGAA CCGGTAATAT TGATTCAGCT ATGTTTGATG GTAAAATGAT GGCTATTCCG GAACCTGTTT CAACTAATGA AACCGCACAC TATCTATGGA TTAGAAATGA CTGGTTGAAA AAATTAAACC TACAGGCACC TAAGACTATG GATGATGTTT TAAAAATATC AGAAGCATTT ACAACTCAAG ACCCTGATGG AAACGGAAAG AACGATACTT TCGGACTTCC GATCACAAAA GATATCTACA GCGGATGTAT GGGAACAGAA GGCTTCTTTG CAGGATATCA TGCATACCCG AATATGTGGA TAGAAGATGA TTCAGGTAAA ATAGTATGGG GTAGTACTTT ACCTGAAACA AAGGTAGCAC TCCAGAAATT GGCTGACATG TACAAGAGCG GCCAGCTTGA CAAAGAATTT GGTGTCAAGG ATGGAGGTAA GGTTGCCGAG ACTATAGCAG CAGGTAAAGT AGGTATAAAC TATGGTCAAC AGTGGAACCC AATGTATCCT CTAATAAGCA ATTTTAACAA TGATAAAAAT GCCGATTGGA CAGCATATCC TATTGTTTCA AATGATGATA AAAAGGTTAT GGTACCTTTA AAGTTCAATC AGACAAGAAT ATTTGCAGTA AAAAAAGGAT ATGAACATCC AGAAGCACTT GTAAAACTGT TCAATGCACA TGTTGAGAAG AACTGGGGTA AAACAGCAGA CTTCAACAAG TACTATATGC CGGTTGAGAA CGGCGGTGTA GGAGTTTGGA AATTCTCTCC TGTGTGCCCT GCTCCTGTAT TCAAGAACCT TGAAGCATTC GTGGCAATTG ATGAAGCCAG AAATAACAAT GACTTCAGCA AGCTGACAGG TGAGCCTAAA ATTATCCAAG GTAATATAGA GGCTTATGCT AAAGGCGACA CTTCACAATG GGGATGGGAA AAGATTTACG GTAAAGAAGG TGCATTCCGT AATATGGTTG AGTACAAGAA GAATGATCAG CTGTTAAAAG AAAAGTTTGT TGGAGCTCCA ACTACAACAA TGGCTGAAAA GAAAACAACA CTTGAAAAAA TGGAAAAAGA AGTATTTATA AAAATTATAA TGGGTGCAGC TCCTATTAGT GAATTCGACA AGTTTGTTAG CGATTGGAAT AAACTTGGTG GTGCTGATAT GACAAAGGAA GTTAACGAAT GGTATGACTC AGTTAAATCC AAGTAA
Protein sequence | MRFNYLKKAV SLAVVLSLTA SIVVACGSQN NTSDSTASSS QSTTSSVANK DPFGKYSPEI DISFVRGIDD DLAANILPKT PGETLEDNRW TKLYKDELGI NVKYAWTVKG NETSDAYTQK INVTLASGEL PDVVLVNPSQ LKQLVDSDMI EDMTQYYNDY ASEDFKKLMT EEGTGNIDSA MFDGKMMAIP EPVSTNETAH YLWIRNDWLK KLNLQAPKTM DDVLKISEAF TTQDPDGNGK NDTFGLPITK DIYSGCMGTE GFFAGYHAYP NMWIEDDSGK IVWGSTLPET KVALQKLADM YKSGQLDKEF GVKDGGKVAE TIAAGKVGIN YGQQWNPMYP LISNFNNDKN ADWTAYPIVS NDDKKVMVPL KFNQTRIFAV KKGYEHPEAL VKLFNAHVEK NWGKTADFNK YYMPVENGGV GVWKFSPVCP APVFKNLEAF VAIDEARNNN DFSKLTGEPK IIQGNIEAYA KGDTSQWGWE KIYGKEGAFR NMVEYKKNDQ LLKEKFVGAP TTTMAEKKTT LEKMEKEVFI KIIMGAAPIS EFDKFVSDWN KLGGADMTKE VNEWYDSVKS K
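The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates DNA using the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. It does not model the alternative start codons that table 11 allows, and it raises `KeyError` on ambiguous bases such as `N`. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping (it differs in the permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    seq = clean_sequence(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from this record.
print(translate("ATGAGATTCA ATTACTTAAA"))  # MRFNYL
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.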