Gene Information |
Locus tag | Ccel_2458 |
Symbol | |
ID | 7311127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 2971590 |
End bp | 2973356 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643609388 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002506767 |
Protein GI | 220929858 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
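The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention explicitly). The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2971590, 2973356)
print(length)                  # 1767
print(protein_length(length))  # 588
```

Under these assumptions the computed values agree with the listed gene length (1767 bp) and protein length (588 aa).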
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT GGCTTATCAA ATCCCTAAGT GCTGCCATTA TCCTCAGTTT TACCGTAAGT ATGGCAGCTT GTACCAAACC GGGTACCGAC AGCAGCTCGT CATCCGGTTC CGGCAGTTCT ACATCCGGCA GTAACCCTTA TAAGGACACT GTGACTTTAG ATGTTTACAC AATGACAGGC AATTTTCAGG GTGAACAGAT AGGATGGTTC GGAAAGGTAA TCAAGGACAA ATTTAATCTG AAACTAAACA TTATTTCTGC TCAGACAGAC GGGAATGCCG ATCAGTCCTT TCAGACACGT TCTGCTTCCG GAGACCTCGG TGATATCGTG GTATATGGTG CCATAGATAC AAAATTCACC GATTCTTTAA AGGCGGGTCT TTTAATGAAG CTGTCTGATA ATGATTTGCT GGCAAAGCAC GGAGATAACA TTGTAAAGAA TTTTTCAGGT GCTATTAAGC GTATTTCCGC TAAATACGGC GATTATGCAA TTCCCAATAA TGTATCTAAC GAGTCTCCCA TTACTCCTTC CGAGGATTCT GACCTTACAT TTGGTGTATA CACCCGTTAC GATTACTATC AGGAAATCGG TTCTCCTAAG CTTAATAGCT TCGATGATGT ATATCCGATG TTAAAAGCAA TGGCAAAAAA GCATCCTACC AACTCAAATG GTCAGAAGCA GTATGCTTTT TCACTGTTTA AAGATTGGGA CGGCTGCATG ATGATGTTCG GTAAAATGAT TGCAGAGCTT TACGGCTATG AGGAAGCTCC CTTCGGCGGT TTCCTGCTGA CAAACAATAT GGCTACCGAG TATCAGAGTA TTATTGATCC GGACGGTTAC TATGTCAAGG CTCTTAAAGT TTATAATCAA GCATACCGTG ACGGTTTGCT TGACCCAGAT TCAATCAATC AGACCTGGGA TGATATTACC AAAAAGTATG CAAACGGTCA GGTTCTGTAT TCCCAATTCA GCTGGCTTGG CCCTAACAAT TTTAACAATG CTGACAACAT ATCAAAGGGT GTAGGCTTTG CCCTTGTTCC TATAGCTGAT GAGAAAATCT GGTCAAATGG ATTTACTCCA AACGGCAGCA CCTACCTTGT AGCTCTTGGT AAATCCTGTA AGAACCCCGA ACGTGCCATG GATTTCTTAA ATTGGTACTA TTCATCTGAA GGTGAAATGA TAATAAAGAA CGGCCCTGAA GGTCTTGCAT GGAGAATAGA AAATGGCAAA CCAACATTAA CTGATTTTGG TAGAAAATGT ATGCCATCCA TTGCAGTTGA AGTATCTAGT GAGTTTGGTG GCAGTGATTG GCAGACAGGG GAGTGTAAAG TCGGCTTTGA CGGCTTAGCA GCAAACTCTA TAAATCCAAA TACCAAAGAG CCATACTATT ACAAGCTTTG GTCGTCGACC CTGTCATCCA ACACAAGTAC CCTTCAGAAA AACTGGAGTG CAGCGATGGA CGATGCTCTG TCCACCAAAG ACTGGCTTAT GAAGAATAAC CATGTTTCAG TTTCTCCCGG CACTGATTAT GTTGGACCTA CCCTTCCTTC CGATATTCAA TCAACTCAGG ATGTTGTTAA AATGGATATT CGTGATTATT CCTGGAAAAT GGTTTATGCC AAGAACGATA GTGAATTTAA TTCTCTTCTT AAGCAAATGA CTGAGAAGGT AAAAGGCGAA GGTTATGACA CCGTTGTTGA ATGGAATAAA CAGCAGTTGG CTGAACTTTC TGCGGCCCGC AAGCAGGCTG CAGAAAACAG CAAATAA
Protein sequence | MKKWLIKSLS AAIILSFTVS MAACTKPGTD SSSSSGSGSS TSGSNPYKDT VTLDVYTMTG NFQGEQIGWF GKVIKDKFNL KLNIISAQTD GNADQSFQTR SASGDLGDIV VYGAIDTKFT DSLKAGLLMK LSDNDLLAKH GDNIVKNFSG AIKRISAKYG DYAIPNNVSN ESPITPSEDS DLTFGVYTRY DYYQEIGSPK LNSFDDVYPM LKAMAKKHPT NSNGQKQYAF SLFKDWDGCM MMFGKMIAEL YGYEEAPFGG FLLTNNMATE YQSIIDPDGY YVKALKVYNQ AYRDGLLDPD SINQTWDDIT KKYANGQVLY SQFSWLGPNN FNNADNISKG VGFALVPIAD EKIWSNGFTP NGSTYLVALG KSCKNPERAM DFLNWYYSSE GEMIIKNGPE GLAWRIENGK PTLTDFGRKC MPSIAVEVSS EFGGSDWQTG ECKVGFDGLA ANSINPNTKE PYYYKLWSST LSSNTSTLQK NWSAAMDDAL STKDWLMKNN HVSVSPGTDY VGPTLPSDIQ STQDVVKMDI RDYSWKMVYA KNDSEFNSLL KQMTEKVKGE GYDTVVEWNK QQLAELSAAR KQAAENSK
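The sequences above are displayed in space-separated blocks of ten characters, so the spaces must be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes only the Python standard library; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
prefix = "ATGAAAAAGT GGCTTATCAA ATCCCTAAGT"
print(translate(prefix))  # MKKWLIKSLS
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content; that full comparison has not been run here.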