Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0374 |
Symbol | |
ID | 7309258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 426951 |
End bp | 428303 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643607302 |
Product | beta-galactosidase |
Protein accession | YP_002504739 |
Protein GI | 220927830 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTCA AAGAAGGTTT CGTTTGGGGT ACGGCAACAG CATCATATCA AATTGAAGGA GCTGTTAACG AAGGCGGAAG AGGTGAGTCC GTTTGGGATG AATTTTGCAG GATGAAAGGC AAAATTGATG ACGATGATAA CGGAGATTCT GCATGTGACA GTTATCACAG ATATTCTGAG GACATACAGC TTATGAAGGA AATCGGTATT AAGGCATATA GGTTTTCCAT AAGCTGGACC AGGATTTTAC CCGATGGTAT AGGTGAAATC AACATGGAAG GCGTAAACTA CTACAATAAT CTTATTAATG GGCTGCTGGA AAATGGTATA GAGCCATATG TAACTCTATT TCACTGGGAC TACCCTATGG AGCTTCAATA TAAGGGAGGA TGGCTGAATC CTGAAAGTCC TCTTTGGTTT GAAAATTATG CAGCCATATG CTCAAGACTA TTTTCCGACA GAGTAAAGTA CTGGATAACC AGCAACGAAT CCCAGTGTTA CATTGGATTT GGTTACGGCA CAGGCTGGCA TGCACCGGGC TTTAAGCTTC CGGTAAACCA GGTGGTAAGA GCTTGGCATC ATAATTTAAA GGGATTGGGA CTGGCTGCGA AAGCAATACG GGAAAATGCA AAGGGAGAAG TCAAAGTAGG GCTGGTAGCC TGCGGAGAGG TTGGAATTCC TGCATCAGAC AGTGAGGCAG ATATGCAGGC TGCACGTAAT GTACTTTTTG ACAGAGAGCA TTCCGAGGAT TCAATCGATT TTGGATATGG GGACCTTTTC GAGCCTGCAT TAAAGGGAGA GTATCCGAAA AGCCTAATCC CATATCTTCC TAAAGGCTGG CAGGAGGATA TGAAAGACAT TTGTGTTCCT CTTGATTTTT TAGGCGTGAA CGCTTATATA GGTTCTATTG TAGAAGCATG TGAAAATAAA AAATACAGAC ACCTTAAATT GCCTGTTGGT ATAGGCAAAA CTTCCATGGA ATGGCCGTTT AAACCGGAAA CTCTGTACTG GGTAACTAGA TTTATATCCG AGAGATATAA ATTGCCAGTA TACATTACAG AAAATGGCAT GGCGAATAAT GACTGGATAA GCACTGACGG AAAAATCAAT GATACTCAGA GAGAAGACTA TTTGAACCAA TATCTTTCTG CACTGTCAAA GTCTATAGAT GACGGAGCCG ACGTAAGAGG ATATTTTTAC TGGTCACTCC TTGACAATTT TGAGTGGGCA TACGGATATG CAAAGAGGTT TGGACTTGTA TATGTAGATT ACAGCAATTT CAGCCGAACT CTAAAACAGT CTGCATTAAG GTATAAGAAA ATTATCGAAT TAAATGGCGA AGTATTAAAA TAA
|
Protein sequence | MAFKEGFVWG TATASYQIEG AVNEGGRGES VWDEFCRMKG KIDDDDNGDS ACDSYHRYSE DIQLMKEIGI KAYRFSISWT RILPDGIGEI NMEGVNYYNN LINGLLENGI EPYVTLFHWD YPMELQYKGG WLNPESPLWF ENYAAICSRL FSDRVKYWIT SNESQCYIGF GYGTGWHAPG FKLPVNQVVR AWHHNLKGLG LAAKAIRENA KGEVKVGLVA CGEVGIPASD SEADMQAARN VLFDREHSED SIDFGYGDLF EPALKGEYPK SLIPYLPKGW QEDMKDICVP LDFLGVNAYI GSIVEACENK KYRHLKLPVG IGKTSMEWPF KPETLYWVTR FISERYKLPV YITENGMANN DWISTDGKIN DTQREDYLNQ YLSALSKSID DGADVRGYFY WSLLDNFEWA YGYAKRFGLV YVDYSNFSRT LKQSALRYKK IIELNGEVLK
|
| |