Gene Information |
Locus tag | Ccel_1002 |
Symbol | |
ID | 7309829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1244410 |
End bp | 1245879 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643607929 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002505344 |
Protein GI | 220928435 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
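The Start bp, End bp, Gene Length, and Protein Length fields above are tied together by simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the listed values are consistent with that convention) and an unspliced CDS whose final codon is a stop codon; the helper names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Ccel_1002 record above.
length = gene_length(1244410, 1245879)
print(length, protein_length(length))  # prints "1470 489"
```

Both results match the Gene Length and Protein Length fields in the record.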
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTA AAGTTGCTTT TATAGGAGCG GGAAGTCTTG TGTTTACGAG AACACTGTTT ACAGACATAA TGTCGGTTCC TGAATTCAGG GATATTAAAA TTGCATTTAC CGACATTAAC GGAGATAATC TTCAAAAGGT GGCGGAATTG TGCCAGAGAG ATCTTGAGGC CAATGGTATC ACTACTAAAA TCCAGGCTAC CACTGACAGG CGGGAGGCTT TCAAGGATGC AAAATATATT GTGAATTGTG TTCGTATAGG AGGCCTGGAA GCTTTTGAAA CAGATATAGA CATACCGTTA AAATACGGTG TTGACCAATG TGTGGGAGAT ACTCTCTGTA CAGGTGGGAT TATGTATGGA CAGCGTGTTA TAGCCGCAAT GTTGGATTTT TGTAAAGACA TAAGAGAAGT TTCGGCACCC GGAGCAATTC TGCTGAACTA CTCAAATCCT AATGCTATGG CAACCTGGGC CTGCAACAAG TACGGTGGAG TTCGCACCAT AGGGCTTTGC CACGGTGAAA TTCATGGCGA GGATCAGATT GCCCAGGTGC TGGGAATACC AAGAAACGAA CTTGACATCA TCTGTGCTGG TATAAACCAC CAAACGTGGT ATATTTCAGT AAAACACAAG GGAAAAGAGC TGTTGGACAA AATACTTCCC GGATTTGAGG CACACCCCAA GTTCAGCGAG GAAGAAAAGG TCAGAATTGA TGTACTAAAG CGCTTTGGTT ACTATTCCAC AGAATCAAAC GGTCATCTTT CGGAATACGT GGCATGGTAC CGTAAGAGAC CCCAGGAAAT AATGAAATGG ATAAACCTTG ATAGCTGGAT TAACGGTGAA ACAGGCGGCT ATTTGAGAAT TACAAGGGAA GAACGAAACT GGTTTGAGAC TGATTACCCA AAGATACTTG CTGAACCTCC AAAAAAATAT GACGGTTCGT CCAGAGGGAG GGAACATTGT TCCTACATAA TCGAGTCTTT GGAAACAGGA AGAAAATACA GAGGACATTT TAATGTAATG AATGAAGGCT GTATTACGAA CCTTCCGTAT GAGTCGGTGG TTGAAGTTCC CTGTTACGTG GACGGCAACG GTATATCTGT CCCGAAGGTA GGAGATTTGC CACTGGGCTG TGCCGCAGTT TGTTCACAAT CCATATGGGT ACAGCGGCTT GCTGTTGAGG CGGCGGTATC AGGGAATGTA ACTCTGCTGA AACAGGCAGC TTTGATGGAC CCACTCACCG GAGCCGTTTG CAATCCGCCG GAAATATGGC AAATGATTGA CGAAATGCTA ATAGCTCAGG AAAAGTGGCT TCCTCAGTAT GTTGAAGGTA TCAAAGCTGC AAAAGAGAGA TTTGCCAAGG GAAACCTTAT TCCTATAAAT GAAGGATATC GTGGTGCAGT AAGACAAAGG GTCAAAACTC CTTCCGAGGT AGCTGCCGAA CGTGAATCCA GAAGTATTAC AGCTGATTAA
|
Protein sequence | MSFKVAFIGA GSLVFTRTLF TDIMSVPEFR DIKIAFTDIN GDNLQKVAEL CQRDLEANGI TTKIQATTDR REAFKDAKYI VNCVRIGGLE AFETDIDIPL KYGVDQCVGD TLCTGGIMYG QRVIAAMLDF CKDIREVSAP GAILLNYSNP NAMATWACNK YGGVRTIGLC HGEIHGEDQI AQVLGIPRNE LDIICAGINH QTWYISVKHK GKELLDKILP GFEAHPKFSE EEKVRIDVLK RFGYYSTESN GHLSEYVAWY RKRPQEIMKW INLDSWINGE TGGYLRITRE ERNWFETDYP KILAEPPKKY DGSSRGREHC SYIIESLETG RKYRGHFNVM NEGCITNLPY ESVVEVPCYV DGNGISVPKV GDLPLGCAAV CSQSIWVQRL AVEAAVSGNV TLLKQAALMD PLTGAVCNPP EIWQMIDEML IAQEKWLPQY VEGIKAAKER FAKGNLIPIN EGYRGAVRQR VKTPSEVAAE RESRSITAD
|
| |
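The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind that could be compared against the GC content field; the function names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAGTTTTA AAGTTGCTTT")
print(len(fragment), gc_fraction(fragment))  # prints "20 0.25"
```

Applying the same two functions to the full Gene sequence field would give the value to compare against the reported GC content.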