Gene Information |
Locus tag | Ccel_1656 |
Symbol | |
ID | 7312263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 1997345 |
End bp | 2000200 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643608584 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002505987 |
Protein GI | 220929078 |
COG category | |
COG ID | |
TIGRFAM ID | |
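The coordinate and length fields above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1997345
end_bp = 2000200

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (2856 bp) and Protein Length (951 aa).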
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00148801 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAAAAAA ACAAGGTATC AAAAAAATTT ATATCATTAA TTCTTTCTTT AGTAATTGTG TTATGCAGTA TTTTCACATT TACGGTATCA CCGATGGTTG TTACTGCTGC AACGCAGGCA GCCTATTACG TATCCCCTAC GGGAAGTGAC GGTAACCCGG GAACCATAGA TGCACCCTTT CAGACAATCA CAAAAGCCAG GGATGTAGTA CGTACTGTAA ACAAAAATAT GACGGGTGAC ATTTATGTTT ATTTAAGAGG AGGAGACTAT CGATTAACCA GTACCATTAA TTTTGGACCG GAGGATTCCG GGACAAACGG ATTCAGAATA TTTTATCAGG CATATCAAGA TGAAATACCT GTTTTAAACG GTGCAACAAA GGTTACAGGG TGGACTCAAC ATAACGGAAA TATATATAAG GCTACCTTGA ACCGCAATAC CAAATTGAGA TACCTTTTTG TGAATGACAA GAGAGCTCAG ATGACTAAAA AAACAGTAAA AGGTCAGGGC GGGTATGGAA CTTATGCTGT TACAAAAGGA CAGGCCAGCT GGGCTTGGAC AAGCGGCAGC AATAGTGACG GAGTTAAGTA TAATGCGTCG GATGTGCCGG AAATCACAAG CAATAAGGAT GACCTTGAGA TAGTAAACGG TACCACTTGG AACGAAAATA TTGTTTGTAC AAGAGATGTT ATTACATCAG GTTCTTCAAG AGTACTTCTT TTACAGCAGC CATATGGAGC AATAGCACAA CTGCCCGGTT GGGGTGCAGC TTTCACTACT ACCAGTACAC ATACAATTTA CAATGCATTT GAATTTTTAA ATTCTCCGGG GCAATTCTAT TTTAATAAGA CTACAAAAAC GCTATATTAT TATCCGCGTC CAGGTGAGGA CATGTCAACT GCCGATGTTG AGGCCCCTGT TTTAGAGAAA CTAATCGACA TCTCAGGGAA ATCAAACACA AACAGAGTCA AGAATATAAC CTTCCAGGGC ATTACCTTTG CAAATACGGA CTGGAACCTT ATAAAAGTCG GAGATTCTTA CGGCAGAACA ACATGCCAGA GTGCAGACGG CTTTATAGCT TATTATAACG GTAATTGGCA TGACACCAAA TACACTCTTC TTGATACATA TCCGGGCATG ATAAACGTAA GCAGCAGTGA TTCCATCAAC TTTACAGGTA ATATAATAAA GCACAGTGCT GCCGACGGAA TTACCATGGT AAACGATGTT ATAAATTCAA ATCTTGTAGG TAACTATATT TATGACATCA CATCAAGTGG GATTACAGTA GGTCATCCAC AGCATGTGTA CTTGGGTGAC GGTGGAGAAC ACCAGAAATA CGCACCGGGT GTAGAAGGTA TTTGCACCAA TGTTGTAATC AACAATAATA TGCTGTGGGA TTGCAGCACT GCTCCCGGAT TCGGGGGATG CGCCGGTATA ACGGCATTTT TTGTAAACAA ACTCAAGGTA ACGTACAACA CAGTTCATAC CACGGGCTAT AATGGTATCA CGCTGGGGTG GGGCTGGTGT AATTTCCTCG ACTCAAATAC CTGTAAGGAC AACGTAATAA ATAATAATCG TATATATAAT GCATTAAACA GGCTTCATGA CAGCGGAGCA ATATACACAA TAGGCCAAAT GCCTTATACG ACTATAAATG AGAATTATGT TAAGGGAATT CCTGACAATT CAACTGGACC TACTTATGGT TTACATAACG ACGAAGGAAG TGCGTTCATA ACTGAAAATG ACAATGTGCT TGATATTAGT CCGGGGGTAA CGTATACAAT CAACTGTGAG 
GATTTCGGAC AAAAACACGA CTTAACAATA CTAAGAACCT ATGCGACGGT TAATAAAATG GGTAAAAATC CGCCAAAAAG CACAATAGAC ACACCCGTTG CAGTTCCTGA TAATGTATGG CCTTTAGCAC AGTACAATAT AGCTTTAAAT GCGGGAGTTC AGGAAGCATA CAGGAGTATT CTGCCAAGTA ATCTCTTCCC TGTTCAGGAT TATGTATTTC CCGCAAGCTG TGCCACAAAA TGTGGAGCTG ATTTAAATAT AAGAAGCAGC GGTAATGCAG CAAATACAGT ATGGTTTGCT CCCGCCGGGA CAACCAAATT TGTTGTTGGG CCAACTATGA CAAAGGCGGT AGGAACTGCT ACATCAATTG TAACACCTGA AACAGCAGGG ACATACAAAT TATTTGTAGT AAATTCATCT GGTGCTAAGA TAGGTGAATC TGATGCATTG CTGAGAGTGA GCGGTTCCGC TTCACAAATT GAAGCTGAGA GTTTCTCCTC ACAATCGGGA GTTCAAACTG AAAACTGCAG TGAAGGCGGA CAGGATGTAG GATATATTGA AAATGGAGAT TATACGGTTT ACAACAATTT TGATTTCAAA AACGGAGTTA CGGGCTTTAA GGCCAGAGTA GCCAGTGGTG CCAGCGGGGG AAATATAGAG ATTAGGCTGG ACAGTATTAC AGGACCTTTA GTAGGAACGT GTCCGGTGAC GTCCACAGGA GGATGGCAGA CCTGGGCTGA TGCTACATGT AACGTCAGCG GAGTCAGCGG AATTCACAAT TTGTATCTTA AATTTACAGG TGGTAGCGGG TACTTATTTA ACATTAACTG GTTTGAATTT ACAGGCGGAG GTACGACTCC TGTACTTACA GGAGATGTCA ATGGAGATGA TAGCGTTGAT GCAACAGACT ATGCCATGAT GAAAAAGTAT CTTCTCGGTT TAATCAATGA TTTCCCTGTA GAAGACGACC TCAAAGCGGG TGACATAAAT AAAGATGGTG TTATTGATGC AATTGATTTG GCTCTGCTAA AGAAAAACCT GCTAAGCGGT ACTTAG
Protein sequence | MQKNKVSKKF ISLILSLVIV LCSIFTFTVS PMVVTAATQA AYYVSPTGSD GNPGTIDAPF QTITKARDVV RTVNKNMTGD IYVYLRGGDY RLTSTINFGP EDSGTNGFRI FYQAYQDEIP VLNGATKVTG WTQHNGNIYK ATLNRNTKLR YLFVNDKRAQ MTKKTVKGQG GYGTYAVTKG QASWAWTSGS NSDGVKYNAS DVPEITSNKD DLEIVNGTTW NENIVCTRDV ITSGSSRVLL LQQPYGAIAQ LPGWGAAFTT TSTHTIYNAF EFLNSPGQFY FNKTTKTLYY YPRPGEDMST ADVEAPVLEK LIDISGKSNT NRVKNITFQG ITFANTDWNL IKVGDSYGRT TCQSADGFIA YYNGNWHDTK YTLLDTYPGM INVSSSDSIN FTGNIIKHSA ADGITMVNDV INSNLVGNYI YDITSSGITV GHPQHVYLGD GGEHQKYAPG VEGICTNVVI NNNMLWDCST APGFGGCAGI TAFFVNKLKV TYNTVHTTGY NGITLGWGWC NFLDSNTCKD NVINNNRIYN ALNRLHDSGA IYTIGQMPYT TINENYVKGI PDNSTGPTYG LHNDEGSAFI TENDNVLDIS PGVTYTINCE DFGQKHDLTI LRTYATVNKM GKNPPKSTID TPVAVPDNVW PLAQYNIALN AGVQEAYRSI LPSNLFPVQD YVFPASCATK CGADLNIRSS GNAANTVWFA PAGTTKFVVG PTMTKAVGTA TSIVTPETAG TYKLFVVNSS GAKIGESDAL LRVSGSASQI EAESFSSQSG VQTENCSEGG QDVGYIENGD YTVYNNFDFK NGVTGFKARV ASGASGGNIE IRLDSITGPL VGTCPVTSTG GWQTWADATC NVSGVSGIHN LYLKFTGGSG YLFNINWFEF TGGGTTPVLT GDVNGDDSVD ATDYAMMKKY LLGLINDFPV EDDLKAGDIN KDGVIDAIDL ALLKKNLLSG T
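The relationship between the two sequences can be illustrated by translating the first three blocks of the gene sequence. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAG even though the gene lies on the minus strand of the replicon). It also relies on translation table 11 assigning the same amino acids to codons as the standard code, differing only in which codons may act as starts. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G order.
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCAAAAAA ACAAGGTATC AAAAAAATTT"
print(translate(first_blocks))  # MQKNKVSKKF
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, provided the two lines of the gene sequence are joined first.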