Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1101 |
Symbol | |
ID | 7309914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 1358085 |
End bp | 1359950 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643608025 |
Product | Spore coat protein CotH |
Protein accession | YP_002505440 |
Protein GI | 220928531 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GTCGCTTGCT ATTACTGTTA TCGGCTATTG CACTCGTCAC CTGCCTTTTT TCCATGTCCT TTGTCGGTGC TGCTACAGTG TACGGGGACC TGAATTCAGA CAGTTCCGTA GATTCCATCG ACTATTCCAT TATGAAGGGT TATCTTTTTG GCAGCGTGAC CATAAGTAAT CCTGCAGCAG CTGATGTAAA TGGAGACGGT AACATAGACG CTATTGATTT AGCATTAATG AAACAATATA TTCTGCATAT TATTACTAAA TTTCCGGCCG ATAACAATAC AACGCCCGGA GATAATACTC CTGCGGCCCA ATTGACCGGA GATATTGTTT TCTCCGCTCC AAGCGGTACT TTCAGCAATC AGGTTTCAGT GTCATTGAAT TCAAAGATTG CTGGTGCCCA AATCCGTTAC ACTGTTGACG GGAGTGTTCC TACCGCCAGT TCCCCTGCAT ATACCAGTCC TTTGTCGTTT ACCAGGACAA CACAGTTGAG AGCTCAATCT TTTGTAAACG GGACTCCAAG CGGTAAAATG GGCACAGCAA TCTACGTTTC CAGTGCCATA GATACAAAGC ATGATCTCCC TGTATTAATT CTGGATGCTT ACGGCGGAGG AAAACCTGCA CGTGAATATA AGGACGTTGC TATTATGCTG ATGGAACCAA AGAATAATGA AGTTTCCCTA CTACAAACAC CAACAGTCGC CACCCGCGCC GGCTTCCATG TACGTGGCCA GTCATCAGCA AATTTTGAAA AAACACCTTA TCGTCTTGAA TTGTGGGATA ATCAGAACGA AGATGCCAAG TATTCCTTGT TGGGTATGCC CGGCGACGGC GACTGGGCGT TACTTTCACC TTTTCCTGAT AAATCTCTTA TAAGAAATGC TCTTGCTTAT GAATTAGGAA CAACTATGGG ATTGAAAGCA CCAAGGTACA GATTTGTGGA AGTGTATCTC AATCTTGATA ATCAGCCGTT ATCCTCAGCA GACTACCAAG GAGTATATCT TCTCACAGAA ACACTTGAAA TCGACAAGGA CCGTGTTAAT ATTAAAAAGC TCAAGGACGA TGATCTAACA GAACCGAACA TAACCGGAGG TTACCTGATG CAGTTCAATA TGATGGCAAC AGACGGACCG TTGGTAAAAG GGTCGGGCTG GAACGATCTT GAGATAAAAG ATCCTGATGA CCTGCTGCCC CAACAGTTGA CATGGATAAG CAACTATATT CAAAAGGTGC ATAACTCTAT TCGCAGTACT AATCCTTCTG ACCCAACAAC CGGGTATCCT GCTTACATCG ATGTTGATTC CTTCATAAAC TACATTATCG AAAATGAGCT TGCCCGTGAA GGTGACTCAT ACATGCGCAG CACCTACATA TATAAGGACC GTGGTGCAAA GCTGGCGGCA GGTCCGGTTT GGGACTATGA TCTGGGTTAC AACTGCGTAA CAGGTATGAT GGGTATGCAG ACAAATTACG TAGAAGGTTG GCAATTTCAG CCAATGTTTG GAATGAGTTC CACATGTGAT TGGTACTACA AGCTTATGCA GGACTCTGCT TTCCAAAGCA AGATAAGTGC TCGCTGGCAG GAATTACGTA ATGGTCCCCT TTCCGACACA CAGATAAAAG CACTGGTTCA AAAGCTGACA ACACCTTTAG CCAACGGAGC CAAACGTAAT TTTCAGAAAT GGAACAATCT GGGCACAGCC ACTGTAGGTG GTTTCAGTAC CCAAACCACC CAGACATGGG AAGAACAAGT AACAATTTTA CAGAACTTTC TGCTCCAAAG AGCTGCTTGG TTGGATAAAT CCGGATGGAA GCCAACCACA AATACAAATC CCGGATGGCC CGGTTGGGGT GGTTGA
|
Protein sequence | MKKSRLLLLL SAIALVTCLF SMSFVGAATV YGDLNSDSSV DSIDYSIMKG YLFGSVTISN PAAADVNGDG NIDAIDLALM KQYILHIITK FPADNNTTPG DNTPAAQLTG DIVFSAPSGT FSNQVSVSLN SKIAGAQIRY TVDGSVPTAS SPAYTSPLSF TRTTQLRAQS FVNGTPSGKM GTAIYVSSAI DTKHDLPVLI LDAYGGGKPA REYKDVAIML MEPKNNEVSL LQTPTVATRA GFHVRGQSSA NFEKTPYRLE LWDNQNEDAK YSLLGMPGDG DWALLSPFPD KSLIRNALAY ELGTTMGLKA PRYRFVEVYL NLDNQPLSSA DYQGVYLLTE TLEIDKDRVN IKKLKDDDLT EPNITGGYLM QFNMMATDGP LVKGSGWNDL EIKDPDDLLP QQLTWISNYI QKVHNSIRST NPSDPTTGYP AYIDVDSFIN YIIENELARE GDSYMRSTYI YKDRGAKLAA GPVWDYDLGY NCVTGMMGMQ TNYVEGWQFQ PMFGMSSTCD WYYKLMQDSA FQSKISARWQ ELRNGPLSDT QIKALVQKLT TPLANGAKRN FQKWNNLGTA TVGGFSTQTT QTWEEQVTIL QNFLLQRAAW LDKSGWKPTT NTNPGWPGWG G
|
| |