Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2274 |
Symbol | |
ID | 7310954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2656059 |
End bp | 2657084 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643609201 |
Product | hypothetical protein |
Protein accession | YP_002506591 |
Protein GI | 220929682 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0250087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA CAAAATTAAT CAGGCTTATT TGCATTATCT TTACTTTATG TTTATTTTTT AGCTTTACAT CGTATGCATT AACAGGGTCA ACTTTTTCAG ACAGCGGTAA CCACCTTGTA TATGTTAATA ATCCAGAGTC CTTTGGTCTT AATACGGGTG ATTTGGTATA TTTATACGGT ACAAACCTTG GAAATTCTTA TAAGGATGTT GAGTTTTATC ACCATTTGTA TAATGGATGG TCAAATGCAG GAGCATGCAG AGTTGGAGTG GCAATGATGA ACAAAGGTAC AAAGCCCGCA GTAATAACTT ACAAGGGAAG TTGCACTGCC ACACTGGATG GATATAATTT CGCTGTCATT GAGGACACTT CACAGGTACT AAAAAGCTTT CAGGATGCGA AGCATAAAAC TATTACTCTT AATCCCGGAG AAAAGAAAAT CGTTTGGTCG GATGACTTCA GATTTACGCC CGGTTTTTCT GAGTTTGTAT ATGGCAGAGC ACGGTTTAAA TCAAATCAGA AATTCGGTGT TTGGATGAGG GTATTTGCAG CAGGACAATC AAAAACCGCA GAAGATGTTT TCAAGGAAAA GCATCCGATA CAGGGAACAG GACATGGTTT TTGCGGTGAA CTGGCTTACA CAGAGAAAAA GGTTACCCTT GACGCGGATA GAAGTAACAT TACAAACCTT TGTGAATGGC CCCGACTACT CAATACATAT GAATACAGCG GTGTAATAAA CTCAAAACCT GGTACATCTC AATATCATGC CGGTAACTAC GGCGTCGTTT ACAATATAAC AGTAAATCAT TCAGCACAAA AAAAGATAAT AATAAACCCT AAGTGGAGCA GTGACAGGCC ATATGCATCC ATTGTATACA GTGTCAACAA TGGCCCTTGG ATTGCAGGTG AAAAAATTAC TACGGGTATG TGCTGGATAG AAGACCTGGG GGACGGCGAA ACTTCAAATT TTAGATTTAT GCTGCCTGGT GGAAATTGCG GAAGCTACTA TGTATCCTTC GAATAA
|
Protein sequence | MIKTKLIRLI CIIFTLCLFF SFTSYALTGS TFSDSGNHLV YVNNPESFGL NTGDLVYLYG TNLGNSYKDV EFYHHLYNGW SNAGACRVGV AMMNKGTKPA VITYKGSCTA TLDGYNFAVI EDTSQVLKSF QDAKHKTITL NPGEKKIVWS DDFRFTPGFS EFVYGRARFK SNQKFGVWMR VFAAGQSKTA EDVFKEKHPI QGTGHGFCGE LAYTEKKVTL DADRSNITNL CEWPRLLNTY EYSGVINSKP GTSQYHAGNY GVVYNITVNH SAQKKIIINP KWSSDRPYAS IVYSVNNGPW IAGEKITTGM CWIEDLGDGE TSNFRFMLPG GNCGSYYVSF E
|
| |