Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0950 |
Symbol | |
ID | 7312164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1131299 |
End bp | 1132606 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643607878 |
Product | HI0933 family protein |
Protein accession | YP_002505293 |
Protein GI | 220928384 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1635] Flavoprotein involved in thiazole biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00109997 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTG AATCAGTATC ATATCAGAAA TCTTTGGAGG TAGCAGCCCA TTATGATGTT GTTGTCATAG GCTCCGGCCC TGCCGGCATT TGTGCAGCTG TTTCAGCGGC TCGAATGGGG GCTAAAACTG CATTAATTGA GCGTTATGGT ACCCTTGGTG GTAATTTGAC AAATGGTGCT GTTGCACCAA TTCTCGGAAG TGTATCGAGA GGAACATTAC GAGATGAGCT TATTGTACGT CTTGGTGTTC CGGACAGCGA CGAAACAGGT GTAACTCAAC AAACTCACGA CTTTGAAAAA GCAAAGCGCG TGCTTGTAGA TTTTGCACAT GAAGCAGGTG TTAAGATTTA TTTACAAACC CCCGTTGTTG ATACGATAAC AGATGGAAAA AGGCTTACAG GTATTGTCAT TTCTCAAAAA ACGGGTATGA AGGTAATCAG GGCAAAATCC TTTATTGATG CTTCCGGTGA CGGTGATGTT GCATACTTTG CAGGTGCAGA ATATGAAATG GGACGCGAAG GAGATTCCTT GCTACAACCT GTTACTTTGA TGTTCCGCTT GCAGGGCGTT GAGGATGATG CACTAACTTG TATAGGTGAG CTTGACCACG TTACTTACAA AGGTGAAAGA TTCCTTGATT ATACAACCAG ACTTTGCAAC GAAGGCCATT TACCGCCTAA TGCGGCATCG GTACGTTCAT TCCGCACTTC TGTTCCCGGT GAACGTGTTA TCAACACTAC GCAGACAAAT GGTATTTCTG CATTGTTAAG TGATGATTTG GAAAAGGCTG AGGTTGACCT ACGTGCCCAG ATTGATGCTG TAACTGATTT TCTGCGAAAG TATGTTGATG GTTATCAGAA TTGTTTTGTA AAGAGTACGG CAAATACTCT GGGAGTTCGT GAGACTCGCC GTTTTATTGG ACAATATATG CTTCAGGACT TAGATCTTCG TACAGGAAGA AGATTTGAAG ATGTTGTTGT TCACAAGGCA AGCTTTATAG TGGACATTCA TAATCCAGCA GGTAGTGCAC AGGCTGAAGG AGTACCTGAG GAGGTAATTC CTTATGATAT TCCTTTGCGC AGTTTGATTC CAAAGAGCTT GGATGGTTTA GTTCTTGCTG GAAGATGTAT TTCGGGTACG CACCGAGCAC ATGCTTCTTA TCGAGTTATG TCAATCTGTA TGGCGATAGG TGAAGCGGCG GGTATAACTG CTGTACTAGC AGCACAAAAG GATATATCAC CTCGTGAAGT TGATTTCAAA GATGTACAGA AAGTTTTGAG TGAACGGGGA GTTGAGCTTT TCGATTGA
|
Protein sequence | MSFESVSYQK SLEVAAHYDV VVIGSGPAGI CAAVSAARMG AKTALIERYG TLGGNLTNGA VAPILGSVSR GTLRDELIVR LGVPDSDETG VTQQTHDFEK AKRVLVDFAH EAGVKIYLQT PVVDTITDGK RLTGIVISQK TGMKVIRAKS FIDASGDGDV AYFAGAEYEM GREGDSLLQP VTLMFRLQGV EDDALTCIGE LDHVTYKGER FLDYTTRLCN EGHLPPNAAS VRSFRTSVPG ERVINTTQTN GISALLSDDL EKAEVDLRAQ IDAVTDFLRK YVDGYQNCFV KSTANTLGVR ETRRFIGQYM LQDLDLRTGR RFEDVVVHKA SFIVDIHNPA GSAQAEGVPE EVIPYDIPLR SLIPKSLDGL VLAGRCISGT HRAHASYRVM SICMAIGEAA GITAVLAAQK DISPREVDFK DVQKVLSERG VELFD
|
| |