Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0972 |
Symbol | |
ID | 7309803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1163023 |
End bp | 1164276 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643607899 |
Product | glycosyltransferase, MGT family |
Protein accession | YP_002505314 |
Protein GI | 220928405 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAAGG TTATTTTTCT AGGATATTTG TTTCAGGGAC ACATTAATCC TACCCTTGGA TTAGTAAATG AATTAGTAAA TAGAGGAGAA GAAGTCATAT ACTATTCTGG AAAGGAATTC TGTAAAAAAA TAGAAGCAAC AGGTGCCAAA TTCCGTGACT ATGGGTTTAT ACAGGACCCT AGCGAATCCA ACAAGACTCA AAAGGTACAG GGCAATTTAG ATGCTTTTGC CCAAGTGGTT ACCTGGGTGC TGAATGTAGG AAAACAGATT ATAAGCAATA TCTCCAGTGA GATAGAAGCT GATAGGCCGG ATTATATTAT CCATGATTCG CAAGCTTTTT GGGGGAAAAG AATAGCAGGT AGCTTGGGTA TACCTGCGGT GTCTTCAATA GCAAGCTTCG CCCTAACTGG TAAAATGCTG GATATAGATC CTGACTTTTT TATTGAAAAC ATGCTGAGAA TGCCAAATGC CAGCTTGTTT GCAAAAAAGA AATCAAATAT AGTGAGACTG CTTGATTTGC TTTCGAGAAG AATATCTGCG GCATATGATG AGCCAGATTT TAATATATAT GATTTTGCAA ATAATACAGG AAAACTTAAT ATTGTGTATA CCTCCGAGTA TTTTCAACCA TATGGAGAAG TCTTTGATGA CAGCTTTAAG TTTGTAGGAC ATTCAATCTT TAAAAGGACT GAGAATGTAG ATTTTCCATT TGATAAGCTG GGTAGCCTTC CATTAATATA TATTGCACTT GGAACTGTCC GTACAAATCG TTTAGATTTT TATAGGGAAT GTTTTTCGGC ATTTGGTGAC ATGGAAATAC AGGTAGTTCT GTCTGTAGGA ACAAATGTTG ATGTATATCA ACTGGGGAAA ATCCCTGATA ATTTTATTGT TAGAAATTAT GTTCCTCAAC TTGAAATACT TAAATATGCC AGTGTATTTA TAACTCATGG AGGCACAAAC AGTGTAAATG AAGGACTTTA CAACAATGTT CCTTTAATAG TATATCCTCA GGGAGATGAT AACCATATTG TTGCAGGTAG AGTTGAAAAC CTTGGAGCAG GTATATATCT CAAAAATGAT GATATTAATG CTGAGGAGCT TAAGAATGCT ATAAGTCGGG TACTTTCTGA TGAAAACTTT AAAATTAATA GTAAAGCAAT TGGAGATACA CTAAAAACAG CAGGTGGGTA TTTAAGGGCA GTTGATGAGA TATTCAAATT CAAAGGGGAA CTAGACTATG AAAATATGGT GTAA
|
Protein sequence | MSKVIFLGYL FQGHINPTLG LVNELVNRGE EVIYYSGKEF CKKIEATGAK FRDYGFIQDP SESNKTQKVQ GNLDAFAQVV TWVLNVGKQI ISNISSEIEA DRPDYIIHDS QAFWGKRIAG SLGIPAVSSI ASFALTGKML DIDPDFFIEN MLRMPNASLF AKKKSNIVRL LDLLSRRISA AYDEPDFNIY DFANNTGKLN IVYTSEYFQP YGEVFDDSFK FVGHSIFKRT ENVDFPFDKL GSLPLIYIAL GTVRTNRLDF YRECFSAFGD MEIQVVLSVG TNVDVYQLGK IPDNFIVRNY VPQLEILKYA SVFITHGGTN SVNEGLYNNV PLIVYPQGDD NHIVAGRVEN LGAGIYLKND DINAEELKNA ISRVLSDENF KINSKAIGDT LKTAGGYLRA VDEIFKFKGE LDYENMV
|
| |