Gene Ccel_0972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0972 
Symbol 
ID7309803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1163023 
End bp1164276 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content35% 
IMG OID643607899 
Productglycosyltransferase, MGT family 
Protein accessionYP_002505314 
Protein GI220928405 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAAGG TTATTTTTCT AGGATATTTG TTTCAGGGAC ACATTAATCC TACCCTTGGA 
TTAGTAAATG AATTAGTAAA TAGAGGAGAA GAAGTCATAT ACTATTCTGG AAAGGAATTC
TGTAAAAAAA TAGAAGCAAC AGGTGCCAAA TTCCGTGACT ATGGGTTTAT ACAGGACCCT
AGCGAATCCA ACAAGACTCA AAAGGTACAG GGCAATTTAG ATGCTTTTGC CCAAGTGGTT
ACCTGGGTGC TGAATGTAGG AAAACAGATT ATAAGCAATA TCTCCAGTGA GATAGAAGCT
GATAGGCCGG ATTATATTAT CCATGATTCG CAAGCTTTTT GGGGGAAAAG AATAGCAGGT
AGCTTGGGTA TACCTGCGGT GTCTTCAATA GCAAGCTTCG CCCTAACTGG TAAAATGCTG
GATATAGATC CTGACTTTTT TATTGAAAAC ATGCTGAGAA TGCCAAATGC CAGCTTGTTT
GCAAAAAAGA AATCAAATAT AGTGAGACTG CTTGATTTGC TTTCGAGAAG AATATCTGCG
GCATATGATG AGCCAGATTT TAATATATAT GATTTTGCAA ATAATACAGG AAAACTTAAT
ATTGTGTATA CCTCCGAGTA TTTTCAACCA TATGGAGAAG TCTTTGATGA CAGCTTTAAG
TTTGTAGGAC ATTCAATCTT TAAAAGGACT GAGAATGTAG ATTTTCCATT TGATAAGCTG
GGTAGCCTTC CATTAATATA TATTGCACTT GGAACTGTCC GTACAAATCG TTTAGATTTT
TATAGGGAAT GTTTTTCGGC ATTTGGTGAC ATGGAAATAC AGGTAGTTCT GTCTGTAGGA
ACAAATGTTG ATGTATATCA ACTGGGGAAA ATCCCTGATA ATTTTATTGT TAGAAATTAT
GTTCCTCAAC TTGAAATACT TAAATATGCC AGTGTATTTA TAACTCATGG AGGCACAAAC
AGTGTAAATG AAGGACTTTA CAACAATGTT CCTTTAATAG TATATCCTCA GGGAGATGAT
AACCATATTG TTGCAGGTAG AGTTGAAAAC CTTGGAGCAG GTATATATCT CAAAAATGAT
GATATTAATG CTGAGGAGCT TAAGAATGCT ATAAGTCGGG TACTTTCTGA TGAAAACTTT
AAAATTAATA GTAAAGCAAT TGGAGATACA CTAAAAACAG CAGGTGGGTA TTTAAGGGCA
GTTGATGAGA TATTCAAATT CAAAGGGGAA CTAGACTATG AAAATATGGT GTAA
 
Protein sequence
MSKVIFLGYL FQGHINPTLG LVNELVNRGE EVIYYSGKEF CKKIEATGAK FRDYGFIQDP 
SESNKTQKVQ GNLDAFAQVV TWVLNVGKQI ISNISSEIEA DRPDYIIHDS QAFWGKRIAG
SLGIPAVSSI ASFALTGKML DIDPDFFIEN MLRMPNASLF AKKKSNIVRL LDLLSRRISA
AYDEPDFNIY DFANNTGKLN IVYTSEYFQP YGEVFDDSFK FVGHSIFKRT ENVDFPFDKL
GSLPLIYIAL GTVRTNRLDF YRECFSAFGD MEIQVVLSVG TNVDVYQLGK IPDNFIVRNY
VPQLEILKYA SVFITHGGTN SVNEGLYNNV PLIVYPQGDD NHIVAGRVEN LGAGIYLKND
DINAEELKNA ISRVLSDENF KINSKAIGDT LKTAGGYLRA VDEIFKFKGE LDYENMV