Gene Ccel_3002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3002 
Symbol 
ID7311611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3553773 
End bp3554981 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content39% 
IMG OID643609906 
ProductN-acylglucosamine 2-epimerase 
Protein accessionYP_002507276 
Protein GI220930367 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGATA AATTAATTAA TAAACTGGTT AACGAACTGG AGACCGAACT CAGGAATGAC 
ATTTTGCCGT TTTGGATTAA TAACGCCGTT GATACCGACA ATCGCGGGTT TTACGGCTTA
ATATCATCTG ACCTTACCAT TGATAAAACT CACGCAAAAG CAGCCGTATT GAATGCGAGA
ATTCTATGGA CATATTCAAA GGCCTACAGT CGTTATAAGG AAGGAAAGTA TTTTTTCATG
GCAGAACGTG CATATAATTA TATAGTTGAC TTCTTTATCG ACAAAGTAAA TTCAGGTGTA
TACTGGCTTC TGGATTATAA GGGCAATGTC CTAAACTCAA AAAAGCAGAC TTATGCAATT
GCATTTGCAA TCTATGGTTT ATCAGAATTT TTCCTTGCAA CAGGCAGAAA GGAAAGTCTA
ACTAAAGCTA TAGAGCTTTA TAACGCATTG GAAACTCATA CATGGGATTG TGTAAACAAG
GGATACTATG AAGCACATAC AACGGATTGG CAGCCTCTTG CCGATATGTC TCTAAGCCCT
GCGGATATGA ATGTTTCAAA GTCAATGAAC ACCCACCTCC ATATTATAGA AGCCTACACA
AACCTGTACA GGGTTTGGAA GGATGCCAGA CTAAAATCAA CTCTAGAAGA GATTATCAAT
ATTACAATTA ATCATATAAT AGACCCTAAG AAACATTCCT TTAATCTGTT TTTCGATGAA
AAATGGAATC CTGTTTCTGA AAAAATATCC TTCGGACATG ATATTGAGGG CAGCTGGCTT
TTATGTGAGG CAGCCGAAGT CCTTGGCAAC AAAGAATTAA TTAAAAGAGT AAGTGAAATC
TCCGTAGCGA TGGCTCAAAG AGTGTATAAC ACAGGCATTG ATACCAAGTA TGGAGGTCTT
TTTTACGAAC AGGACAAAAA TGTTATTGAA ACGATTAAGG ACTGGTGGCC CCAGGCAGAA
GCAGTTGTGG GTTTCTCCAA TGCATATCAG CTGACAGGCA ACGACTGCTT TATGATTGAA
GCTGTTAATA CGTGGAGTTT CATTAAAGCA CACATTATTG ACAAGGTTCA TGGTGAATGG
GTCTGGGGAA CTTCCGCAGA CGGATTAAAT GTCACAAACA ATGAAAAAGC AGGCCCGTGG
AAATGTCCTT ATCACAACAG CAGAATGTGT TTTGAGATTT TACAGAGATT CAAAAAGCGA
AAGTCTTAG
 
Protein sequence
MDDKLINKLV NELETELRND ILPFWINNAV DTDNRGFYGL ISSDLTIDKT HAKAAVLNAR 
ILWTYSKAYS RYKEGKYFFM AERAYNYIVD FFIDKVNSGV YWLLDYKGNV LNSKKQTYAI
AFAIYGLSEF FLATGRKESL TKAIELYNAL ETHTWDCVNK GYYEAHTTDW QPLADMSLSP
ADMNVSKSMN THLHIIEAYT NLYRVWKDAR LKSTLEEIIN ITINHIIDPK KHSFNLFFDE
KWNPVSEKIS FGHDIEGSWL LCEAAEVLGN KELIKRVSEI SVAMAQRVYN TGIDTKYGGL
FYEQDKNVIE TIKDWWPQAE AVVGFSNAYQ LTGNDCFMIE AVNTWSFIKA HIIDKVHGEW
VWGTSADGLN VTNNEKAGPW KCPYHNSRMC FEILQRFKKR KS