Gene Ccel_3438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3438 
Symbol 
ID7312494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp4002835 
End bp4004394 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content41% 
IMG OID643610347 
Productglycoside hydrolase family 43 
Protein accessionYP_002507706 
Protein GI220930797 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.990127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA CAAATTCCAA TTATTTCAGT AATCCAATAT TGCCCGGGTT TTATCCTGAT 
CCATCCATAT GTCGTGTAGA GGATGATTAC TATCTTGTTA CATCCAGCTT TACATATTTT
CCGGGCTTGC CTATATTCCA TAGCAAAGAT TTGGTTAACT GGAGGCAGAT AGGACACGCC
CTTGACAGAC CTTCCCAGCT TGACCTAGAT GGTTTAGAAC AGTCTCAGGG ATTATATGCC
CCAACCATTA GATACAATAA TGGCATCTTC TATATTGCAT GTACCAACGT TGGGAAAAAA
GGTAATTTCA TAATAACATC GGAAAAACCT GAAGGACCGT GGTCAGACCC ATATTGGATT
GCTGATGCAC CTGGTATAGA CCCGTCACTT TTCTTCGATG ATGATGGAAA GGTGTATTTT
ACTGGTACAA ATGATTCTCC TGATGGAACC TATTACGGTG ATAACGAAAT CTGGATGAGG
GAACTTGACA CCGGGAAGAT GCAGCTTACC GGTCCAAGAT ATGGCTTGTG GAGAGGTGCA
TTGAAGAATG CAATTTGGTC GGAAGCACCC CATATATATA AAATTAATGG ATACTACTAT
CTAATGATTG CCGAGGGCGG TACTGACTAT CACCATTCTG TCACTATAGC CAGAAGCAGG
GAGATAACAG GACCATATGA AGGCTATATA GGGAATCCTA TCATAACTCA TAGGCATTTA
GGAAGAAAAT ACCCTATTGC AAATGTAGGC CACGCTGATT TGGTTGAGAC CCAAAACGGT
GAGTGGTGGA TGGTAGCACT GGCATCAAGG CCGTATGGCG GGCATTATAG AAACCTTGGC
CGTGAAACAT TTCTTATCCC TGTAGAATGG GAAGATGGTT GGCCGGTAGT AAGCCCATTA
AGCGGAAAAG TAGAGTTTTC ATACCAAAGA CCTGCACTAT CCCCGGATAA TCCTGTTGAA
GTAACGGCTT GTGACCACTT TGACAATGAA AAGCTTAGTT TTATATGGAA TTTTATACGT
ACTCCCAGAG AAAACTTTTA CAGCTTGACT GACAGGCCGG GGCACTTGAG GCTTAACCTG
AAATCTCCTA AAATTAAAGA GCAGAAAAAT CCGAGTTTTA TTGGGAGACG TCAGCAGCAC
ATTAATTTCC GGGCAAAAAC GGTAATGGAA TTTGTACCTG GCAACGAAAA TGAAGCTGCA
GGTATATTAT TAATACAGAG TAACAATTAT CATATGAGGT TTGAATGTAC CAAATCAGGA
GAGAAGGATG TAGTAAGGTT GATTGTATGT AATGACGGTA AGGAAAGTAT TGTTGCCCAA
AGGGAAAATA CTTATACCCG GATTCACATG GTAGTTCAAG CCTACGGTCA GGATTACAGC
TTTTATTGCG GAGATGAAAA TGAATTGGTT GAACTGGCTG TCAATGTAGA CGGAAGAATT
CTCAGTACCG ATGTAGCAGG AGGATTTGTC GGGACGTATG TAGGGATGTT TACCAGCAGC
AATGGTTTTG ACAGCAGTAA TATGGCAGAT TTTGATTTAT TTGAATATAC AGGCTTATAA
 
Protein sequence
MTKTNSNYFS NPILPGFYPD PSICRVEDDY YLVTSSFTYF PGLPIFHSKD LVNWRQIGHA 
LDRPSQLDLD GLEQSQGLYA PTIRYNNGIF YIACTNVGKK GNFIITSEKP EGPWSDPYWI
ADAPGIDPSL FFDDDGKVYF TGTNDSPDGT YYGDNEIWMR ELDTGKMQLT GPRYGLWRGA
LKNAIWSEAP HIYKINGYYY LMIAEGGTDY HHSVTIARSR EITGPYEGYI GNPIITHRHL
GRKYPIANVG HADLVETQNG EWWMVALASR PYGGHYRNLG RETFLIPVEW EDGWPVVSPL
SGKVEFSYQR PALSPDNPVE VTACDHFDNE KLSFIWNFIR TPRENFYSLT DRPGHLRLNL
KSPKIKEQKN PSFIGRRQQH INFRAKTVME FVPGNENEAA GILLIQSNNY HMRFECTKSG
EKDVVRLIVC NDGKESIVAQ RENTYTRIHM VVQAYGQDYS FYCGDENELV ELAVNVDGRI
LSTDVAGGFV GTYVGMFTSS NGFDSSNMAD FDLFEYTGL