Gene Ccel_2226 details

Gene Information

Locus tag: Ccel_2226
Symbol:
ID: 7310912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2602977
End bp: 2604752
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 40%
IMG OID: 643609158
Product: glycoside hydrolase family 9
Protein accession: YP_002506548
Protein GI: 220929639
COG category:
COG ID:
TIGRFAM ID:
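
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2602977
END_BP = 2604752
GENE_LENGTH_BP = 1776
PROTEIN_LENGTH_AA = 591


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1776
print(expected_protein_length(GENE_LENGTH_BP))  # 591
```

Under those assumptions both derived values agree with the recorded Gene Length and Protein Length.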


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTGGA AAAATACAAT TATAAGCCTT ACTCTGGCAG GCAGTTTTAT TTTTACCATG 
AGTTCATGTA CTTCCAAGCC TGTAAGCAGT AATGTTCCAG AAGTAACAAA TCAGGCGGTT
ACAAGTATGT CTCAAGATTT AAAAGCTGAA AACGAAATAA GTGATGATGT ACATGTAAAC
CAGTTAGGTT ATAAGACACT GTCTAAAAAG ATAGCTGTTA TTAAGGGGCA ATATAAGAAG
TTTCAGGTGG TTGACAGTAA AACAGGTGTG GCAGTTTTAA CAGGAGATCT TACTGGGAAT
CCAAAGGACG AATCCAGCGG AGATACCGTA TGCTATGCGG ACTTTAGCAA AATTACAGTA
CCCGGTAAGT ACTTTATCTC GATATCAGGA TTAGGTAAAT CATATGATTT TTTAATAGAT
GATAATGTAT ATTCAAAGCT TGGTGACGCT ATGCTTAAAG CACTTTGTTA CCAGCGGTGC
GGAACAGCAC TTACACCTGA TTTTGCAGGT GAATACAGCC ATGCGGAATG TCACAAGACA
TTAGCTAAAT TCTATAACGA TGAGACAAGG GAGATTGACG TAAGCGGAGG CTGGCATGAT
GCAGGAGATT ATGGAAGATA CGTTGTACCT GCAACAGTAA CAGTTGCAGG TTTACTTCTG
GCATACGAGT TTTATCCTCA GGTATTTACG GATGCAGTAA GGATTCCTGA AAGCGGCAAT
AAAATACCGG ATGTTCTTGA TGAAGCAAAA TACGGAATTG AGTGGCTTCT TAAAATGCAG
GATAGTGAAT CGGGCGGAGT ATATCACAAA GTTACTTCAA GGGTATTCCC GGAAATGACA
ACAATGCCTG ACAAGGATGT TGACAATCAG CTTGTAATGT CCATATCAAC TAATGCTACT
GCTGATTTTG CAGCAGTTAC GGCAATGGCT TCAAGAATAT ATGTTACTAT AGATCCAGTA
TTTTCTCAAA ACTGTTTGCA GGCCTCCCAA AAAGCTTGGG AATGGCTTGA AAAAAACAAG
GATTTCGTCA GTTTCAAGAA TCCTTCCGAT GTGGCATCAG GAGAGTATGG CGACAGTTCA
GGAAAGGATG AGAAGGCTTG GGCCGCTGCA GAACTCTTCA GAGCCACAGG AAACGAGAAG
TACAACGAAT ATTTTACAGA CAATTATCAA ATTGAAGGCT TTGGACTTGG ATGGCAAAAT
GTAAGCGGAT TTGCGGCAAT TGCATATATG TTTTCAGACG TATCAGGAAC CGATCAGAAA
AAAGTAGATG AAATCAAAAA GGCATGGCTT GATAAAGCGG ATATGTTTGC CTCTACTGGA
CAAAAGGATG GGTATCTGGT AGCAATGCAT AAAATGGAGT ATAATTGGGG GAGCAACATG
AATGTCGCAA CACATGCCAT GCATCTGCTC ATATCTGACC GTCTTAAAAC TGATGATAAA
TATATACACA CAGCAGAAGA TTGCACCCAT TATCTGTTGG GGAGAAATAC ACTTAACCAG
AGCTATATAA CAGGGTTTGG GTCAAAGCAG GTAAAGAAAC CACATCATAG ACCGTCAGCG
GCTGATCTGG CATTAAATCC TGTTCCAGGA TTAATGGTAG GGGGGCCGGA CTCGGCACTG
GAGGATGATG TAGCAAAAAG CAAACTCTCA GGGAAGTTTC CTGCTGAGTG CTATATTGAT
GATATAAATT CCTTTTCAAC AAATGAAGTT GCTACCTACT GGAATTCGCC GGTAATTTTT
ATTCTAGGTT ATTTAAACTC AAATAGATTA TTATAA
 
Protein sequence
MNWKNTIISL TLAGSFIFTM SSCTSKPVSS NVPEVTNQAV TSMSQDLKAE NEISDDVHVN 
QLGYKTLSKK IAVIKGQYKK FQVVDSKTGV AVLTGDLTGN PKDESSGDTV CYADFSKITV
PGKYFISISG LGKSYDFLID DNVYSKLGDA MLKALCYQRC GTALTPDFAG EYSHAECHKT
LAKFYNDETR EIDVSGGWHD AGDYGRYVVP ATVTVAGLLL AYEFYPQVFT DAVRIPESGN
KIPDVLDEAK YGIEWLLKMQ DSESGGVYHK VTSRVFPEMT TMPDKDVDNQ LVMSISTNAT
ADFAAVTAMA SRIYVTIDPV FSQNCLQASQ KAWEWLEKNK DFVSFKNPSD VASGEYGDSS
GKDEKAWAAA ELFRATGNEK YNEYFTDNYQ IEGFGLGWQN VSGFAAIAYM FSDVSGTDQK
KVDEIKKAWL DKADMFASTG QKDGYLVAMH KMEYNWGSNM NVATHAMHLL ISDRLKTDDK
YIHTAEDCTH YLLGRNTLNQ SYITGFGSKQ VKKPHHRPSA ADLALNPVPG LMVGGPDSAL
EDDVAKSKLS GKFPAECYID DINSFSTNEV ATYWNSPVIF ILGYLNSNRL L
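
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it strips the 10-letter block spacing, translates with the amino acid assignments of NCBI translation table 11 (the same as the standard code; table 11 differs only in which codons may act as starts, and that is not modelled here), and stops at the first stop codon. The helper names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 until the first stop codon or the end of input.

    Raises KeyError on any codon containing a letter other than A, C, G, T.
    """
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final row of the gene sequence above.
first_blocks = clean_sequence("ATGAATTGGA AAAATACAAT TATAAGCCTT")
last_row = clean_sequence("ATTCTAGGTT ATTTAAACTC AAATAGATTA TTATAA")
print(translate(first_blocks))  # MNWKNTIISL
print(translate(last_row))      # ILGYLNSNRLL
```

The two printed strings match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the recorded GC content.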