Gene Ccel_1099 details

Gene Information

Locus tag: Ccel_1099
Symbol:
ID: 7309912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1355760
End bp: 1357187
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 34%
IMG OID: 643608023
Product: glycoside hydrolase family 5
Protein accession: YP_002505438
Protein GI: 220928529
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
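
The gene and protein lengths listed above can be cross-checked from the start and end coordinates. The sketch below is only a consistency check. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. Both assumptions agree with the numbers in this record but are not stated by it, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1355760
end_bp = 1357187

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```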


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CAACAGCTTT TTTATTATGT TTTCTAATGA TTTTTACAGC ATTATTGCCA 
ATGCAAAATG CTAATGCGTA TGATGCTTCA CTTATTCCGA ATCTTCAGAT TCCACAAAAG
AACATTCCGA ATAATGATGG AATGAATTTT GTAAAAGGTT TAAGACTCGG ATGGAATCTG
GGTAATACAT TTGATGCTTT TAACGGTACA AATATTACTA ATGAATTGGA TTATGAAACA
TCATGGAGCG GTATCAAAAC AACTAAGCAG ATGATAGATG CAATAAAGCA AAAAGGATTC
AATACTGTTC GTATTCCTGT ATCCTGGCAT CCACACGTAA GTGGTTCAGA TTACAAAATC
AGTGATGTAT GGATGAATCG TGTTCAAGAA GTAGTAAATT ATTGTATAGA TAATAAAATG
TATGTCATTT TAAACACACA TCATGACGTT GACAAAGTAA AAGGTTATTT CCCAAGCAGT
CAATATATGG CAAGCTCCAA GAAATATATA ACTAGTGTCT GGGCACAGAT TGCTGCTAGG
TTTGCAAACT ATGATGAGCA TCTTATTTTT GAAGGAATGA ACGAGCCTCG TCTTGTAGGA
CATGCAAATG AGTGGTGGCC TGAGCTGACA AATTCAGATG TAGTTGATTC TATTAATTGT
ATTAATCAAC TTAATCAGGA TTTTGTTAAT ACAGTACGTG CAACAGGTGG AAAAAATGCA
AGCAGATATC TTATGTGTCC AGGATATGTT GCATCTCCTG ACGGAGCAAC AAACGATTAC
TTCAGAATGC CAAATGATAT TTCTGGTAAT AACAACAAAA TAATTGTATC TGTACATGCA
TATTGTCCAT GGAATTTTGC AGGGTTGGCA ATGGCTGATG GAGGTACAAA TGCTTGGAAT
ATAAATGATT CAAAAGATCA AAGTGAAGTT ACTTGGTTTA TGGATAATAT TTATAATAAG
TATACAAGCA GGGGTATTCC TGTAATAATC GGTGAATGTG GAGCAGTAGA TAAGAACAAT
CTGAAGACAA GAGTAGAATA TATGTCCTAT TATGTTGCAC AAGCTAAAGC ACGTGGTATA
TTATGCATAT TGTGGGATAA CAATAATTTC TCAGGTACTG GTGAATTATT TGGTTTCTTC
GATAGAAGAA GCTGTCAGTT CAAGTTCCCT GAAATTATAG ATGGAATGGT GAAATATGCT
TTCGAAGCCA AGACAGATCC TGACCCAGTA ATTGTATATG GAGATTATAA CAATGATGGA
AATGTTGATG CACTTGATTT TGCAGGCTTA AAGAAATATA TTATGGCTGC TGACCATGCT
TATGTAAAGA ATTTGGATGT TAATCTCGAC AATGAAGTGA ATGCATTTGA CCTTGCTATT
TTGAAAAAAT ATCTGCTTGG TATGGTAAGT AAGCTTCCAA GCAACTAA
 
Protein sequence
MKKTTAFLLC FLMIFTALLP MQNANAYDAS LIPNLQIPQK NIPNNDGMNF VKGLRLGWNL 
GNTFDAFNGT NITNELDYET SWSGIKTTKQ MIDAIKQKGF NTVRIPVSWH PHVSGSDYKI
SDVWMNRVQE VVNYCIDNKM YVILNTHHDV DKVKGYFPSS QYMASSKKYI TSVWAQIAAR
FANYDEHLIF EGMNEPRLVG HANEWWPELT NSDVVDSINC INQLNQDFVN TVRATGGKNA
SRYLMCPGYV ASPDGATNDY FRMPNDISGN NNKIIVSVHA YCPWNFAGLA MADGGTNAWN
INDSKDQSEV TWFMDNIYNK YTSRGIPVII GECGAVDKNN LKTRVEYMSY YVAQAKARGI
LCILWDNNNF SGTGELFGFF DRRSCQFKFP EIIDGMVKYA FEAKTDPDPV IVYGDYNNDG
NVDALDFAGL KKYIMAADHA YVKNLDVNLD NEVNAFDLAI LKKYLLGMVS KLPSN
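
As a rough check that the two sequences above correspond, the first 60 bases of the gene sequence can be translated and compared with the first 20 residues of the protein sequence. This is a minimal sketch, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order. Amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAAAAAAA CAACAGCTTT TTTATTATGT TTTCTAATGA TTTTTACAGC ATTATTGCCA"
# First two blocks of the protein sequence above, copied verbatim.
expected = "MKKTTAFLLC FLMIFTALLP".replace(" ", "")

prefix = translate(first_line)
print(prefix == expected)
```

Passing the full gene sequence to the same function should give the complete protein, since translation stops at the terminal stop codon.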