Gene Ccel_2123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2123 
Symbol 
ID7310821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2489013 
End bp2490290 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content37% 
IMG OID643609056 
Productglycosyl hydrolase 53 domain protein 
Protein accessionYP_002506447 
Protein GI220929538 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3867] Arabinogalactan endo-1,4-beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.052262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGTT CGAAAAATAA AAAATTAATT GTTATTTCTC TTTTGATATT GATACTAGTG 
TTTGAAAACC TGATTCTGCA CATAAATAAT TACACTGTTT CTGCTGCCGC TCCACAGCTT
TTATATGGTG ACGTTGACGT GAGCGGAGAA GTGAATTCTT TGGATTATGC ACTAATAAAA
AGCTATTTGC TGGGAAATAT AACTGATTTC CCGGATGTCA ACGGTAAAAA GGCTGCTGAT
GTAAACGGTG ACGGCAGCAT AGACTCTTTG GACATTTCAT TGATAAAAAG TTTTATTTTG
GGGATTATAG AAAGGTTTCC AATAGAAACT CCTGCAAATA CGTTTGCAAA AGGTGCGGAT
ATAAGCTGGT TGCCGCAGAT GGAGGCAAGC GGATATAAAT TTTATAACAA TAAAGGCCTT
CAGCAGGACT GCTTACAGAT CTTAAAGGAT TATGGAGTTA ACTCAGTCAG AATAAGAACG
TGGGTAAATC CGTCAACTGA TAAATGGAAT GGCCACTGCA GTACCAATGA AACAATAGCT
TTAGCAAAGA GGGCTAAGAA CCTGGGGTTC AGGATTATGA TTGACTTTCA TTACAGTGAT
TCTTGGGCAG ATCCCGGAAA GCAGACAAAA CCCGCAGCCT GGTTAAATCT AGATTTCAAT
GGGTTAATGA AAATGACATA TGACTATACA TATGATGTAA TGACTAAACT AAGGAATAAT
GGAATATTAC CAGAGTGGGT GCAGGTTGGG AATGAAACCA ACAATGGCAT GCTATGGGAG
GATGGAAAAG CGTCAAACAA TATGAAAAAT TTCGCATGGC TTGTGAATTG CGGTTATGAT
GCTGTAAAAG CTGTAAACCC CAAAACCAAG GTAATTGTAC ACATATCAAA TGGATTTAAC
AATACATTGT TTAGGTGGAT GTTTGACGGA CTTAACTCCA ATGGGGCAAA ATACGATGTT
ATAGGAATGT CATTATATCC TGACAAAGAC AATTATCCTG CTCTTTTAAA CCAATGCCTG
AATAATATGA ATGATATGGT ATCAAGGTAT AACAAAGAAA TAATGATTTG TGAAATAGGA
ATGCAATATA ACTATGCTTC AGAGAGTAAA GCTTTTATTA TAGATATGGT AAATAAGACT
AAATCGTTAC CAAACAATAA AGGTCTGGGC GTATTCTATT GGGAACCGGA GTCATATCCA
GGAATGAACG GTTACAATAA AGGCTGTTGG AATTCTGATG GAAAGCCTAC AATCGCATTG
GATGGATTTT TAAATTAG
 
Protein sequence
MKGSKNKKLI VISLLILILV FENLILHINN YTVSAAAPQL LYGDVDVSGE VNSLDYALIK 
SYLLGNITDF PDVNGKKAAD VNGDGSIDSL DISLIKSFIL GIIERFPIET PANTFAKGAD
ISWLPQMEAS GYKFYNNKGL QQDCLQILKD YGVNSVRIRT WVNPSTDKWN GHCSTNETIA
LAKRAKNLGF RIMIDFHYSD SWADPGKQTK PAAWLNLDFN GLMKMTYDYT YDVMTKLRNN
GILPEWVQVG NETNNGMLWE DGKASNNMKN FAWLVNCGYD AVKAVNPKTK VIVHISNGFN
NTLFRWMFDG LNSNGAKYDV IGMSLYPDKD NYPALLNQCL NNMNDMVSRY NKEIMICEIG
MQYNYASESK AFIIDMVNKT KSLPNNKGLG VFYWEPESYP GMNGYNKGCW NSDGKPTIAL
DGFLN