Gene Ccel_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0643 
Symbol 
ID7309508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp739911 
End bp741539 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content42% 
IMG OID643607584 
Productglycoside hydrolase family 18 
Protein accessionYP_002505004 
Protein GI220928095 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.649155 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCTAA AAAAATGTAT AATACCACTG CTTATATTTG TGTTCCTTTT TCAAACAAGC 
ACTTCCGCTT ATGAAAGCAT AAGCTTTTTA TATGCAGGAA ACACAACCAC CTATATAAAT
AATGTTAATC GTTCAGGTAC AAATCTTACA ACGGTTTCTC CCGATTATTT TGAAATAAAC
AGTAACGGGA CTCTAAAAAA AACAATAAAG GTAGATCCTC TTTTTGTGGA GACTATGCAC
GCACGGGGAA TAAAAGTAGT ACCCTACCTT AGTAATAACT GGGACAGAAC TCTCGGCAGG
GCTGCATTGG CCAACCAAAA CTCATTTGTC GCCAGTTTAA GTGCCGAAAT TGTACGACTT
GGTTGTGACG GTATAAATAT AGACATTCAA AACCTTACCG AAGCTGACCG CAACGCATTT
ACAGCTTTTA TCAGACTTTT ACGTTCTTAT TTGCCAAAAA CAAAAATACT ATCTGTATGT
GTCGCAGCTA ATCCGTGGGG ATCAACTATT GGCTGGCAGG GTTCTTATGA TTATGCCGCC
TTAGGTACTA TAAGCGACCA GCTATTCATA ATGGCTTATG ATGAGCATTA CACCGGAAGT
GCACCGGGAG CGGTAGCCAG CTTTTCTTTT GTTGAAAAAA GTGTAAACTA TGCCTTGAAA
TATGTTCCAT CTACAAAAAT AGTTTTAGGA GTACCATTTT ACGGCAGATA TTGGAAACAA
GGAGCAGCAA GCGGCGGCTA CGGAATAACC GTATCTGACG TTGAACGTCT GGTAGCAACC
TGTAAATCCA AAACCTGGTA TGACAGTACA TCCCAGTGTG CCCGTGCAAC CGTAACGGTA
ACCTCGACAG ATAATGCTGT TATCTGGGGA AGCAGCAGAC TTTCAGCCGG CACTTACGAT
ATTTGGTATG AAAATGAAAC TTCACTTGAG AAAAAGCTAT CCCTTGTCTC AAAAAATAAT
CTGCTTGGTG CGGGGAGTTG GGCATTGGGG CAGGAGCCTC AGCGTTTCTG GAATAATTAT
AGCCAATGGC TGATAGGTAA GCCCTTTATT GATATATCAA ATCACTGGGC ACAAAGCTAC
ATCATAGATC TGTTTCAAAA AGGCATCGTC AGTGGTATGC CCGGCAAGCG TTTTGTACCT
GACGGCAGTC TCACGAGGGC CGAAGCCGCA GCATTACTTG TAAAAACCCT TGGTTTGCAG
AATGAAACCG CAACTGCATC TTTCGCCGAC ACTAAAGATC ACTGGGCATC AAAACAAATT
GCCATTGTTA AAGAAAAGGG CATTTTCAGC GGGTATTCGG GAAACATGTT TTACCCTGAA
AGAAAGATTA CAAGGGAAGA ATTTGCAGTG GTATGTGACA AAATACTATT CAGCCCTGAT
ACTGTAGATT TCTCTCAAAG AATTTTCAGT GATGTAAGTC CTGAAAGCAA CCCATGGTCA
AATAAATCTA TCATTGTTCT TTCAATGAAT AATATTCTAT CAGGATACCC GGATGGTACT
TTCAGACCGA AAAAAACAAT TACAAGAGCG GAAGCAACCA GAGTAATAGC CGCTTTACTG
GAATATCCCG GAGGGTTTAC AATTTCGCCG ACTCATATTC AGAGTCCGTC CCCTGTACCG
CCAAGATAA
 
Protein sequence
MRLKKCIIPL LIFVFLFQTS TSAYESISFL YAGNTTTYIN NVNRSGTNLT TVSPDYFEIN 
SNGTLKKTIK VDPLFVETMH ARGIKVVPYL SNNWDRTLGR AALANQNSFV ASLSAEIVRL
GCDGINIDIQ NLTEADRNAF TAFIRLLRSY LPKTKILSVC VAANPWGSTI GWQGSYDYAA
LGTISDQLFI MAYDEHYTGS APGAVASFSF VEKSVNYALK YVPSTKIVLG VPFYGRYWKQ
GAASGGYGIT VSDVERLVAT CKSKTWYDST SQCARATVTV TSTDNAVIWG SSRLSAGTYD
IWYENETSLE KKLSLVSKNN LLGAGSWALG QEPQRFWNNY SQWLIGKPFI DISNHWAQSY
IIDLFQKGIV SGMPGKRFVP DGSLTRAEAA ALLVKTLGLQ NETATASFAD TKDHWASKQI
AIVKEKGIFS GYSGNMFYPE RKITREEFAV VCDKILFSPD TVDFSQRIFS DVSPESNPWS
NKSIIVLSMN NILSGYPDGT FRPKKTITRA EATRVIAALL EYPGGFTISP THIQSPSPVP
PR