Gene Ccel_1249 details


Gene Information

Locus tag: Ccel_1249
Symbol:
ID: 7310044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1546692
End bp: 1548752
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 39%
IMG OID: 643608170
Product: glycoside hydrolase family 9
Protein accession: YP_002505585
Protein GI: 220928676
COG category:
COG ID:
TIGRFAM ID:
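
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon, which is consistent with the values in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1546692, 1548752)
print(length)                              # 2061
print(expected_protein_length_aa(length))  # 686
```

Both results agree with the Gene Length and Protein Length fields of the record.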


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000227885
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGG TAAGTGTACT GGTTGTGCTT GCTGTTTTAC TGGTGAGCAT TATTCCATCT 
GCGGTTTCAG CTGAGGAAAA CAACTTTAAC TATGTTGATG CATTTGCCAA GTCAATTCTT
TTCTATGAGG CAAATTGGTG TGGTCCTGAT GCAGGAAACA ACAGGATTAA ATGGCGCGGC
CCTTGCCATT GTGATGACGG AAAGGATGTA GGACTGGACT TGACGGGAGG GTTTCATGAC
TGCGGAGACC ATGTAAAGTT CGGGTTACCG CAATGTGCTT CTGCTTCAAC CCTTGCATGG
GCTTACTATG AGTTCGAGGA TGTTTTTATT GACAAGGGAC AGGATGGCTA TATGCTTAAT
ATTTTAAAGC ATTTTTGCGA TTACTTTATG AAATGCTTCC CGAACAAGAC TACGTTTTAC
TATCAGGTAG GCGACGGTGA TGTAGATCAC CAATATTGGG GGCCGCCTGA GCTTCAGACA
TATGACAGGC CAGCCTATTA TGTTGCAACA CCATCAAATC CCGGTTCTGA TGTAGCAGGT
GATGCGGCTG CTGCACTGGC ACTTATGTAT CTGAACTATA AGGATATAGA TTCAACTTAT
GCAGAAAAAT GTCTTACTTA TGCAAAAGAC CTTTATGACT TCGGTATGAC CTATAGAGGA
AACAGTAAAG GTCAAAGCTA TTATCTGCCT AGAACTTATC TGGATGAACT TATGTGGGGA
TCAATCTGGC TTTATGTTGC TACAAATGAT AATAAATATA TGGATAATGT AGAAAAGCTA
ATGGTTGAGA AAGGGATAAC CGGAGGCAAC TCCTTTAATG ACAATTGGAC CCAGTGCTGG
GATTATGTAT TAACAGGAGT GTTTACAAAG CTTGCAACAC TTTCGACTAA TCCTTTATAC
AAATCAATAG CCGATGACCA TATTGATTAC TGGCAGAATA GATTAAAAAC TACTCCTGCA
GGATTGAAAT ACCTTGATAG CTGGGGTGTT TGCAAATACC CTGCAGCAGA GAGTATGGTT
CAGCTTGTTT ATTATAAATA TTTTGGTAAT GAGAAGTGTC TGGACTTTGC AAAGGGACAA
ATAGATTACA TTCTTGGAGA CAATCCCAAC AATATGTCTT ATGTAGTTGG TTTCGGAGAT
AATTATCCCA AATACCCTCA TCATCGTGCC GCAAGCGGAG TTTTGGAAGG CCCGCCTGCA
GATGAAAAGA AGGAATTGCC GGAAAGGCAT ATTCTATACG GTGCTCTTGT AGGGGGAGCA
GACATGAATG ATGAATATCA TGACAATGTT AATGAGTATG TTTATTCAGA AACCGGATTG
GACTATAATG CAGGCTTAGT AGGAGCAATG GCGGGTATGT CAAAATACTT CGGTAAAGAC
CAGTTGCCTG AACCTACTCC GGGTATTGAG GGTGAGCCGA CCCAATACTA TTCGGAAGCT
AAACTATACA AATCAAATAC AGAGGGTGTT ACCGTTGACC TTAATTTGTA TAATATTGTT
ACGGCACCTC CTCAATATGA AAAAGACTTA TCCTGTAAGT TCTTCGTAGA CTTATCAGAA
TTTGCTGCTG AAGGAATTAA TCCATCTAAG TTTACAACTA AGGTATATTA TTCTCCGGCC
GATGCACAAA TATCTGGAAT TCAGCCATAT GATAAGGAGA AAAATATCTA TTACGTAGAA
ATTACTTTCC CAAATAGTCA GTTGTATGCA AGAACTTATG TTCAATTCTG TATTTACAAT
TACGAAAGTA AACTATGGGA TTCCAAAAAT GACCTTTCTA CCGCAGGTTT AACCGATGAA
TATGCGAAAA TAGAAAATAT TCCAATATAC AAAAATGGCG TTAAAGTTTA CGGGAATGAC
CCTGCGGGAA GTACAACAAT TTTGTATGGA GATTTAAATA ACGATAAAGA AATAAATGCA
ATTGATTTTG CATTATTAAA AAAATACCTG CTGAATGGAG ATGCAGAGGG TATAATTCTT
AATAATGCCG ACATTAATAA GGATGGAGCT GTAAATGCTC TGGATTTTGC AAACTTAAAA
CTGTACTTAC TTGGTAAATA A
 
Protein sequence
MKKVSVLVVL AVLLVSIIPS AVSAEENNFN YVDAFAKSIL FYEANWCGPD AGNNRIKWRG 
PCHCDDGKDV GLDLTGGFHD CGDHVKFGLP QCASASTLAW AYYEFEDVFI DKGQDGYMLN
ILKHFCDYFM KCFPNKTTFY YQVGDGDVDH QYWGPPELQT YDRPAYYVAT PSNPGSDVAG
DAAAALALMY LNYKDIDSTY AEKCLTYAKD LYDFGMTYRG NSKGQSYYLP RTYLDELMWG
SIWLYVATND NKYMDNVEKL MVEKGITGGN SFNDNWTQCW DYVLTGVFTK LATLSTNPLY
KSIADDHIDY WQNRLKTTPA GLKYLDSWGV CKYPAAESMV QLVYYKYFGN EKCLDFAKGQ
IDYILGDNPN NMSYVVGFGD NYPKYPHHRA ASGVLEGPPA DEKKELPERH ILYGALVGGA
DMNDEYHDNV NEYVYSETGL DYNAGLVGAM AGMSKYFGKD QLPEPTPGIE GEPTQYYSEA
KLYKSNTEGV TVDLNLYNIV TAPPQYEKDL SCKFFVDLSE FAAEGINPSK FTTKVYYSPA
DAQISGIQPY DKEKNIYYVE ITFPNSQLYA RTYVQFCIYN YESKLWDSKN DLSTAGLTDE
YAKIENIPIY KNGVKVYGND PAGSTTILYG DLNNDKEINA IDFALLKKYL LNGDAEGIIL
NNADINKDGA VNALDFANLK LYLLGK