Gene Ccel_1351 details

Gene Information

Locus tag: Ccel_1351
Symbol:
ID: 7310131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1649676
End bp: 1650869
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 39%
IMG OID: 643608271
Product: hypothetical protein
Protein accession: YP_002505685
Protein GI: 220928776
COG category: [R] General function prediction only
COG ID: [COG5401] Spore germination protein
TIGRFAM ID:
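
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1649676, 1650869)
print(length)                     # 1194
print(protein_length_aa(length))  # 397
```

Both values agree with the Gene Length and Protein Length fields of this record.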


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.033419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TTATTAGTAT TGTGTTGGTA ATTATGCTTT TATCGACAGG ATGCTCTTTT 
ATTGGTCACA AAAATGAACA AGAGCCTAGC ATTCAGGCAG CAAATCAAAC AGTTAATGCT
TCCGCATCAA ACACCGCTCA GGATTCCAAA GCAGCGGCTT CCGCCACATC AACCAATTCA
AATGGGTTAG CTATGACTTC CGGGACAACA GCAGCTCCTA CTACAACAAG CTCAATGGTA
AGCGGCAATT ATACCGACAG TCAGATAAAA GACCTTGTCA TAGACAACGG AGCGGGGAAA
GGTATGGCAG CTGCTTCTGA TAAAAGCAAT CTTCTCATAA CTTTATATTA CAAAAACCAA
AAGGGATTGA TTATTCCTGT TACCAGAACA GTTAAAAAAC AGGAAAGCCT TGCAAAGGCA
GCAATTCTCG GGTTGGTTGA TGAAGCTGTA ACAAAAGAAC AACTGGACTA CTACGGACTT
TACCCTGTAC TGCCCAGAGG TACAAAGATA AAAGGAATAA ATATAAAAGA TAAGGTTGCT
GTAATTGACT TTTCCAAGGA ATTTTTAAAT TTATCGGGTA AACAGGAAGA GCAGGAGGCA
GTAGCTTCGA TTGTGTATAC TCTGACAGGA TTCTCAACAG TATCTGATGT AAGAATCAGG
GTTGAAGGAA AAGAAATAAC CACTCTTGAA AATGGAACGG ATTTATCGAT TCCCAGAAAC
AGAAGCAATA CACTTATCAA TACAAGCGAC ACCCAAATAA AGGATGGATG TGTGAAATGC
GATTTGTATT ATGTTTCAGA TGACAGCAAT AACCATAATT ATCTTGTTCC AGTGTCTATA
CAGATACCCC AGACAGACCC TCGCGGTATA CCCGGACTAA TATTTGATGA GCTTGGTAAA
AAGCCCAACG AAACAACTTA TTTTACATCT ATGCCAGAAG GAACAAAATT GCTTTCTTTT
AATCGACAGG GAAGTACTGC TGTTCTGGAC TTTTCAAACC AGATTACCAA CTATGGCGGC
TCTGAAAAAG AAGATACCTT GTTAAATCAA ATATATTATA CAGTCAGCCA GATGAAAGGA
ATACAAAAAA TTAAGCTGCT TATAAACGGT AAGGAAAAGA CTCTGCCTGA AGGTACGGAA
GTGGCATCAG CTAGAAGTGT TCCAATAACC TTTAATAAGG TAATAGAGAA CTAA
 
Protein sequence
MKKIISIVLV IMLLSTGCSF IGHKNEQEPS IQAANQTVNA SASNTAQDSK AAASATSTNS 
NGLAMTSGTT AAPTTTSSMV SGNYTDSQIK DLVIDNGAGK GMAAASDKSN LLITLYYKNQ
KGLIIPVTRT VKKQESLAKA AILGLVDEAV TKEQLDYYGL YPVLPRGTKI KGINIKDKVA
VIDFSKEFLN LSGKQEEQEA VASIVYTLTG FSTVSDVRIR VEGKEITTLE NGTDLSIPRN
RSNTLINTSD TQIKDGCVKC DLYYVSDDSN NHNYLVPVSI QIPQTDPRGI PGLIFDELGK
KPNETTYFTS MPEGTKLLSF NRQGSTAVLD FSNQITNYGG SEKEDTLLNQ IYYTVSQMKG
IQKIKLLING KEKTLPEGTE VASARSVPIT FNKVIEN
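
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the sequences are pasted in as shown, in space-separated blocks of 10, and it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not handle). The helper names `clean_sequence`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments of the standard code,
# which are the same in translation table 11.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def translate(raw_dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean_sequence(raw_dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(raw_dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(raw_dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAAAAAAA TTATTAGTAT TGTGTTGGTA"))  # MKKIISIVLV
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.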