Gene Ccel_0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0251 
Symbol 
ID7309151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp279611 
End bp281650 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content40% 
IMG OID643607181 
ProductSpore coat protein CotH 
Protein accessionYP_002504618 
Protein GI220927709 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5337] Spore coat assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0200884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAACCA GCAAATATAT AAATATAATT ATTGCTGTTG TTCTGATTGT GGTGGTGGTC 
TTTACCGGAA TCTTTATGAC TGTTCCCGGC TCAATAGGAA TAGCCACTGA AAATTCCCAA
CCTGAATATG TCACAAAGTT GTTTGATAAG AATAAGATCA TTTCTATTGA TATAAAGGCA
GATAAAAAGG AATGGGAAGA AATGTTGAAA AATGCGACAA GTGAAGAGTA TATTCAATGT
GATGCAACTA TCAACGGAAC AACCTATAAA TCAGTGGGTA TAAGGCCTAA AGGTAATTCA
AGCTTATCCA TGGTAGCAAA CAGTGATTCA GACAGATACA GTTTTAAATT GGAGTTCGAC
CACTATATAA AAGGACAAAC CTGTATGGGG GTGGACAAAA TGGTTATAAA CAATATACAG
GCGGATTCCA CATATATGAA GGAATACCTT TCTTTTGACA TGATGTCCTA TATGGGGGTT
ACTACTCCTT TATATGCCTA CACTGATGTA ACCGTAAACG GGGAAAAGTG GGGTTTCTAT
CTTGCGGTTG AGGCTATGGA GGAATCCTTT GCCAAGAGAA ATTACGGAGT TGATTTTGGA
AAGCTGTACA AGCCGGAAAC TATGGGTGGC CAACGGGATG AAAATATGAA GGACATTCCC
AGAAATCAGG AGGACGAACA GAAGAAAAAC AGAGTTCAGC AACAACAGCC GGGGATAGTT
TCAGGTACCA CTGCCGAAGC GAATGGTATA AATGGTAATT CAAAGACAAC TGATGCACAA
AATACTAACC AGGCAGGCCC CGGAGGTTTC CCGGGAGGAC CGGGTTTTGG CGGATTTGGA
AATAACCAAG GCGGAGGGGC TGACCTGAAA TATATTGATG ATAAAATAAG CAGTTATTCA
AATATATTTG AAGGTGAAAT ATTTAAGGGT ACTGATGCAG ATTATAAGAG AGTTATAAAA
GCGATTAAGA ATCTAAACAA TGGCACGGAA CTTGAAAAAT ATATAAGTGT TGATGAATGT
CTAAGATATT TTGCGGTAAA TACAGTGGTA GTTAACCTTG ATAGTTATTT CAGCAATATG
AAGCATAACT ATTATCTTTA TGAGGAAAAT GGTAAGCTTT CAATGTTGCC ATGGGATTAT
AACCTTGCTT TTGCAGGCTT TCAGTCAGGA ACGGCATCTT CAGCCGTAAA TTTCCCAATA
GACACACCTG TATCGGGTAT TGAAATGTCT GAAAGGCCGA TTTTAAACAA ACTTCTGGAA
GTCGATGAAT ATAAGGAGAA ATATCATAAA TACCTAAAAG ATATATTGAA CGGTTATATT
GATAATGAAA AATTCGGAAG CACAGTAGAT AAACTGAATT CATTAATTGC AGGATATGTA
AAAAATGATG CAACGGCCTT TTTTACATAT GACAAGTATA CGGCAGGGGT AGCAATGCTA
AAGGAATTCG GCAAACTAAG AGCAAAGAGT ATTGAGGGAC AGCTTAACGG AAGTATTCCG
TCAACTAAAG AAGGACAGTC TAAGGCCTCT GACAAATTAA TCGACGCATC AGCAATAAAT
CTTTCGGTTA TGGGTACGCA AGGCGGCGGA GGCGGAGGCG GAAGAGTGGG CGGAGACAGA
GCTAGCCAAC TTGAGAAAAG TCAACAGCAG GGCGATGATC AAAAGCCCCA GGAAAATCAG
CAGGGTAATA GTAGGATGCT GCCGGGGGAC ATGCCTGATC GCGATATAAT GATGCAGGCA
ATGAGGATAG TACAATCAGC TGACGAAGAG GATTTGACCG AAGAACAGCT CAAGCAGTTA
AAAGAGCTTG GAATGACAGA GGAACAGATC GCTTTTGTTA AAAATATGCC CTTCGGAAAA
GGAGGCATGG GGGGAGACGG GATTTCATCC CAAAAGGGCG ATAACAGTAT AAAATCCAAC
AACAGGCAAG AGACTACACG AATGTCGGAT CAGGATATTC TTGTGACGGG CATATATTTA
GTATTTATGG CAGCGGGATT GCTATTTGTA ATCAAATTTA AACGAAGAAA AAGCAGTTGA
 
Protein sequence
MITSKYINII IAVVLIVVVV FTGIFMTVPG SIGIATENSQ PEYVTKLFDK NKIISIDIKA 
DKKEWEEMLK NATSEEYIQC DATINGTTYK SVGIRPKGNS SLSMVANSDS DRYSFKLEFD
HYIKGQTCMG VDKMVINNIQ ADSTYMKEYL SFDMMSYMGV TTPLYAYTDV TVNGEKWGFY
LAVEAMEESF AKRNYGVDFG KLYKPETMGG QRDENMKDIP RNQEDEQKKN RVQQQQPGIV
SGTTAEANGI NGNSKTTDAQ NTNQAGPGGF PGGPGFGGFG NNQGGGADLK YIDDKISSYS
NIFEGEIFKG TDADYKRVIK AIKNLNNGTE LEKYISVDEC LRYFAVNTVV VNLDSYFSNM
KHNYYLYEEN GKLSMLPWDY NLAFAGFQSG TASSAVNFPI DTPVSGIEMS ERPILNKLLE
VDEYKEKYHK YLKDILNGYI DNEKFGSTVD KLNSLIAGYV KNDATAFFTY DKYTAGVAML
KEFGKLRAKS IEGQLNGSIP STKEGQSKAS DKLIDASAIN LSVMGTQGGG GGGGRVGGDR
ASQLEKSQQQ GDDQKPQENQ QGNSRMLPGD MPDRDIMMQA MRIVQSADEE DLTEEQLKQL
KELGMTEEQI AFVKNMPFGK GGMGGDGISS QKGDNSIKSN NRQETTRMSD QDILVTGIYL
VFMAAGLLFV IKFKRRKSS