Gene Ccel_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1101 
Symbol 
ID7309914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1358085 
End bp1359950 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content44% 
IMG OID643608025 
ProductSpore coat protein CotH 
Protein accessionYP_002505440 
Protein GI220928531 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA GTCGCTTGCT ATTACTGTTA TCGGCTATTG CACTCGTCAC CTGCCTTTTT 
TCCATGTCCT TTGTCGGTGC TGCTACAGTG TACGGGGACC TGAATTCAGA CAGTTCCGTA
GATTCCATCG ACTATTCCAT TATGAAGGGT TATCTTTTTG GCAGCGTGAC CATAAGTAAT
CCTGCAGCAG CTGATGTAAA TGGAGACGGT AACATAGACG CTATTGATTT AGCATTAATG
AAACAATATA TTCTGCATAT TATTACTAAA TTTCCGGCCG ATAACAATAC AACGCCCGGA
GATAATACTC CTGCGGCCCA ATTGACCGGA GATATTGTTT TCTCCGCTCC AAGCGGTACT
TTCAGCAATC AGGTTTCAGT GTCATTGAAT TCAAAGATTG CTGGTGCCCA AATCCGTTAC
ACTGTTGACG GGAGTGTTCC TACCGCCAGT TCCCCTGCAT ATACCAGTCC TTTGTCGTTT
ACCAGGACAA CACAGTTGAG AGCTCAATCT TTTGTAAACG GGACTCCAAG CGGTAAAATG
GGCACAGCAA TCTACGTTTC CAGTGCCATA GATACAAAGC ATGATCTCCC TGTATTAATT
CTGGATGCTT ACGGCGGAGG AAAACCTGCA CGTGAATATA AGGACGTTGC TATTATGCTG
ATGGAACCAA AGAATAATGA AGTTTCCCTA CTACAAACAC CAACAGTCGC CACCCGCGCC
GGCTTCCATG TACGTGGCCA GTCATCAGCA AATTTTGAAA AAACACCTTA TCGTCTTGAA
TTGTGGGATA ATCAGAACGA AGATGCCAAG TATTCCTTGT TGGGTATGCC CGGCGACGGC
GACTGGGCGT TACTTTCACC TTTTCCTGAT AAATCTCTTA TAAGAAATGC TCTTGCTTAT
GAATTAGGAA CAACTATGGG ATTGAAAGCA CCAAGGTACA GATTTGTGGA AGTGTATCTC
AATCTTGATA ATCAGCCGTT ATCCTCAGCA GACTACCAAG GAGTATATCT TCTCACAGAA
ACACTTGAAA TCGACAAGGA CCGTGTTAAT ATTAAAAAGC TCAAGGACGA TGATCTAACA
GAACCGAACA TAACCGGAGG TTACCTGATG CAGTTCAATA TGATGGCAAC AGACGGACCG
TTGGTAAAAG GGTCGGGCTG GAACGATCTT GAGATAAAAG ATCCTGATGA CCTGCTGCCC
CAACAGTTGA CATGGATAAG CAACTATATT CAAAAGGTGC ATAACTCTAT TCGCAGTACT
AATCCTTCTG ACCCAACAAC CGGGTATCCT GCTTACATCG ATGTTGATTC CTTCATAAAC
TACATTATCG AAAATGAGCT TGCCCGTGAA GGTGACTCAT ACATGCGCAG CACCTACATA
TATAAGGACC GTGGTGCAAA GCTGGCGGCA GGTCCGGTTT GGGACTATGA TCTGGGTTAC
AACTGCGTAA CAGGTATGAT GGGTATGCAG ACAAATTACG TAGAAGGTTG GCAATTTCAG
CCAATGTTTG GAATGAGTTC CACATGTGAT TGGTACTACA AGCTTATGCA GGACTCTGCT
TTCCAAAGCA AGATAAGTGC TCGCTGGCAG GAATTACGTA ATGGTCCCCT TTCCGACACA
CAGATAAAAG CACTGGTTCA AAAGCTGACA ACACCTTTAG CCAACGGAGC CAAACGTAAT
TTTCAGAAAT GGAACAATCT GGGCACAGCC ACTGTAGGTG GTTTCAGTAC CCAAACCACC
CAGACATGGG AAGAACAAGT AACAATTTTA CAGAACTTTC TGCTCCAAAG AGCTGCTTGG
TTGGATAAAT CCGGATGGAA GCCAACCACA AATACAAATC CCGGATGGCC CGGTTGGGGT
GGTTGA
 
Protein sequence
MKKSRLLLLL SAIALVTCLF SMSFVGAATV YGDLNSDSSV DSIDYSIMKG YLFGSVTISN 
PAAADVNGDG NIDAIDLALM KQYILHIITK FPADNNTTPG DNTPAAQLTG DIVFSAPSGT
FSNQVSVSLN SKIAGAQIRY TVDGSVPTAS SPAYTSPLSF TRTTQLRAQS FVNGTPSGKM
GTAIYVSSAI DTKHDLPVLI LDAYGGGKPA REYKDVAIML MEPKNNEVSL LQTPTVATRA
GFHVRGQSSA NFEKTPYRLE LWDNQNEDAK YSLLGMPGDG DWALLSPFPD KSLIRNALAY
ELGTTMGLKA PRYRFVEVYL NLDNQPLSSA DYQGVYLLTE TLEIDKDRVN IKKLKDDDLT
EPNITGGYLM QFNMMATDGP LVKGSGWNDL EIKDPDDLLP QQLTWISNYI QKVHNSIRST
NPSDPTTGYP AYIDVDSFIN YIIENELARE GDSYMRSTYI YKDRGAKLAA GPVWDYDLGY
NCVTGMMGMQ TNYVEGWQFQ PMFGMSSTCD WYYKLMQDSA FQSKISARWQ ELRNGPLSDT
QIKALVQKLT TPLANGAKRN FQKWNNLGTA TVGGFSTQTT QTWEEQVTIL QNFLLQRAAW
LDKSGWKPTT NTNPGWPGWG G