Gene Ccel_1106 details

Gene Information

Locus tag: Ccel_1106
Symbol:
ID: 7309919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1363980
End bp: 1365086
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 41%
IMG OID: 643608030
Product: Tail Collar domain protein
Protein accession: YP_002505445
Protein GI: 220928536
COG category: [S] Function unknown
COG ID: [COG4675] Microcystin-dependent protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTTAA AATTTCCCGT AGGTATGGTC ATACCTTTTG CCGGACCCCT TAAAGAGGAC 
CAGTTAAAAT CATCCGGTTG GGTTCCTTGT GACGGAAGGG TGTTGGATAA AACACAGTAT
TCAGAGCTTT TTGATGTAAT CGGAACTAAA TATGGCGGAG ATGGTATTCC GAATTTCAAT
ATACCGGATC TTCGAGGCAG ATTTGTACGT GCCACCGATC ATGGAAGAGG ATATGATCCT
GATGCTCAAC GGCGTAAAGC GTCAAAATCA GGTGGAGCCG CAGGAGACAA TACAGGCTCG
GTACAGGAGT ATGCAACAGC AAAACCGAAG AATAATTTTA TAACAAACGA TAAGGGTAAT
CATAACCACT TGGTAGACCA TCTACCGACA GACTATTGGA ATGCTGCGTG TGCAATAACT
AGTAATGAAG GTGCTAATTT CCCCGGCCGT ACGGCAACAT CAGGGGAAGC AGGACAACAT
TCACATACCA TCGTATCCGG AGGAGATAGT GAATCAAGAC CTGTAAACCT ATATATGTAC
TGGATTATAA AATTTACTTC CAGTGACTAC GATGAATCCA TTTTATTACC GGCTGGTTCA
ATTGTTTCGT TTGCAGGTGA TTCTGTAAAG AAGAGCAATG AGCTAATTGC TAACGGCTGG
CTGCCTTGTA TAGGCAGTTC ATATGAAGCA AACAAATATC CTGATCTTTA CGAGAATATT
TCCAATATAT ACGGAGGAGA CCAGAACAAA TTTAATGTAC CCGACCTTCG AGGTTTATTT
ATAAGAGGTG TTAATTCAAA TACTTCAGAA ACACCCGGGG TTCATGGAGC TACCAGAGTT
GGTCAAACAG AGGACTATTC AACTGCTCTT CCAAAGACTT TAAATTTCAC ATTGTCAACA
GATGGAGCTC ATACCCATAG TGCCCCAAAA CTGCCACAGG ACAAATACAT AGAAAATTAT
TGTGCAGGTC ATGAAGTTGC CAACTTTCCA TCTAATCAAT ATACCGGCAA TAACGGCAAT
CATGCTCATA CAATCGCCGG CGGTGATGCT GAAACCCGTC CTGTAAATAT TTATCTGGAT
TATATTATAA AATCCAGTAA TGTTTAA
 
Protein sequence
MPLKFPVGMV IPFAGPLKED QLKSSGWVPC DGRVLDKTQY SELFDVIGTK YGGDGIPNFN 
IPDLRGRFVR ATDHGRGYDP DAQRRKASKS GGAAGDNTGS VQEYATAKPK NNFITNDKGN
HNHLVDHLPT DYWNAACAIT SNEGANFPGR TATSGEAGQH SHTIVSGGDS ESRPVNLYMY
WIIKFTSSDY DESILLPAGS IVSFAGDSVK KSNELIANGW LPCIGSSYEA NKYPDLYENI
SNIYGGDQNK FNVPDLRGLF IRGVNSNTSE TPGVHGATRV GQTEDYSTAL PKTLNFTLST
DGAHTHSAPK LPQDKYIENY CAGHEVANFP SNQYTGNNGN HAHTIAGGDA ETRPVNIYLD
YIIKSSNV
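
The summary fields in this record (Gene Length, Protein Length, GC content) can be recomputed from the coordinates and the sequence. The following is a minimal Python sketch of such a check, not the database's own method. The function names are hypothetical and introduced only for illustration. It assumes the coordinates are 1-based and inclusive, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may serve as start codons).

```python
# Sketch: recompute summary fields of a gene record from its sequence.
# All function names here are illustrative, not part of any database API.

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order. Translation
# table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO_ACIDS,
))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(gene_length(1363980, 1365086))                   # 1107
print(translate("ATGCCTTTAA AATTTCCCGT AGGTATGGTC"))   # MPLKFPVGMV
print(translate("TATATTATAA AATCCAGTAA TGTTTAA"))      # YIIKSSNV
```

The two translated fragments are the first 30 bases and the last 27 bases of the gene sequence above, and they reproduce the start and end of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.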