Gene Ccel_2959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2959 
Symbol 
ID7311568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3510792 
End bp3512039 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content32% 
IMG OID643609859 
Productphage portal protein, HK97 family 
Protein accessionYP_002507233 
Protein GI220930324 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATTTG ATAAAATTGA AAAACGTGAG AATACGCCTA TTAATGATTG GAAGGATATT 
TATAGCTTTG AAAAAGGTTA TGAAATTATA CCGTATGATA GGAGTTTAAA GGAAAGTACA
TATTATAGTG CAATCAAAAT AATAAGTAAA AGCGTTGCAA AATGTAGTTT ACAAGTTAAA
TTAGAGTCTG AAAAAGGTGA GGATTTAGCT AAGAATCATT ACTTATTTGA TAAGTTAAGG
CTGCGATTTA ATTCGTATAT GTCATCAATT GATGCAATGA AAGCATTTGT GGCAATTTCA
AAGCATGAGG GTATTAGCGG TTTGTTTATT AATAGGGATG TAAAAGGCAC TATTGAAGGT
TTATATCCTG TTAAAATAAC TCAAATAACT ATTGATAACG CCGGGATAAT AAAAGGTACT
AAAAGTAATA AGGTTTTATA TGATTTTGAA TGTGTTGAGG GTGAAACTGG AAGTTGTTTT
GACAAAGATA TAATAATATT ACGAGATTTT ACACTGGATG GAATCAAGAC TAATGCTACT
CGAAAAATAT TATCAGAAAG CTTGGACACA TCTGTAAAAA GTCAAGATTA TCTCAATACC
TTATTTACAA ACGGATTGAC AAACAAAGTA GTGGTACAGC TTTCATCAGA CATAAAAGAT
GAAAAAGAGA TAAAGAAGAT GCAGGACAAA TTTAATAGAA TTTACAGCAG CAATGGGAAG
ATATTTACCG TTCCGGCAGG ATATTCAATT GATGCATTAA ATCTGTCCCT TGCAGATGCT
CAATATGAGC AACTTCGTCG TTTATCAAAA GAGGAAATTG CATGTATGAT GGGAGTCCCT
TTATCAAAGT TAGGATTTGT AAAAGAAAAT GCCAAGAGTG AGGAGCAGGA TAATTTGAAA
TTTCTTTCTG ACACCCTATT AATAATCTTT GAAGCCATAG AACAGGAAAT GGATTGGAAG
TTATTAACTG AAAAAGAAAG AAAACAGGGC TATAAGATAA GGTTTAACAT AAATGTCTTA
CTTAGAACTG ATAGTAAAAC CCAGTCCGAA GTTATTAATT CTTATGTTAA TAATGGGGTT
TATGATTTAG ATTACGCAAG GGGAATTCTT GGTGTTCCTA AATTAGGTGG AGAACCAATT
ATCACACTGC CTTCCGGGCA AATTCTATTG AAAGATTTAT TAAGCGGTAA TGTAAGCTAT
CTAAAAAATA AATCAGGAGA GGGAGGTGAT AATTTAAATG GAAACTAG
 
Protein sequence
MIFDKIEKRE NTPINDWKDI YSFEKGYEII PYDRSLKEST YYSAIKIISK SVAKCSLQVK 
LESEKGEDLA KNHYLFDKLR LRFNSYMSSI DAMKAFVAIS KHEGISGLFI NRDVKGTIEG
LYPVKITQIT IDNAGIIKGT KSNKVLYDFE CVEGETGSCF DKDIIILRDF TLDGIKTNAT
RKILSESLDT SVKSQDYLNT LFTNGLTNKV VVQLSSDIKD EKEIKKMQDK FNRIYSSNGK
IFTVPAGYSI DALNLSLADA QYEQLRRLSK EEIACMMGVP LSKLGFVKEN AKSEEQDNLK
FLSDTLLIIF EAIEQEMDWK LLTEKERKQG YKIRFNINVL LRTDSKTQSE VINSYVNNGV
YDLDYARGIL GVPKLGGEPI ITLPSGQILL KDLLSGNVSY LKNKSGEGGD NLNGN