Gene Ccel_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1501 
Symbol 
ID7310267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1824380 
End bp1825924 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content40% 
IMG OID643608425 
Producttail sheath protein 
Protein accessionYP_002505833 
Protein GI220928924 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAATT ATTTATCACC AGGAGTTTAT GTAGAAGAAG TTTCCAGCGG AGTCAAGCCT 
ATAGAAGGTG TAGGTACAGC GGTGGGTGCT TTTATCGGAA TTGCTGAGAA GGGTGTCATT
GGTAAAGCTG TTCTGGTTAC CAATTGGAGC CAGTATGTCA GCGAATTTGG AGGATTTATT
CCAAATGCAT ACCTTGCTTA TGCCGTGTAC AATTTCTTTG CGGAAGGAGG AACATCCTGC
TATGTTGTAA GAGCTGCATC AGAAGATGCG AAGAAGTCTC TTTACATTGT AAAGGATAGT
CAAGGAGAAA ACTTATTCGA AATAAGTGCC CGTTCAGAAG GAAACTGGGG TAACAGAATT
TCCTTTCAAA TAAGCAGTTC AACAAACGGA CAGATGAACG GTTTCAAACT CAATATCAAG
TATACCGAGA AGAGTTCATT CAGCGATGAG TATGTTGGAG AGGATGTTGA AGGGGAACTT
GTTGAGACCT TCGACAATCT GCTTATAGTT AACTTTGAAG AGAAAATAAA CGATTTATCA
TCGTTTATAA GTGTCAGGCC GTTAGTAGAT CTTAAAAAAG TTGATAACAT GGACAAGGTT
CCAATGTTCA CTGAAGAGGA TGAATTTATA GAATTGGCAA ATGGCGTTGA TGGAATATCA
TACGTAGAGT ATATTGACAG CGAAGAAAAG AAATTGGGAA TCAACGCATT TACACCAATA
GACGAAATTA ATATAATTGC TGCACCTGAC GTTTCCAACA TGGTATCAAA CAGAAATATT
ATTCTCGAAA TTCTTAACTA TTGTAAGACC AGAAAAGATT GCTTCTACGT TATAGATCCT
CCGCATGGCC TGACCCCACA ACAAGTAAAG GACTTCAAAG AGGGTGCGGG AGAGTTTACA
GGCAACTCAT TTAATTCATC TTATGGTGCA TTGTATTACC CATGGGTGTT TATCAATGAC
CCCCTGACAG GAAAGAAAAA ACTTATCCCA CCTTCAGGTT CAGTTGTAGG TACATATGCA
TATGTTGATT CAGCAAGAGG AGTACACAAA GCTCCTGCCG GAACTACTGA CGGATATCTT
GATACTGTAG TAGGAGTTGA AAAGATAGTA ACAAAAGGAG AACAGGAGCT TCTCAATCCA
ATAGGTGTAA ATGTAATCCG TTCTTTACCA GAGGGAATTT GTATCTGGGG TGCCAGAACT
CTTTCATCTG ATTCAGAATG GCGCTATATA AATATCAGAC GTCTGATGAT GTATATTGAA
GAATCAGTTG ACAAGGCAAG CCAATGGGTA GTTTTTGAAC CCAATGAGCC AACTCTCTGG
GGGAAGGTAA AGAGAAACAT ATCAGCTTTC CTGACAAGAG TTTGGAGAGA TGGCGCTCTT
TACGGTTCAA CCCAGGAAGA AGCATTCTTT GTAAAGGTGG ACGAGGAAAA CAACCCTCCG
GCATCAAGAG ATGCAGGCCA ATTGGTAATA GAAGTCGGTG TAGCTCCAGT TAAACCGGCA
GAATTTGTAA TAATCAAGGT CAGCCAAAAG ACACTGTCTA AATAA
 
Protein sequence
MPNYLSPGVY VEEVSSGVKP IEGVGTAVGA FIGIAEKGVI GKAVLVTNWS QYVSEFGGFI 
PNAYLAYAVY NFFAEGGTSC YVVRAASEDA KKSLYIVKDS QGENLFEISA RSEGNWGNRI
SFQISSSTNG QMNGFKLNIK YTEKSSFSDE YVGEDVEGEL VETFDNLLIV NFEEKINDLS
SFISVRPLVD LKKVDNMDKV PMFTEEDEFI ELANGVDGIS YVEYIDSEEK KLGINAFTPI
DEINIIAAPD VSNMVSNRNI ILEILNYCKT RKDCFYVIDP PHGLTPQQVK DFKEGAGEFT
GNSFNSSYGA LYYPWVFIND PLTGKKKLIP PSGSVVGTYA YVDSARGVHK APAGTTDGYL
DTVVGVEKIV TKGEQELLNP IGVNVIRSLP EGICIWGART LSSDSEWRYI NIRRLMMYIE
ESVDKASQWV VFEPNEPTLW GKVKRNISAF LTRVWRDGAL YGSTQEEAFF VKVDEENNPP
ASRDAGQLVI EVGVAPVKPA EFVIIKVSQK TLSK