Gene Ccel_2836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2836 
Symbol 
ID7311456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3389815 
End bp3391059 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content46% 
IMG OID643609731 
Productphage portal protein, HK97 family 
Protein accessionYP_002507110 
Protein GI220930201 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTAA TAAAAGGACT ATTTCGTTCA AGAGACAAAC CGCAAAACCG TGTGGGCAGT 
GCATTTTCCT TTTTGTTTGG CGGTACGTCA TCTGGCAAAA CGGTTAATGA GCGTACTGCA
ATGCAGGCAA CTGCGGTGTA TGCTTGCGTA AGGATACTAG CTGAAGCAAT TGCAGGACTG
CCACTACATG TATATAGATA TCGTTCTGAT GGAGGTAAAG AAAAGATTCC TTTCCACCCG
CTGTATTACC TTCTTCATGA TGAACCAAAT CCAGAGATGA CTTCATTCGT GTTTCGAGAA
ACACTGATGA GTCATCTTTT GCTTTGGGGA AATGCCTATG CACAGGTGGT CAGAAACGGT
CGTGGGCAGG CAGTTGCACT TTATCCCCTA CTTCCCAACA AGATGGAAGT TAGTCGAGCA
ACAAACGGAG AGCTGGTCTA TACCTACTAT CGTGATACTG ATGAAAGTGG CCTAAACCCA
AAAGGTGGCT ATGTCACACT CCGTAAAGAT GAAGTTCTAC ACATACCTGG CTTAGGTTTT
GATGGACTCA TTGGCTATAG CCCTATCGCT ATGGCGAAAA ATGCAATCGG TATGTCACTT
GCTACTGAAG AGTACGGTGC GGCATTCTTT GCCAATGGTG CTAATCCCGG AGGTGTGCTG
GAACACCCAG GAGTAATCAA AGATATACAG AGGGTCAAGG ATAGTTGGAA TAGCGCCTAC
CAAGGCACAG GCAACGCTCA CAAAATCGCT GTGTTGGAAG AAGGCATGAA GTTCCAAGCC
ATTGGTATCC CGCCGGAACA GGCGCAATTT CTTGAAACAC GGAAATTCCA AATTAATGAG
ATTGCGAGGA TTTTCCGTGT GCCGCCCCAT ATGGTGGGTG ATCTTGAGAA GTCTAGTTTC
TCCAATATTG AGCAGCAGTC TTTGGAGTTT GTAAAATACA CCCTCGATCC GTGGGTGGTG
CGATGGGAAC AAAGTCTCCA GCAATCGCTT ATTTTGCCTT CTGAGAAAAC TTCACTGTTT
ATCAAGTTCA ATTTGGACGG TCTGCTTCGT GGTGATTACC AAAGTCGTAT GAATGGCTAT
GCTACAGGTC GACAAAATGG CTGGATGTCT GCCAACGATA TCCGTGAACT GGAGGATATG
AACCGAATAC CGGCTGAGGA AGGCGGCGAT TTATATCTGG TTAACGGAAA TATGACAAAA
CTGGCTGACG CAGGCGCGTT TGCCAAAACC GAAGGAGGTC AGTAA
 
Protein sequence
MNLIKGLFRS RDKPQNRVGS AFSFLFGGTS SGKTVNERTA MQATAVYACV RILAEAIAGL 
PLHVYRYRSD GGKEKIPFHP LYYLLHDEPN PEMTSFVFRE TLMSHLLLWG NAYAQVVRNG
RGQAVALYPL LPNKMEVSRA TNGELVYTYY RDTDESGLNP KGGYVTLRKD EVLHIPGLGF
DGLIGYSPIA MAKNAIGMSL ATEEYGAAFF ANGANPGGVL EHPGVIKDIQ RVKDSWNSAY
QGTGNAHKIA VLEEGMKFQA IGIPPEQAQF LETRKFQINE IARIFRVPPH MVGDLEKSSF
SNIEQQSLEF VKYTLDPWVV RWEQSLQQSL ILPSEKTSLF IKFNLDGLLR GDYQSRMNGY
ATGRQNGWMS ANDIRELEDM NRIPAEEGGD LYLVNGNMTK LADAGAFAKT EGGQ