Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2836 |
Symbol | |
ID | 7311456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3389815 |
End bp | 3391059 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643609731 |
Product | phage portal protein, HK97 family |
Protein accession | YP_002507110 |
Protein GI | 220930201 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTAA TAAAAGGACT ATTTCGTTCA AGAGACAAAC CGCAAAACCG TGTGGGCAGT GCATTTTCCT TTTTGTTTGG CGGTACGTCA TCTGGCAAAA CGGTTAATGA GCGTACTGCA ATGCAGGCAA CTGCGGTGTA TGCTTGCGTA AGGATACTAG CTGAAGCAAT TGCAGGACTG CCACTACATG TATATAGATA TCGTTCTGAT GGAGGTAAAG AAAAGATTCC TTTCCACCCG CTGTATTACC TTCTTCATGA TGAACCAAAT CCAGAGATGA CTTCATTCGT GTTTCGAGAA ACACTGATGA GTCATCTTTT GCTTTGGGGA AATGCCTATG CACAGGTGGT CAGAAACGGT CGTGGGCAGG CAGTTGCACT TTATCCCCTA CTTCCCAACA AGATGGAAGT TAGTCGAGCA ACAAACGGAG AGCTGGTCTA TACCTACTAT CGTGATACTG ATGAAAGTGG CCTAAACCCA AAAGGTGGCT ATGTCACACT CCGTAAAGAT GAAGTTCTAC ACATACCTGG CTTAGGTTTT GATGGACTCA TTGGCTATAG CCCTATCGCT ATGGCGAAAA ATGCAATCGG TATGTCACTT GCTACTGAAG AGTACGGTGC GGCATTCTTT GCCAATGGTG CTAATCCCGG AGGTGTGCTG GAACACCCAG GAGTAATCAA AGATATACAG AGGGTCAAGG ATAGTTGGAA TAGCGCCTAC CAAGGCACAG GCAACGCTCA CAAAATCGCT GTGTTGGAAG AAGGCATGAA GTTCCAAGCC ATTGGTATCC CGCCGGAACA GGCGCAATTT CTTGAAACAC GGAAATTCCA AATTAATGAG ATTGCGAGGA TTTTCCGTGT GCCGCCCCAT ATGGTGGGTG ATCTTGAGAA GTCTAGTTTC TCCAATATTG AGCAGCAGTC TTTGGAGTTT GTAAAATACA CCCTCGATCC GTGGGTGGTG CGATGGGAAC AAAGTCTCCA GCAATCGCTT ATTTTGCCTT CTGAGAAAAC TTCACTGTTT ATCAAGTTCA ATTTGGACGG TCTGCTTCGT GGTGATTACC AAAGTCGTAT GAATGGCTAT GCTACAGGTC GACAAAATGG CTGGATGTCT GCCAACGATA TCCGTGAACT GGAGGATATG AACCGAATAC CGGCTGAGGA AGGCGGCGAT TTATATCTGG TTAACGGAAA TATGACAAAA CTGGCTGACG CAGGCGCGTT TGCCAAAACC GAAGGAGGTC AGTAA
|
Protein sequence | MNLIKGLFRS RDKPQNRVGS AFSFLFGGTS SGKTVNERTA MQATAVYACV RILAEAIAGL PLHVYRYRSD GGKEKIPFHP LYYLLHDEPN PEMTSFVFRE TLMSHLLLWG NAYAQVVRNG RGQAVALYPL LPNKMEVSRA TNGELVYTYY RDTDESGLNP KGGYVTLRKD EVLHIPGLGF DGLIGYSPIA MAKNAIGMSL ATEEYGAAFF ANGANPGGVL EHPGVIKDIQ RVKDSWNSAY QGTGNAHKIA VLEEGMKFQA IGIPPEQAQF LETRKFQINE IARIFRVPPH MVGDLEKSSF SNIEQQSLEF VKYTLDPWVV RWEQSLQQSL ILPSEKTSLF IKFNLDGLLR GDYQSRMNGY ATGRQNGWMS ANDIRELEDM NRIPAEEGGD LYLVNGNMTK LADAGAFAKT EGGQ
|
| |