Gene Information |
Locus tag | Ccel_2837 |
Symbol | |
ID | 7311457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3391103 |
End bp | 3392707 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643609732 |
Product | Terminase |
Protein accession | YP_002507111 |
Protein GI | 220930202 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
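The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Ccel_2837
gene_length, protein_length = cds_lengths(3391103, 3392707)
print(gene_length, protein_length)
```

Under these assumptions the result agrees with the record: 1605 bp and 534 aa. The strand field does not affect the length calculation, only the orientation in which the sequence is read.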
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGAAAC TGAAGAAATA TAAGCCGACC GCCTTTATAG CTGATGGGTC ATATTACGAT AAGGATGCTG CTGATTACGC TGTAGCTTTT ATCGAAGCAC TCTCCCATAC GAAAGGTTTA TGGGCAGGTA AGCCTTTTGA ACTTATCGAT TGGCAGGAGC AAATAATCCG TGATTTATTT GGAATTTTAA AGCCAGATGG ATATCGGCAG TTTAATACTG CTTATGTAGA GATACCTAAA AAGATGGGAA AAAGCGAGCT TGCCGCAGCA ATTGCACTTC TCCTCACTTG CGGTGATGGT GAAGAACGGG CGGAGGTGTA TGGTTGTGCC GCCGATCGCC AGCAGGCATC AATTGTATTT GAAGTAGCAG CCGATATGGT GCGGATGTGT CCGGCACTGA ATAAACGAGT AAAGTTGCTG GCTTCAACTA AGCGATTGGT GTACCTGCCG ACCAACAGCT TCTATCAGGT ATTGTCGGCT GAAGCCTACT CCAAACACGG CTTCAATATA CATGGTGTTG TTTTCGATGA ACTTCATACT CAGCCAAACC GGAAACTATT TGATGTTATG ACGAAGGGAT CTGGGGATGC AAGGACCCAA CCGCTGTATT TTCTTATCAC CACAGCGGGA ACGGATACTC AAAGTATCTG CTACGAAACA CACCAAAAAG CGGTTGATAT TATTGAGGGC AGAAAATACG ATCCTACCTT TTACCCCGTA ATCTATGGTG CTAAAGAAGA GGATGATTGG ACAGATCCAA AAGTGTGGAA GAAAGCAAAT CCAAGCTTAG GAATTACGGT GGGAATTGAC AAGGTAAGGG CAGCTTGTGA AAGTGCAAAG CAAAACCCTG CTGAAGAGAA CAGCTTCAGG CAGTTAAGAT TGAATCAGTG GGTTAAACAG TCTGTCCGTT GGATGCCAAT GGCAAAGTGG GATGCCTGTG CATTTCCAGT TATACCAGAA AGTCTTGAAG GGCGGGTATG TTATGGGGGT CTTGACCTAT CTTCTACAAC AGACATTACA GCCTTTGTGT TGGTGTTCCC ACCAGAGGAT GAAACAGATA AATACATTGT TCTTCCGTAT TTTTGGATGC CAGAGGACAA CATTGACCTC CGAGTCCGAA GAGACCATGT GCAATACGAT CTTTGGGAGA AGCAAGGGTA TATTCTAACC ACAGAAGGCA ATGTAGTGCA TTACGGCTAC ATTGAGCGGT TTATTGAAGA ACTGGGCGAA AAGTATAACA TTCGAGAAAT TGCGTTTGAC CGTTGGGGAG CTGTTCAAAT GGTTCAGAAC CTTGAAGGAT TAGGCTTTAC TGTCGTTCCT TTCGGTCAAG GCTTTAAAGA TATGTCACCA CCAACCAAAG AACTGATGAA ATTGACATTA GAAGAAAGAA TAGCACACGG TGGGCATCCA GTGCTTCGGT GGATGATGGA CAACATCTTT ATAAAAACTG ATCCGGCGGG CAACGTGAAG CCGGATAAAG AAAAAAGTAC AGAAAAAATA GATGGCGCGG TGGCAACTAT CATGGCACTT GATCGTGCTA TTCGTTGTGG CTCAGGTAAT AGTGGGGATT CGGTGTATGA CGAGCGAGGT TTGATTGTCT TTTAA
|
Protein sequence | MRKLKKYKPT AFIADGSYYD KDAADYAVAF IEALSHTKGL WAGKPFELID WQEQIIRDLF GILKPDGYRQ FNTAYVEIPK KMGKSELAAA IALLLTCGDG EERAEVYGCA ADRQQASIVF EVAADMVRMC PALNKRVKLL ASTKRLVYLP TNSFYQVLSA EAYSKHGFNI HGVVFDELHT QPNRKLFDVM TKGSGDARTQ PLYFLITTAG TDTQSICYET HQKAVDIIEG RKYDPTFYPV IYGAKEEDDW TDPKVWKKAN PSLGITVGID KVRAACESAK QNPAEENSFR QLRLNQWVKQ SVRWMPMAKW DACAFPVIPE SLEGRVCYGG LDLSSTTDIT AFVLVFPPED ETDKYIVLPY FWMPEDNIDL RVRRDHVQYD LWEKQGYILT TEGNVVHYGY IERFIEELGE KYNIREIAFD RWGAVQMVQN LEGLGFTVVP FGQGFKDMSP PTKELMKLTL EERIAHGGHP VLRWMMDNIF IKTDPAGNVK PDKEKSTEKI DGAVATIMAL DRAIRCGSGN SGDSVYDERG LIVF
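The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may serve as starts). The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation already, and no reverse complement is applied here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record
print(translate("ATGCGGAAAC TGAAGAAATA TAAGCCGACC"))
```

Applied to the first three blocks of the gene sequence, `translate` yields the first block of the protein sequence, `MRKLKKYKPT`. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.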
|
| |