Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2834 |
Symbol | |
ID | 7311454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3387912 |
End bp | 3389105 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643609729 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002507108 |
Protein GI | 220930199 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.174812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA TTCTTGAATT GCGTGAGAAA CGCGCTAAAG CTTGGGACGC AGCAAAGGCA TTCCTTGATT CAAAACGTGG CGGTGACGGA CTGTTATCCG CCGAGGACAC GACAACCTAT GAAAAAATGG AAGCCGATGT GGTGGCTCTT GGTAAGGAAA TTGAGCGTTT AGAACGCCAA GCATCTATCG ACTTAGAACT GTCCAAAGCA ACCAGTAACC CAATTACGAA CGAACCTACT AGAACTGGAG AGGAAAAGAC CGGCCTTGCA AGTGCTGAAT ACAAAAAAGC TTTCTGGAAT GCGATGCGTG ACAATGTCAG CTATGAAGTA AGGAACGCTC TAAAGATTGG CACTGATTCT GAAGGTGGAT TTCTTGTACC TGATGAGTTT GAGCGTACCC TAGTAGAAGC TCTAGAGGAA GAAAATATTT TCCGTAGATT GGCCAATGTA ATCACTACAT CTTCTGGTGA CCGCAAGATT CCTGTTGTTG CAAGCAAAGG CAATGCAAGC TGGATCGATG AAGAAGGGGC TATTCCAGAG AGTGATGACA GCTTCGGTCA AGTATCCATC GGTGCTTATA AACTAGCAAC GATGATTAAA GTCTCTGAGG AACTGCTAAA TGATTCCGTG TTTAATCTCG AAAGCTACAT CACAAGAGAA TTCGCACGTC GCATTGGTAA CAAGGAGGAA GAAGCCTTCT TTGTAGGTGA TGGCACAGGT AAGCCAACAG GAATTCTAAA TGCCACAGGC GGTGGTCAAG TTGGTGTTAC TGCGGCAAGT GCCACTGCCA TCACTTTGGA TGAGGTATTA GATTTATTCT ACAGCTTAAA AGCACCGTAT CGTAATAAGG CAGTATTCGT AATGAACGAT GCCACTATAA AGGCTATTCG TAAATTGAAA GACGGTAATG GGCAATACCT ATGGCAACCT TCCATCCAAG CGGGAACACC TGATACGATT CTTAACCGCC CGCTGTATAC CTCATCATAT GTACCTACTG CTGAAGCTGG TGCAAAGACT GTGGTATTCG GTGATTTTAG TTATTACTGG GTGGCAGACC GTCAAGGACG AGTATTCAAA CGCTTAAATG AACTCTATGC TGTCACAGGT CAAGTAGGAT TTATTGCGAC TCAACGAGTT GACGGAAAGC TTATCTTACC GGAGGCCGTT AAGGTACTCC AACAGAAAGC CTAA
|
Protein sequence | MSKILELREK RAKAWDAAKA FLDSKRGGDG LLSAEDTTTY EKMEADVVAL GKEIERLERQ ASIDLELSKA TSNPITNEPT RTGEEKTGLA SAEYKKAFWN AMRDNVSYEV RNALKIGTDS EGGFLVPDEF ERTLVEALEE ENIFRRLANV ITTSSGDRKI PVVASKGNAS WIDEEGAIPE SDDSFGQVSI GAYKLATMIK VSEELLNDSV FNLESYITRE FARRIGNKEE EAFFVGDGTG KPTGILNATG GGQVGVTAAS ATAITLDEVL DLFYSLKAPY RNKAVFVMND ATIKAIRKLK DGNGQYLWQP SIQAGTPDTI LNRPLYTSSY VPTAEAGAKT VVFGDFSYYW VADRQGRVFK RLNELYAVTG QVGFIATQRV DGKLILPEAV KVLQQKA
|
| |