Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccur_02810 |
Symbol | |
ID | 8374489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptobacterium curtum DSM 15641 |
Kingdom | Bacteria |
Replicon accession | NC_013170 |
Strand | - |
Start bp | 331349 |
End bp | 332629 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644993205 |
Product | phage terminase, large subunit, PBSX family |
Protein accession | YP_003150690 |
Protein GI | 256826731 |
COG category | [R] General function prediction only |
COG ID | [COG1783] Phage terminase large subunit |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.965454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 3.50115e-26 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTAACG CTGCGCGCCT CATCATCTCC CACTTTCACA GCATCCTCGC CGACGTCTTC GCCGAGTGCG GTCATCACGA GTACTGGCTC GAGGGAGGGC GCGGCTCTAC CAAGTCGAGC TTTATCAGCC TTGTAATCGT CCTTCTAGTA GCTACCTTCC CCTGGGTTAA TGCCGTTGTC TTCCGCCGCC AGGCTAATAC GCTACGGGAT ACAGTCTACG GGCAGTTCCT CTGGGCGATA GGCGCCCTCG GGCTTGATGG CTGCTTCTAC ACATCCAAGT CCCCGCTGGA GATCGTCTAC CTGCGGACAG GGCAGAAGAT CATATTCCGC GGGCTAGACG ACCCGAAGAA GCGGAAGGGC GCGGTGTTCC CTGTCGGCTA CTGTGCCGTC CAGTGGTTTG AGGAGCTCGA CGAGTTCAAC GGCTGGGATG ACATCTCATC GACGCTCCGC ACGTATCGCC GTGGCGGGTC GAGGTTTTGG ACGTTCTACA GCTATAACCC TCCGCGGTCT CTGTGGTCGT GGGTAAACAA GAAGGCTCTG GAGATGCAGC GCAAGCGGGG GTGTGTGGTA GACCACAGCA CCTATCTCGA CGTGATCGAC GGCGGTCATA GTGACTGGCT CGGCGAGAAA TTTATAGAGG ATGCCGACTA CGAGAAAGAG GAGCACCCGA CGGGCTACCG CTGGGAGTTC CTCGGCGAGA TCACGGGGAC GGGCGGCAGC GTCTTTGAGA ATGTCGTACA GGTGAGCCTA AGCGACAAGG AGGTAGAGAG CTTCGATAAC CTCCGCTGCG GCGTGGACTG GGGCTGGTTC CCCGATCCCT GGCGCTTCGT CATGTGCGAG TGGCAGCCCG CACGGCGTCG CCTGGTGCTC TTCCGCGAGC TATCGGCTAA CCGTACCACG CCGCAGGATA CGGGGGCGAT GGTACGCGAG GCGCTGACGT ATCGGGATGC GCGGCACAAG GAGCCGACGT ACCACCGCGA CGCGGTCTGG TGCGATAGTG CCGAGCCGTC GAGTATCGAT ATATACCGCC GGCAGTGCGG ACTGAATGCC CGGGCGGCAG ACAAGGGCGG GATGAGACGC GTGAGCTACC AGTGGCTTGA GGGACTACGG GAGATCGCTA TCGACCCCGA GCGATGCCCG AGAGCCTGGG AGGAGTTCAC CCTGTGTGAG TACGCCAAGG ATCGCGCGGG GAGATGGCTC GATGACTACA ACGACGGTAA TGACCACAGT ATCGACGCAG TGCGCTACGC GATGATGCGC GAGTGCGTGA GAGGAGCATA G
|
Protein sequence | MINAARLIIS HFHSILADVF AECGHHEYWL EGGRGSTKSS FISLVIVLLV ATFPWVNAVV FRRQANTLRD TVYGQFLWAI GALGLDGCFY TSKSPLEIVY LRTGQKIIFR GLDDPKKRKG AVFPVGYCAV QWFEELDEFN GWDDISSTLR TYRRGGSRFW TFYSYNPPRS LWSWVNKKAL EMQRKRGCVV DHSTYLDVID GGHSDWLGEK FIEDADYEKE EHPTGYRWEF LGEITGTGGS VFENVVQVSL SDKEVESFDN LRCGVDWGWF PDPWRFVMCE WQPARRRLVL FRELSANRTT PQDTGAMVRE ALTYRDARHK EPTYHRDAVW CDSAEPSSID IYRRQCGLNA RAADKGGMRR VSYQWLEGLR EIAIDPERCP RAWEEFTLCE YAKDRAGRWL DDYNDGNDHS IDAVRYAMMR ECVRGA
|
| |