Gene Information |
Locus tag | Shel_08230 |
Symbol | |
ID | 8394714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | - |
Start bp | 954173 |
End bp | 955384 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644985583 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_003143228 |
Protein GI | 257063556 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
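The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Gene Information table for Shel_08230.
length = gene_length_bp(954173, 955384)
protein = protein_length_aa(length)
print(length, protein)  # 1212 403
```

Both results match the Gene Length (1212 bp) and Protein Length (403 aa) fields reported in the record.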
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0061868 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00502474 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTAAGT ACAACACCGT TGCCGAGGCT TACGGTTTCT GGAAGAACTC TAGCGTCGCC GTCATGGAGG CACGCGCCAA GGCCATCGAG AAGGACATTG CAGAAGACCC GAACGCCGAC GTTGCGGGCT ACGCAATCGA GGCCGAGGCG TTGGAGCAGG CCATCGCGGA GAAGCGAGGC GCACAGCCCG CCACCAGCGC GAACGCGCCC GCCGAGGTCG CCAACACCGC CAAGGGCGAG AAGGACGGCG AGGGCGCGGC CTCGAAGGTC TACCGCTCCG CATTCTTCAA GCACCTGCAA GGCAACAAAC TCACCCAGGC CGAGCAGGCC GCGTTCGACA ACGTTAACGC CGAGGCGCGA GCCAACGCAT TTAACAAGCT GTCCGACACG GCGGCTGTTA TTCCGACCCA CACGCTTAAC GAGATCATCG TCAAGGCGCG TGACATGGGC GGCATCATGA GCATTTCCCG CGGCTTCCAC ATGCCCGCGA ACATCAGCAT TCCCGTTGCG ACACCGGGCG CGGCTGCGTC CTGGCACGTC GAGGGCGCTG CCGTCGAAAC CGAGAAGGTT TCTCCCGTCC CCGTCACGTT CGGCGCTTAC GAGATCATGC GCATTCTCTC CATTTCGGCG GCTGTGCGCA CCATGTCTAT CGGCGCGTTC GAGAGCTACC TTGCCGACGA GCTGACCGCC TCCGTCATGG CTTGCCTTGG TAACGCCATG GTTGACGGCA CGGGCAGCGG GCAGGGTACG GGCATCGTCT CCGGCATCAC CTGGACTGAC GGCACCAACA AGGTAACGGT TGCCGCCAAC AAGTCGCTTG CATACGCCGA CATCGTCAAT GCAATTGCCC TGCTGCATCG CGGCTACTCG CAGAACGCCC GATTTGTCAT GAACAACACG ACCCTTTACA CGGACGTTTA CGGCCTGGTC GACGAGAACA AGCGGCCTAT CTTCGTTGCT GACCCCGTGG AGAAGGGCAA GGGGCGCATT CTCGGTTTCC CCGTCGTGAT TGACGATTAC ATGGAAGACC ACGACATTCT GTTCGGCGAC TTCCGTTACA ACGGCTGGAA CATGCCCGAG GGAATCGCGC TTGACGTTTC CCGCGACAGC TCGTTCACCA AGGGGCTTAT CGACTACCGC GCCCTGGCAA TCGCGGACTG CAAGCCCATC GTCGCCGATG CTTTCGTTTA CGTCACCAAG GCGACGGCCT AA
|
Protein sequence | MTKYNTVAEA YGFWKNSSVA VMEARAKAIE KDIAEDPNAD VAGYAIEAEA LEQAIAEKRG AQPATSANAP AEVANTAKGE KDGEGAASKV YRSAFFKHLQ GNKLTQAEQA AFDNVNAEAR ANAFNKLSDT AAVIPTHTLN EIIVKARDMG GIMSISRGFH MPANISIPVA TPGAAASWHV EGAAVETEKV SPVPVTFGAY EIMRILSISA AVRTMSIGAF ESYLADELTA SVMACLGNAM VDGTGSGQGT GIVSGITWTD GTNKVTVAAN KSLAYADIVN AIALLHRGYS QNARFVMNNT TLYTDVYGLV DENKRPIFVA DPVEKGKGRI LGFPVVIDDY MEDHDILFGD FRYNGWNMPE GIALDVSRDS SFTKGLIDYR ALAIADCKPI VADAFVYVTK ATA
|
| |
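The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a simple GC-fraction calculation of the kind summarized by the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def ungroup(seq_text):
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text):
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = ungroup(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence in this record.
first_two_blocks = "ATGACTAAGT ACAACACCGT"
print(ungroup(first_two_blocks))      # ATGACTAAGTACAACACCGT
print(gc_fraction(first_two_blocks))  # 0.4
```

Passing the full Gene sequence field to `gc_fraction` would give a value that can be compared against the 61% GC content reported in the Gene Information table.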