Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0644 |
Symbol | |
ID | 4897021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 658831 |
End bp | 660036 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640111227 |
Product | HK97 family phage portal protein |
Protein accession | YP_001042529 |
Protein GI | 126461415 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGATGT GGCACAAGCT GGTCAACAAG ATGCTGACCG CGCGCGATGG CGATCTCTAC GAGGCGGTCG GCGCCGCCGA GACGTGGGCG GGCGAGCCTG TCTCGGCGCA GGGGGCCATG AACCTCTCGG CGTTCTTCGC CTGCGCGCGG GTGACGGCCG AGACGGTCGC CAGCCTCTCG CTCGAGGTCA TGGAGCGGAA AGAGGACGGG ACGAAGGTTC GCGTGGCCCA TGGCCATCCG CTGCAGGAGT TGCTGGGTGG CTCCCCGAAT GCGGACCAGA CGCCCATGGA GTTCTGGGAG GGTCGAATCC TCGGCCTCTG CACCACCGGC AACGCCTTTG CGGAGAAGGT CTATCAGGGG AACCGCGTCG TGGCGCTCCT GCCCATGCCC GCGACGACTG CGGTGGAGCG GCGGGGGGAC GACCTGCTCT ATCGCTTCAA TGACCGCGGG CGGGCGGTCG TCCTGCCAGC CGACAAGGTC TTCCACGTCA AGGCGTTCGG GGACGGCGAT GTCGGCTTGT CGCCGGTGGA ATATGCGCGC CAGACGCTCG GGATCGCCAT CGCCTCGGAG CGCGCGGCCG GGCAGGTCTA CTCCCGCGGG CTGCGGGCGA AGGGCTTCTT CCTCATTCCG GGGGCGCTCA CTCCGGAGCA GCGCGAGGCC GCCCGGAAGA ACCTGGCTGA TCGCTACTCG GCCAAGGACG CGCCGGGGGT GGGCATCCTT GAGGGCGGGG TGAAGTTCGA GGGGGTCAAC ATCACGCCGC GCGATGCGGA GTTGATCCTG AATCGGCGCT TCAACGTCGA GGAGGTCTGC CGCTGGATGG GATGCCCTCC GATCCTCGTG GGCCACGCCG CGCAGGGCCA GACGATGTGG GGCACGGGCG TCGAAGCCGT CATGCAGCAA TGGCTGAACC TGTCGCTGCG GGCGCTCCTG AAGCGGATCG AGCAAGCATC GGCAAAGCGG GTGCTGTCGG TGTCCGAGCG CGGCCGGTTC TCGGCGAAGT TCAATTACGA GGATCTGCTC CGCAGCAACT CGGCGGCGCG GGCGGCCTAC TACACCTCGC TTCTCAACTG CGGGGTGCTG ACCATCAACG AGGCCCGGCG GCTTGAGGGC CTGCCTCCCG TCGAGGGCGG CGATGTGCCG CGAATGCAGA TGCAGAACGT TCCCATTACG GAGGCCGGCG CGGAACCGCC GGGTGAGCAG CCATGA
|
Protein sequence | MGMWHKLVNK MLTARDGDLY EAVGAAETWA GEPVSAQGAM NLSAFFACAR VTAETVASLS LEVMERKEDG TKVRVAHGHP LQELLGGSPN ADQTPMEFWE GRILGLCTTG NAFAEKVYQG NRVVALLPMP ATTAVERRGD DLLYRFNDRG RAVVLPADKV FHVKAFGDGD VGLSPVEYAR QTLGIAIASE RAAGQVYSRG LRAKGFFLIP GALTPEQREA ARKNLADRYS AKDAPGVGIL EGGVKFEGVN ITPRDAELIL NRRFNVEEVC RWMGCPPILV GHAAQGQTMW GTGVEAVMQQ WLNLSLRALL KRIEQASAKR VLSVSERGRF SAKFNYEDLL RSNSAARAAY YTSLLNCGVL TINEARRLEG LPPVEGGDVP RMQMQNVPIT EAGAEPPGEQ P
|
| |