Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1133 |
Symbol | |
ID | 4895324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1177187 |
End bp | 1178350 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640111719 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001043015 |
Protein GI | 126461901 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.308903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCAG GCCCCGATCC GGCCGTGGAG GCGAAAGCCG CAATGGCCGG TTTCCTGAAG GAGATCAATC GCTTTCAGGA GGAGGTGAAG AATGTGCTGC AACAACAGGA AGAGCGTTTG ACCATGCTGG ACCGCAAAAC CATGATCTAC GGGCGCCCGG CGCTGGCGGC CGCGGCCGAC CAGGAGGCGC CGCATCGAAA GGCGTTCGGG GCCTATCTCC GCTCGGGCGA CGACGATGGT CTGCGCGGCC TCGTCCTCGA GGGCAAGGCG ATGACGGCGA GCGTCGCCTC GGACGGCGGC TATCTGGTCG ATCCGCAGAC CTCGGACGCC ATCCGCTCGA TGTTGCTGTC CACGGCCTCG ATCCGTCAGA TCGCCGGTGT GGTCCATGTG GAAGCCACGA GCTTCGACGT GCTGATCGAC CGCACTGAGG TGGGGTCGGG CTGGGCCACG GAGGCCGCCA CGATCAGCGA AAGCGCCTCG CCCACCATCG AGCGGATCTC GATCAAGCTG CACGAACTGT CGGCGATGCC GAAGGCGAGC CAGCGGCTGC TGGACGACTC GGCCTTCGAC GTCGAGAGCT GGCTGGCGGC CAAGATCGCG ACGCGCTTCA TGCGGGCCGA GAGCGCGGCC TTCGTCAGCG GCGACGGGAT CGACAAGCCG CGGGGCTTTC TGGCGCCGGC GAAGGTTGCG AACGCGAGCT GGAGCTGGGG CTCGATCGGC TATGTCCCCT CGGGTGCGGC GAGCGATTTC CTCGCCACGA ACCCGGCCGA TTGTATCATC ACCCTGATCT ATTCGCTCGG CGCCGATTAC CGCGCGAATG CGACCTTCGT GATGAATTCG AAGACCGCGG GCGCGGTGCG GAAGATGAAG GACTCGGACG GCCGCTTCCT GTGGTCGGAC GGTCTGGCCG CGGCGGAGCC TGCGCGGCTG ATGGGATATC CGGTGCTCCT CTGCGAGGAC ATGCCGGACA TTGCCGCGGG CGCCTTTGCC ATCGCTTTCG GGGATTTCGC CGCCGGCTAC ACGATCGCCG AGCGGCCCGA GGTGCGGGTT CTGCGCGATC CGTTCTCGGC CAAGCCCCAT GTCCTCTTCT ATGCGACGAA GCGCGTGGGA GGCGATGTCA GCGACTATGC GGCGATCAAG CTCCTGAAGA TCGCGGTGTC CTGA
|
Protein sequence | MSAGPDPAVE AKAAMAGFLK EINRFQEEVK NVLQQQEERL TMLDRKTMIY GRPALAAAAD QEAPHRKAFG AYLRSGDDDG LRGLVLEGKA MTASVASDGG YLVDPQTSDA IRSMLLSTAS IRQIAGVVHV EATSFDVLID RTEVGSGWAT EAATISESAS PTIERISIKL HELSAMPKAS QRLLDDSAFD VESWLAAKIA TRFMRAESAA FVSGDGIDKP RGFLAPAKVA NASWSWGSIG YVPSGAASDF LATNPADCII TLIYSLGADY RANATFVMNS KTAGAVRKMK DSDGRFLWSD GLAAAEPARL MGYPVLLCED MPDIAAGAFA IAFGDFAAGY TIAERPEVRV LRDPFSAKPH VLFYATKRVG GDVSDYAAIK LLKIAVS
|
| |