Gene Information |
Locus tag | Rsph17029_3318 |
Symbol | |
ID | 4898538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 376476 |
End bp | 377777 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640113917 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001045186 |
Protein GI | 126464073 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
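The coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention explicitly). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(376476, 377777)
print(length_bp)                  # 1302
print(protein_length(length_bp))  # 433
```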
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0235436 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATC TGAACATGCC GCTGGTGGCG TCCGCGCTCC TCGCGGCGAC CCAGCCCCAT GCCGTGCTCT GTGCCCCCCG GGCAGAGGGT GGCGCGGGCA ACCTCGAAGC CCTGCTGAAG GAGGTCAAGC AGGAGCTCGA CCGCATCGGC AATGACGTCC GCAAGACGGC CGACACCGCC TTCCAGGAGG CGAAGAACGC GGGCAAGCTC TCGGACGAGA CGAAGGCCAA GGCCGACAGT CTGCTGACGG CGCAGAACGC CCTGCAGGAT TCGGTCGCCA AGCTGCAGCA GCGGCTGGAG GACATGGACG CGCGCAACCT CGACATCGAG CAGCGCATGT CCGGTCGCCG GGGCGGGGGC ACCGCGCGCC AGACCCTCGG GCAGGCGATC TCGATGGACG CCCAGGTGAA GGCCTTCAAC GGCAAGGGCA CCATCACTCT CATCGTGCAG AACGCGATCA CCTCGGGTTC GGCCTCGGCC GGCCCGCTGA TCGCGCCCCA GCGCGAAACC GAGATCGTGG GTCTCCCGCG CCGGCAGGTG TTCGTCCGTG ACCTTCTGAG CCGGTCCACC ACCAACTCGA ACCTCGTGCA GTATGCCCGC ATGAAGGCCC GCACCAATGC CGCCGGCGTC GTGGCGGAAG GCGCGCTGAA GCCCGAGAGC GGGCTGGAGT ATGAGGCCGC TGATGCTCCG GTGCGCACCA TCGCGCACTG GATCCCGGTC TCGCGGCAGG CGCTGGAAGA TGCCGACCAG CTGCAGGGCG AGATCGACGG CGAGCTTCGC TACGGTCTCG ACCTGACCGA GGAGGCGGAG ATCCTCTCGG GCGACGGCGA GGGTCAGCAC CTGTCGGGCC TGATCACCAA CGCCAGCGCC TATTCCGGCG CCTACGAGCC TGCCGGTGCC ACGGCGATCG ACAAGCTGCG CTTCGCGCTG CTGGAGGCGA GCCTTGCTCT CTATCCGGCG GATGGGATGG TGCTCAACGA GATCGACTGG GCGCTGATCG AGACGGCCAA GGATTCCGAG AACCGCTACA TCTTCGCGAA CCCCCTGCAG CTGGCCGGCC CCGTGCTCTG GGGCCGCCCC GTCGTGCCGA CGACCGAGAT CGACGAGGAC AAGTTCCTGG TGGGCGCATT CCGTGCGGCC GCCACGATCT ACGACCGCAT GGACACCGAG GTGCTGATCT CGTCCGAGGA CCGGGACAAC TTCGTGAAGA ACATGCTGAC CGTGCGGGCC GAGAAGCGGC TGGCGCTGGC CATCAAGCGC GCGGCCGCGC TGATCTACGG CGACTTCGGC CGCGTCGCCT GA
|
Protein sequence | MKHLNMPLVA SALLAATQPH AVLCAPRAEG GAGNLEALLK EVKQELDRIG NDVRKTADTA FQEAKNAGKL SDETKAKADS LLTAQNALQD SVAKLQQRLE DMDARNLDIE QRMSGRRGGG TARQTLGQAI SMDAQVKAFN GKGTITLIVQ NAITSGSASA GPLIAPQRET EIVGLPRRQV FVRDLLSRST TNSNLVQYAR MKARTNAAGV VAEGALKPES GLEYEAADAP VRTIAHWIPV SRQALEDADQ LQGEIDGELR YGLDLTEEAE ILSGDGEGQH LSGLITNASA YSGAYEPAGA TAIDKLRFAL LEASLALYPA DGMVLNEIDW ALIETAKDSE NRYIFANPLQ LAGPVLWGRP VVPTTEIDED KFLVGAFRAA ATIYDRMDTE VLISSEDRDN FVKNMLTVRA EKRLALAIKR AAALIYGDFG RVA
|
| |
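The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this in plain Python, not the method used to generate this record. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ only in which codons may act as start codons; this gene begins with ATG, so no special handling of alternative starts is included). The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (standard codon assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAAACATC TGAACATGCC GCTGGTGGCG"))  # MKHLNMPLVA
print(translate("CGCGTCGCCT GA"))                     # RVA
```

Passing the full gene sequence to `translate` in the same way allows the result to be compared against the listed protein sequence once the spaces are removed from both.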