Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3316 |
Symbol | |
ID | 4898269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 374377 |
End bp | 375627 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640113915 |
Product | HK97 family phage portal protein |
Protein accession | YP_001045184 |
Protein GI | 126464071 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.246565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTCA TCACCCGCCT CGCCGCGCGC CTGCCGGCGC AGGTCCGCAG CGCCGCCTAC GACATCGAGA AGGAACGGCG GCTGTCGCTG TCGGACGGCT CGGCATGGTC GCGGCTCTTC GGCCGGACAT CCGCGGCCGG CAAGCCGGTC ACACTCGACA AGGCCATGCA GCTCTCGGCC GTCTGGGCCT GCGTCCGTCA GACCGCCATG GCCATCTCGG CCCTGCCGCT CGCCGTCTAC CGCAAGGAAG GCGACGGCTC CCGCAGCTCA GTGGATGACC GACTGGCCGA GGTCCTCTCG GTCTCGCCGA ACCTCGATCA GACCGCGCTC GAGCACTGGG AGGGGCAAGT GGCGTGGCTG ATGGTCAACG GCAATTGCTA TTCCGAGCGG ACCGACATCG GCGGGCGGCT GTCGTCGCTG CAGCCGCTGC CGGCCAACAT GACCCGCCCG ATCCGCAACA GCGACGGCGA GCTCTTCTAC CAGATCCTCG ATCGGGGGAA GAGCGAGGTG CTGCCCCGCG ACAAGGTCTT CCATGTGAAG GGATTCGGCT TCGGCGGGGA CATGGGGCTG TCGGCCATCA ACTTCGGCGT CCAGACCATG GGCACGGCGC TGGCGGCCGA CGAGAGCGCG GGCAAGCTCT TCTCGAACGG GATGCAGATC TCGGGGGTGC TGAAGGCAGG GCAGACGCTG ACCGCCGAGC AGCGTCAGCA GATGCGGACG ATGCTGGAGG CCTACCGCAG CTCGGACAAC GCCTGGAAGG TGATGGTGCT CGAAGCCGGA ATGAGCTTCG AGGCGCTGAC GCTGAACCCC GAGGATGCCC AGATGCTGGA GACCCGGCGC TTCCAGGTCG AGGACATCTG CCGCTGGTTC GGGGTGCCGC CGATCGTGAT CGGCCACGCG GGCGAGGGCC AGACGATGTG GGGCTCGGGC GTCGAGCAGA TCCTGATCGC CTGGATGGAG CTCGGGCTGA ACCCGGTGCT GCGGCGCATC GAGAAGCGGA TCCAGAAGGA TCTGATGCCC CGGGGTGAGC GGCTCTCGCG CTACGCCGAG TTCAACCGCG AGGGCATCCT CCAGATGGAC AGCAAGGCCA AGTCCGAGTT CCTGACCAAG CTCGTCTCCA ACGGGATCAT GTCCCGCAAC GAGGCCCGCG AGAAACTGAA CCTTTCCCGG CGCGACGGCG GCGACGAGCT GACGGCTCAG ACCGCGATGG CGCCGCTATC CGATCTCGGC CAGAAGGAGA ATCAGGCATG A
|
Protein sequence | MSLITRLAAR LPAQVRSAAY DIEKERRLSL SDGSAWSRLF GRTSAAGKPV TLDKAMQLSA VWACVRQTAM AISALPLAVY RKEGDGSRSS VDDRLAEVLS VSPNLDQTAL EHWEGQVAWL MVNGNCYSER TDIGGRLSSL QPLPANMTRP IRNSDGELFY QILDRGKSEV LPRDKVFHVK GFGFGGDMGL SAINFGVQTM GTALAADESA GKLFSNGMQI SGVLKAGQTL TAEQRQQMRT MLEAYRSSDN AWKVMVLEAG MSFEALTLNP EDAQMLETRR FQVEDICRWF GVPPIVIGHA GEGQTMWGSG VEQILIAWME LGLNPVLRRI EKRIQKDLMP RGERLSRYAE FNREGILQMD SKAKSEFLTK LVSNGIMSRN EAREKLNLSR RDGGDELTAQ TAMAPLSDLG QKENQA
|
| |