Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_2069 |
Symbol | |
ID | 3719459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 662387 |
End bp | 663637 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640070233 |
Product | HK97 family phage portal protein |
Protein accession | YP_352121 |
Protein GI | 77462617 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTCA TCACCCGCCT CGCCGCGCGC CTGCCGGCGC AGGTCCGCAG CGCCGCCTAC GACATCGAGA AGGAACGGCG TCTGTCGCTG TCGGACGGCC CGGGATGGTC GCGGCTCTTC GGCAGGACAT CCGCGGCCGG CAAGCCGGTC ACCCTCGACA AGGCCATGCA GCTCTCGGCC GTCTGGGCCT GCGTCCGTCA GACCGCCATG GCCATCTCGG CCCTGCCGCT CGCCGTCTAC CGCAAGGAAG GCGACGGCTC CCGCAGCTCG GTGGATGACC GGCTGGCCGA GGTCCTCTCG GTCTCGCCGA ACCTCGATCA GACCGCGCTC GAGCACTGGG AGGGGCAGGT GGCGTGGCTG ATGGTCAACG GCAACTGCTA TTCCGAGCGG ACCGACATCG GCGGGCGGCT GTCGTCGCTG CAGCCGCTGC CGGCCAACAT GACCCGCCCG ATCCGCAACA GCGACGGCGA GCTCTTCTAC CAGATCCTTG ATCGGGGGAA GAGCGAGGTG CTGCCCCGCG ACAAGGTCTT CCATGTGAAG GGGTTCGGCT TCGGCGGGGA CATGGGGCTG TCGGCCATCA ACTTCGGCGT CCAGACCATG GGCACGGCGC TGGCGGCCGA CGAGAGCGCG GGCAAGCTCT TCTCGAACGG GATGCAGATC TCGGGGGTGC TGAAGGCAGG GCAAACGCTG ACCGCCGAGC AGCGTCAGCA GATGCGGACG ATGCTGGAGG CCTACCGCAG CTCGGACAAC GCCTGGAAGG TGATGGTGCT CGAAGCCGGA ATGAGCTTCG AGGCGCTGAC GCTGAACCCC GAAGATGCCC AGATGCTGGA GACCCGGCGT TTCCAGGTCG AGGACATCTG CCGCTGGTTC GGGGTGCCGC CGATCGTGAT CGGCCACGCG GGCGAGGGCC AGACGATGTG GGGCTCGGGC GTCGAGCAGA TCCTGATCGC CTGGATGGAG CTCGGGCTGA ACCCGGTGCT GCGGCGCATC GAGAAGCGGA TCCAAAAGGA TCTGATGCCC CGGGGTGAGC GGCTCTCGCG CTACGCCGAG TTCAACCGCG AGGGCATCCT CCAGATGGAC AGCAAGGCCA AGTCCGAGTT TCTGACCAAG CTCGTCTCCA ACGGGATCAT GTCCCGCAAC GAGGCCCGCG AGAAACTGAA CCTTTCCCGG CGCGACGGCG GCGACGAGCT GACGGCTCAG ACCGCGATGG CGCCGCTATC CGATCTCGGC CAGAAGGAGA ATCAGGCATG A
|
Protein sequence | MSLITRLAAR LPAQVRSAAY DIEKERRLSL SDGPGWSRLF GRTSAAGKPV TLDKAMQLSA VWACVRQTAM AISALPLAVY RKEGDGSRSS VDDRLAEVLS VSPNLDQTAL EHWEGQVAWL MVNGNCYSER TDIGGRLSSL QPLPANMTRP IRNSDGELFY QILDRGKSEV LPRDKVFHVK GFGFGGDMGL SAINFGVQTM GTALAADESA GKLFSNGMQI SGVLKAGQTL TAEQRQQMRT MLEAYRSSDN AWKVMVLEAG MSFEALTLNP EDAQMLETRR FQVEDICRWF GVPPIVIGHA GEGQTMWGSG VEQILIAWME LGLNPVLRRI EKRIQKDLMP RGERLSRYAE FNREGILQMD SKAKSEFLTK LVSNGIMSRN EAREKLNLSR RDGGDELTAQ TAMAPLSDLG QKENQA
|
| |