Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1356 |
Symbol | |
ID | 5083127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1389040 |
End bp | 1390221 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640482913 |
Product | HK97 family phage portal protein |
Protein accession | YP_001167558 |
Protein GI | 146277399 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.423386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCT GGCCCTTTTC TCGCAAGTCG CTCGCCGCGC CGTCCGATGA TCTGCTCGGG ATCTTCGGGG CACTTCCGAC CGCTGCCGGG GTGCCCCTGT CCGTCACCGA CGCGCTGAAG GTCCCCGCCG TGGCGTCGGC GATCCGCATC ATCAGCGAGG CCGCTGCCAG CCTTGATGTG AAGGTTGTCC AGGTGGCGGG GGACGGGGCC GAGACGAACG TGCCTGGGCA TGCCGTGGGC GCCCTCCTCT CGTCCGAGGC GAACGACTGG ACGACCGGCT TCGAGTTCAT CCGCGACCTG GTGATTGACG CTCTCACCTG CGACGTGGGC GGTCTGGCTT GGGTGAACCG GGTGGGCGGC AAGCCCATCG AAGTCATCCA CTACCGGCGC GGTGTGATGG CGGTCGAGTT CGACCAGGCA ACCGGCGAAC CCCGGTACAC GCTGAACAGC GGACCCGTGG CCTCGGCCGA GGTCATCCAC CTGCGTTCCC CCTTCGATCG CTGTCCCCTG ACCCTCGCTC GTGAGGCCAT TGGTGTGGCG GCCGTCATGG AGCGGCACGC CGCCCGCCTC TTCGGCCGCG GTGCCAGACC ATCCGGCGCC CTGGTGTTCC CGAAGGGCAT GGGTGAAGAG TCGGTGAAGA AGGCCCGCTC GGCGTGGCGG CAGACGCACG AAGGCGATGA CGCCGGGGGC CGCACGGCGA TCCTCTACGA TGGCGCCGAC TTCAAGCCCT TCACCCTGGC GAGCACCGAC GCACAGTTCC TCGAGAACCG GATCTTCCAG ATCCTCGAAA TCGCCCGCGC CTTCAGGGTC CCTCCCTCGA TGCTGTTCGA GCTGAACCGC GCGACCTGGT CGAACACGGA ACAGATGGGG CGCGAGTTCC TGGTGTACTG CCTGGAGCCG TGGCTCAAGT CACTTGAGGG GGCACTGGGT CGCGGGCTTC TGACGCAGGA AGAGCGCCGC TCCGGTCTCG CCGTCCGGTT CGACCGGGAC GACCTGACCC GAGCTGACCT TCAGACGCGG GCGACCACGA TCAATTCGCT CATTGCTTCC CTGGTGATCA ATCCGAACGA GGGCCGCAGC TGGCTCGGCC TGCCTCCGCG GGAGGGAGGC GACATGTTCC AGAACCCGAA CATCACCACC GCGGCCGGGG CGCCGAAGGA GGACACCGCC AATGCTGAAT GA
|
Protein sequence | MKLWPFSRKS LAAPSDDLLG IFGALPTAAG VPLSVTDALK VPAVASAIRI ISEAAASLDV KVVQVAGDGA ETNVPGHAVG ALLSSEANDW TTGFEFIRDL VIDALTCDVG GLAWVNRVGG KPIEVIHYRR GVMAVEFDQA TGEPRYTLNS GPVASAEVIH LRSPFDRCPL TLAREAIGVA AVMERHAARL FGRGARPSGA LVFPKGMGEE SVKKARSAWR QTHEGDDAGG RTAILYDGAD FKPFTLASTD AQFLENRIFQ ILEIARAFRV PPSMLFELNR ATWSNTEQMG REFLVYCLEP WLKSLEGALG RGLLTQEERR SGLAVRFDRD DLTRADLQTR ATTINSLIAS LVINPNEGRS WLGLPPREGG DMFQNPNITT AAGAPKEDTA NAE
|
| |