Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1751 |
Symbol | |
ID | 5083657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1787236 |
End bp | 1788498 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640483311 |
Product | HK97 family phage portal protein |
Protein accession | YP_001167949 |
Protein GI | 146277790 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.518591 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCTTT TCGACTTCTT CCGGAGCGAG CCGCAGGTGG CCCGCGTGGA GCCGCCTGTG GTGGCGCAGG GCTCCGGCGA TGTGCAGAGC CCCGGCCAAT GGCGCGGCTT CGTCACCGGC GGCGTCTCGC GCTCCGGGGT GCGGGTGAAC GAGACGACGG CGCTTTCGAT CCCCGCCACG CTTCAGGCGA TCCGGGTCCT CTCCGGTGTG TTCGCCATGA CGCCGCTGCA CTATTTCCGC CGCACCGGTG ATGGCCGCGA GCGCGTGTCG GATGACATCG CAGCCCTCCT TCACGACCGG CCGAACAGCC ATCAGACCGC GTTCGCCTTC CGCGAACTGC TCAAGATGGA CCTGCTGCTG TCGGGGAACT TCTACGCCTA TGTCAGCCGC GACTTCGCCG GCCGTCCGAA GGCGCTGACG CGCCTCAAGC CCGGCAGCGT CCTGATTGCG GAGTACTTCG ATCGCTCGGA GGGGGTCACG CTCTTTTATG ATGCAACCTT GCCGGACGGG TCGCGGGAGA GGTTTCCCGC CCGGGACATC TGGCACATTG CAGGCATGAG CCGTGATGGG CTGTCCGGAC TGAACCCGAT CCAGTTCGCG CGCGACGCCA TCGGCGGGGC CATCGCCACG GCTGACCATG CCGCGAAGTT CTGGGGGAAC GGGGGGCGTC CAAGCACCCT GCTGAAGACC AAGCACAAGG TGGACCCGAT CGCGCGAAAG CAGATCAAGT CCGACTGGAA GGCGATCTAC GGCGGACCGT TCGGCGACGA CATTGCCGTC CTCGACCAGG AGTTGGAGGC CCAGTTCCTC AGCCACGACA ACAAGGCGTC GCAGTACCTT GAGACGCGCG GCTTTCAGGT CATGGACCTG GCGCGCCTCT GGGGCGTGCC GCCGCATCTG ATCTTCGACC TGTCGAGGGC CACCTTCTCG AATATCGAGC AGCAGAGCCT CGAGTTCATC GTGTTCCACC TCGGCCCGCA CTACGAGCGG GTGAGCCAGT CGGCCACGCG CCAGTTCGCC GCGGATGCCC ATTATTTCGA ACATGTCACC GACGCTCTGG TGAAGGGCGA TGTGAAGAGC CGCATGGAGG CCTACTGGCT CCAGCGGCAG ATGGGCATGG TCAACGCCAA CGAGCTGCGT CGGCGCGACA ACCTCTCGCC GATCTCTGGC GATGCCGGCG AGGAATACTG GCGTCCCGCC GCCATGACGC TGGCGGGCAC GCCGCCAGAG CAACCCGCGC AGCGGGCCTC GGCAGAACCC TGA
|
Protein sequence | MGLFDFFRSE PQVARVEPPV VAQGSGDVQS PGQWRGFVTG GVSRSGVRVN ETTALSIPAT LQAIRVLSGV FAMTPLHYFR RTGDGRERVS DDIAALLHDR PNSHQTAFAF RELLKMDLLL SGNFYAYVSR DFAGRPKALT RLKPGSVLIA EYFDRSEGVT LFYDATLPDG SRERFPARDI WHIAGMSRDG LSGLNPIQFA RDAIGGAIAT ADHAAKFWGN GGRPSTLLKT KHKVDPIARK QIKSDWKAIY GGPFGDDIAV LDQELEAQFL SHDNKASQYL ETRGFQVMDL ARLWGVPPHL IFDLSRATFS NIEQQSLEFI VFHLGPHYER VSQSATRQFA ADAHYFEHVT DALVKGDVKS RMEAYWLQRQ MGMVNANELR RRDNLSPISG DAGEEYWRPA AMTLAGTPPE QPAQRASAEP
|
| |