Gene Information |
Locus tag | Rpal_2092 |
Symbol | |
ID | 6409752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2266569 |
End bp | 2267744 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642711977 |
Product | phage portal protein, HK97 family |
Protein accession | YP_001991089 |
Protein GI | 192290484 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
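The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Rpal_2092).
length = gene_length_bp(2266569, 2267744)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Under these assumptions the computed values agree with the Gene Length (1176 bp) and Protein Length (391 aa) fields.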
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGATC GTCTCAAGGC CTTTCTCACC GTCCCGGAAG CCAAGACATC GCGAACCGCG CAGTTACTTG CGGTGGGATT CGGCGGAGTG GCGCGATTTA CCCCGCGGGA CTACGCCGGG CTGGCGCGAG AAGGCTACGT ACGAAATGCA ATCGTGTATC GCTGCGTGAG GCTGGTGGCG GAGAATGCCG CAGCCTGCGT GTTCGGCGTA TTCGACGGCG CGCAGGAGAA GGAGGCACAT CCGCTCGCGG CTTTGTTAGC GCGCCCCAAT CCTCGCCAGG ATGGCGCCGC GGTGCTGGAG ACGCTGTATG CGCATCTTCT GCTCGCGGGC AATGCCTATA TCGAGGCGGT GACGCTCGGT GAGGCCGTGC ACGAGCTCTA CGCGCTGCGG CCCGATCGCA TCAAACTGAT CCCGGGCGCC GATGGCTGGG CGGAGGCGTA TGATTACAGC GTCGGCGGCC GCACCGTACG GTTCGATCAG CATGCCGCTC CGGTTCCGCC GATCCTGCAT CTGACGTTTT TTCATCCGCT CGACGATCAT TACGGCCTGG CGCCGCTCGA AGCCGCCGCG GTCGCGGTCG ACACCCACAA CGCGGCGGCG CGCTGGAACA AGGCTCTGCT CGACAATTCC GCGCGGCCTT CCGGCGCGCT GGTGTACGCC GGCCCGGAAG GCGCTGTGCT CAGCGAGAAC CAGTTCGAAC GGCTGAAACG CGAATTGGAA CTCACCTACG AAGGTGCCGC CAATGCCGGC CGGCCGCTGC TGCTCGAAGG CGGGCTCGAA TGGACGGCGA TGGCGCTGTC GCCGAAGGAC ATGGACTTTC TTGAGGCCAA GCACGCCGCC GCGCGCGAGA TCGCGCTGGC GTTCGGCGTG CCGCCGATGC TGCTCGGCAT TCCCGGCGAC AACACGTTCT CGAACTATCA GGAAGCCAAC CGCAGTTTCG TGCGCCAGAC CGTGCTGCCG CTGGCGACGC GCGTCGGCAA TGCTCTGGCG CAGTGGCTGT CGCCGCAATT CGGAGATGGC GTGCGCCTGG TGATCGATAC CGACCGGATC GACGCGCTGT CACCCGACCG CACCGCGCTG TGGGACCGAG TCACCCGCGC GCCGTTCCTA ACCCTGAACG AAAAGCGCGA AGCGGTCGGC TACGCGCCGA TCGAAGGCGG GGACGGGTTG GGGTGA
|
Protein sequence | MFDRLKAFLT VPEAKTSRTA QLLAVGFGGV ARFTPRDYAG LAREGYVRNA IVYRCVRLVA ENAAACVFGV FDGAQEKEAH PLAALLARPN PRQDGAAVLE TLYAHLLLAG NAYIEAVTLG EAVHELYALR PDRIKLIPGA DGWAEAYDYS VGGRTVRFDQ HAAPVPPILH LTFFHPLDDH YGLAPLEAAA VAVDTHNAAA RWNKALLDNS ARPSGALVYA GPEGAVLSEN QFERLKRELE LTYEGAANAG RPLLLEGGLE WTAMALSPKD MDFLEAKHAA AREIALAFGV PPMLLGIPGD NTFSNYQEAN RSFVRQTVLP LATRVGNALA QWLSPQFGDG VRLVIDTDRI DALSPDRTAL WDRVTRAPFL TLNEKREAVG YAPIEGGDGL G
|
| |
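The gene and protein sequences above can be related by translating the nucleotide sequence, and the GC content field can be recomputed from the same string. The following is a minimal sketch, not the database's own method: `translate` and `gc_content` are hypothetical helper names, the codon table is built by hand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons).

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record above.
print(translate("ATGTTCGATC GTCTCAAGGC CTTTCTCACC"))  # MFDRLKAFLT
```

Passing the full Gene sequence field to `translate` and `gc_content` gives values that can be compared with the Protein sequence and GC content fields of the record.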