Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3489 |
Symbol | |
ID | 3911291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3992035 |
End bp | 3993210 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885391 |
Product | Phage portal protein, HK97 |
Protein accession | YP_487095 |
Protein GI | 86750599 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGATC GCCTCAAAGC CTTTCTCGCG ACCCCTGAAG TCAAAGCCTC GCGCACCGCG AAACTGCTGG CGTTCGAATC CGGAGGCGTT GCACGGTGGA CACCGCGGGA CTACGCGCGG TTGTCGCGCG AAGGTTATGT CTCCAACGCG GTGGTGCATC GCTGCGTCCG GCTGATCGCC GAAAACGCGG CGGCCTGCAC GTTTCTGGTG TTCGACGGCG CGCAGGAGAA GGAGGCGCAT CCGCTGGCGC AACTGATCGC GCGGCCGAAT CCGCGGCAGG ACGGCGCCGC GCTGTTCGAG ACGCTGGTGG CGCATCTCTT GCTCGCCGGC AACGCCTATG TGGAGGCGGT GGCGCTCGGC GACGCGGTGC ACGAACTCTA CGCGCTGCGG CCGGACCGGA TGAAGGTGGT GCCTGGGCCG GACGGCTGGG CCGCGGCCTA CGACTACGTG GTCGGCGGCC GCAGCGTGCG GTTCGATCAG CACGCGACGC CGGTGCCGCC GATCCTGCAT CTGACGTTCT TTCATCCGCT CGACGATCAT TATGGTCTGG CACCGCTCGA GGCCGCCGCG GTCGCGGTCG ACACCCACAA CGCCGCGGCG CGCTGGAACA AGGCTCTGTT GGACAATTCG GCGCGGCCCT CCGGCGCGCT GATGTATGCC GGGCCGGAAG GCGCGGTGCT CTCCGACGAG CAGTTCGGCC GGCTGAAGCG CGAGCTGGAG ACGACCTATG AGGGCGCCGC CAATGCCGGC CGGCCGCTGC TGCTCGAAGG CGGGCTCGAC TGGCGGCCGA TGGCGCTGTC GCCGAAGGAC ATGGACTTCC TCGAAGCCAA ACACGCGTCC GCGCGAGAAA TCGCGCTCGC CTTCGGCGTG CCGCCGATGC TGCTCGGCAT TCCGGGTGAC AACACCTTTG CGAACTATCA GGAAGCCAAC CGCAGCTTCG TCCGCCAGAC TGTGCTGCCG CTGGCGACCC GGATCGGCAA TGCGCTGGCG CAATGGCTGG CGCCGCAATT CGGCGACGGC GTGCGCCTCG TGATCGACAC CGACCGCATC GACGCGCTGG CGAGCGACCG CGTCGCGCTG TGGGAACGCG TCAGCGCCGC GCCGTTCCTG ACGCTGAACG AGAAGCGTGA AGCCGTCGGC TACGCGCCGC TCGACGGCGG CGACCGGCTG GGGTGA
|
Protein sequence | MLDRLKAFLA TPEVKASRTA KLLAFESGGV ARWTPRDYAR LSREGYVSNA VVHRCVRLIA ENAAACTFLV FDGAQEKEAH PLAQLIARPN PRQDGAALFE TLVAHLLLAG NAYVEAVALG DAVHELYALR PDRMKVVPGP DGWAAAYDYV VGGRSVRFDQ HATPVPPILH LTFFHPLDDH YGLAPLEAAA VAVDTHNAAA RWNKALLDNS ARPSGALMYA GPEGAVLSDE QFGRLKRELE TTYEGAANAG RPLLLEGGLD WRPMALSPKD MDFLEAKHAS AREIALAFGV PPMLLGIPGD NTFANYQEAN RSFVRQTVLP LATRIGNALA QWLAPQFGDG VRLVIDTDRI DALASDRVAL WERVSAAPFL TLNEKREAVG YAPLDGGDRL G
|
| |