Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1966 |
Symbol | |
ID | 4022448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2205112 |
End bp | 2206287 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637962159 |
Product | Phage portal protein, HK97 |
Protein accession | YP_569102 |
Protein GI | 91976443 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.811617 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGATC GTCTGAAGGC TTTTCTCGCC CCGCCCGAAG CCAAGGCTTC GCGGACCGCG CAATTGCTGG CGTTTCAGGG GGGAGGGCAG CCGCGCTGGA CGCTGCGGGA CTACGCGGCG CTGGCGCGCG AGGGTTATCT GTCGAATGCG ATCGTGCATC GCTCGGTGCG GCTGATCGCC GAGAACGCGG CGGCTTGCAC CTTCCTGGTG TTCGACGGCG CGCAGGAGAA AGACGCGCAT CCGCTGGCGC AGCTGATCGC GCGGCCCAAT CCGCGGCAGG ACGGTGCCGC GCTGTTCGAG ACGCTGTATG CGCATCTGCT GCTCGCCGGA AACGCCTATG TCGAGGCGGT GGCGCTGGGC GACTCCGTGC ATGAACTCTA TGCGCTGCGG CCGGACCGGA TCAAGGTCGC GCCCGGGCCG GACGGCTGGG CCGAGGCCTA TGACTACAGC GTCGGCGGCC GCAGCGTGCG GTTCGATCAG CACGCGCCGG GCGTGCCGCC GATCCTGCAT CTGACGTTCT TCCATCCGCT CGACGATCAC TACGGCCTCG CGCCGCTGGA AGCCGCCGCC GTGGCGGTCG ACACCCACAA CGCCGCGGCG CGCTGGAACA AGGCGCTGCT CGACAATTCG GCGCGGCCCT CCGGCGCGCT GGTGTATTCC GGGCCGGAGG GCGCGCTGCT GAGCGACGCG CAGTTCGATC GGCTGAAGCG CGAATTGGAG ACCACCTATG AGGGCGCCGC CAATGCCGGC CGGCCGCTGC TGCTCGAAGG CGGGCTGGAC TGGAAGGCGA TGGCGCTGAC GCCGAAGGAT ATGGACTTTC TCGAGGCCAA GCACGCCGCG GCGCGCGAGA TCGCGCTCGC TTTCGGCGTG CCGCCGATGC TGCTCGGCAT TCCCGGCGAC AACACCTACG CGAACTATCA GGAAGCCAAC CGCTGCTTCT TCCGCCAGAG CGTGCTGCCG CTGGCGACCC GCGTCGGCAA TGCGCTGGCG CAGTGGCTCG CGCCGCAATT CGGCGACGGC GTGCGGCTGG TGATCGACAC CGACCGGATC GACGCGCTGT CCGCCGACCG CGCCGCGCTG TGGGAGCGCG TCAGCAGCGC GCCGTTCCTG ACGCTCAACG AAAAACGCGA AGCGGTCGGC TACGCCCCGA TCGCGGGCGG CGACCGGCTG GGGTGA
|
Protein sequence | MLDRLKAFLA PPEAKASRTA QLLAFQGGGQ PRWTLRDYAA LAREGYLSNA IVHRSVRLIA ENAAACTFLV FDGAQEKDAH PLAQLIARPN PRQDGAALFE TLYAHLLLAG NAYVEAVALG DSVHELYALR PDRIKVAPGP DGWAEAYDYS VGGRSVRFDQ HAPGVPPILH LTFFHPLDDH YGLAPLEAAA VAVDTHNAAA RWNKALLDNS ARPSGALVYS GPEGALLSDA QFDRLKRELE TTYEGAANAG RPLLLEGGLD WKAMALTPKD MDFLEAKHAA AREIALAFGV PPMLLGIPGD NTYANYQEAN RCFFRQSVLP LATRVGNALA QWLAPQFGDG VRLVIDTDRI DALSADRAAL WERVSSAPFL TLNEKREAVG YAPIAGGDRL G
|
| |