Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1801 |
Symbol | |
ID | 3972066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1957412 |
End bp | 1958587 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637924914 |
Product | Phage portal protein, HK97 |
Protein accession | YP_531679 |
Protein GI | 90423309 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.595196 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0740588 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAATC GGCTGAAACA TCTGCTCGCC ACGCCCGAGA TCAAAGCGTC GCGCACCGCG AAGCTGCTGG CGTTCGAGAC CGGCGGGCGG GCGCGGTGGA CGCCGCGGGA TTATGCGGGG CTGGCGCGCG AGGGTTATCT CGGCAATGCC ATCGTGCATC GCTGCGTGCG GCTGATCGCC GAGAACGCCG CGGCCTGCCG CTATCTGATC TTCGACGGCG CGCAGGAGCG CGACGGCCAT CCGTTGGCGC AGCTGCTGGC GCGGCCCAAT CCGCGGCAGG ACGGCGCTGC CTTGTTGGAA ACGCTGGTGG CGCATCTGTT GCTCGCCGGC AATGGCTATC TTGAAGCGGT GACGCTCGAC GACGCGGTGC GCGAACTCCA CGCGCTGCGG CCGGACCGCA TGAAAGTGGT GCCCGGGCCG GACGGCTGGG CCGAGGCCTA CGACTATTCT GTCGGTGGCC GCAGCCTGCG GTTCGATCAG CAAGCCGGCG GGGTGCCGCC GATCCTGCAT CTGACGTTCT TCCATCCGCT CGACGATCAC TATGGTCTGG CACCGATCGA AGCCGCCGCA GTCGCGGTCG ACACCCACAA CGCCGCGGCG CGCTGGAACA AGGCGCTGCT CGACAATTCG GCGCGGCCCT CCGGCGCGCT GGTCTATGCC GCCGCGGAAG GCGCGGTGCT GTCGGATGCG CAATTCGACC GGCTGAAGCG CGAGTTGGAA GGCACCTATC AGGGCGCACT CAATGCCGGC CGGCCGCTGC TGCTGGAAGG CGGGCTGGAT TGGAAGCCGA TGTCGCTGTC GCCGAAGGAT ATGGATTTTC TCGAAGCCAA GCACGCCGCT GCCCGCGAGA TCGCGCTCGC CTTCGGCGTG CCGCCGATGC TGCTTGGCAT TCCGGGCGAC AACACCTTCG CCAACTACCA GGAAGCCAAC CGCAATTTCT GGCGGCAGAC CGTGCTGCCG CTGGCCGACC GGATCGGCGC TGCGTTGGCG CAATGGCTGG CGCCGCAATT CGGCGATCAG TTGCGCGTGG TGATCGACAC CGACCGCATC GAGGCGCTGG CGTCGGATCG CGCCGCGCTG TGGGAACGGG TCAGCGCCGC CGAGTTCCTG ACGTTGAACG AAAAGCGCGA GGCGGTCGGC TACGCGCCGA TCGCGGGCGG CGATCGGCTG AGTTAG
|
Protein sequence | MLNRLKHLLA TPEIKASRTA KLLAFETGGR ARWTPRDYAG LAREGYLGNA IVHRCVRLIA ENAAACRYLI FDGAQERDGH PLAQLLARPN PRQDGAALLE TLVAHLLLAG NGYLEAVTLD DAVRELHALR PDRMKVVPGP DGWAEAYDYS VGGRSLRFDQ QAGGVPPILH LTFFHPLDDH YGLAPIEAAA VAVDTHNAAA RWNKALLDNS ARPSGALVYA AAEGAVLSDA QFDRLKRELE GTYQGALNAG RPLLLEGGLD WKPMSLSPKD MDFLEAKHAA AREIALAFGV PPMLLGIPGD NTFANYQEAN RNFWRQTVLP LADRIGAALA QWLAPQFGDQ LRVVIDTDRI EALASDRAAL WERVSAAEFL TLNEKREAVG YAPIAGGDRL S
|
| |