Gene RPB_3489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3489 
Symbol 
ID3911291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3992035 
End bp3993210 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content69% 
IMG OID637885391 
ProductPhage portal protein, HK97 
Protein accessionYP_487095 
Protein GI86750599 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATC GCCTCAAAGC CTTTCTCGCG ACCCCTGAAG TCAAAGCCTC GCGCACCGCG 
AAACTGCTGG CGTTCGAATC CGGAGGCGTT GCACGGTGGA CACCGCGGGA CTACGCGCGG
TTGTCGCGCG AAGGTTATGT CTCCAACGCG GTGGTGCATC GCTGCGTCCG GCTGATCGCC
GAAAACGCGG CGGCCTGCAC GTTTCTGGTG TTCGACGGCG CGCAGGAGAA GGAGGCGCAT
CCGCTGGCGC AACTGATCGC GCGGCCGAAT CCGCGGCAGG ACGGCGCCGC GCTGTTCGAG
ACGCTGGTGG CGCATCTCTT GCTCGCCGGC AACGCCTATG TGGAGGCGGT GGCGCTCGGC
GACGCGGTGC ACGAACTCTA CGCGCTGCGG CCGGACCGGA TGAAGGTGGT GCCTGGGCCG
GACGGCTGGG CCGCGGCCTA CGACTACGTG GTCGGCGGCC GCAGCGTGCG GTTCGATCAG
CACGCGACGC CGGTGCCGCC GATCCTGCAT CTGACGTTCT TTCATCCGCT CGACGATCAT
TATGGTCTGG CACCGCTCGA GGCCGCCGCG GTCGCGGTCG ACACCCACAA CGCCGCGGCG
CGCTGGAACA AGGCTCTGTT GGACAATTCG GCGCGGCCCT CCGGCGCGCT GATGTATGCC
GGGCCGGAAG GCGCGGTGCT CTCCGACGAG CAGTTCGGCC GGCTGAAGCG CGAGCTGGAG
ACGACCTATG AGGGCGCCGC CAATGCCGGC CGGCCGCTGC TGCTCGAAGG CGGGCTCGAC
TGGCGGCCGA TGGCGCTGTC GCCGAAGGAC ATGGACTTCC TCGAAGCCAA ACACGCGTCC
GCGCGAGAAA TCGCGCTCGC CTTCGGCGTG CCGCCGATGC TGCTCGGCAT TCCGGGTGAC
AACACCTTTG CGAACTATCA GGAAGCCAAC CGCAGCTTCG TCCGCCAGAC TGTGCTGCCG
CTGGCGACCC GGATCGGCAA TGCGCTGGCG CAATGGCTGG CGCCGCAATT CGGCGACGGC
GTGCGCCTCG TGATCGACAC CGACCGCATC GACGCGCTGG CGAGCGACCG CGTCGCGCTG
TGGGAACGCG TCAGCGCCGC GCCGTTCCTG ACGCTGAACG AGAAGCGTGA AGCCGTCGGC
TACGCGCCGC TCGACGGCGG CGACCGGCTG GGGTGA
 
Protein sequence
MLDRLKAFLA TPEVKASRTA KLLAFESGGV ARWTPRDYAR LSREGYVSNA VVHRCVRLIA 
ENAAACTFLV FDGAQEKEAH PLAQLIARPN PRQDGAALFE TLVAHLLLAG NAYVEAVALG
DAVHELYALR PDRMKVVPGP DGWAAAYDYV VGGRSVRFDQ HATPVPPILH LTFFHPLDDH
YGLAPLEAAA VAVDTHNAAA RWNKALLDNS ARPSGALMYA GPEGAVLSDE QFGRLKRELE
TTYEGAANAG RPLLLEGGLD WRPMALSPKD MDFLEAKHAS AREIALAFGV PPMLLGIPGD
NTFANYQEAN RSFVRQTVLP LATRIGNALA QWLAPQFGDG VRLVIDTDRI DALASDRVAL
WERVSAAPFL TLNEKREAVG YAPLDGGDRL G