Gene RPD_1966 details


Gene Information

Locus tag: RPD_1966
Symbol:
ID: 4022448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2205112
End bp: 2206287
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 70%
IMG OID: 637962159
Product: Phage portal protein, HK97
Protein accession: YP_569102
Protein GI: 91976443
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
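
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; both are assumptions, not statements made by the record. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2205112
END_BP = 2206287
GENE_LENGTH_BP = 1176
PROTEIN_LENGTH_AA = 391


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1176
print(expected_protein_length(GENE_LENGTH_BP))  # 391
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.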


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.811617
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGATC GTCTGAAGGC TTTTCTCGCC CCGCCCGAAG CCAAGGCTTC GCGGACCGCG 
CAATTGCTGG CGTTTCAGGG GGGAGGGCAG CCGCGCTGGA CGCTGCGGGA CTACGCGGCG
CTGGCGCGCG AGGGTTATCT GTCGAATGCG ATCGTGCATC GCTCGGTGCG GCTGATCGCC
GAGAACGCGG CGGCTTGCAC CTTCCTGGTG TTCGACGGCG CGCAGGAGAA AGACGCGCAT
CCGCTGGCGC AGCTGATCGC GCGGCCCAAT CCGCGGCAGG ACGGTGCCGC GCTGTTCGAG
ACGCTGTATG CGCATCTGCT GCTCGCCGGA AACGCCTATG TCGAGGCGGT GGCGCTGGGC
GACTCCGTGC ATGAACTCTA TGCGCTGCGG CCGGACCGGA TCAAGGTCGC GCCCGGGCCG
GACGGCTGGG CCGAGGCCTA TGACTACAGC GTCGGCGGCC GCAGCGTGCG GTTCGATCAG
CACGCGCCGG GCGTGCCGCC GATCCTGCAT CTGACGTTCT TCCATCCGCT CGACGATCAC
TACGGCCTCG CGCCGCTGGA AGCCGCCGCC GTGGCGGTCG ACACCCACAA CGCCGCGGCG
CGCTGGAACA AGGCGCTGCT CGACAATTCG GCGCGGCCCT CCGGCGCGCT GGTGTATTCC
GGGCCGGAGG GCGCGCTGCT GAGCGACGCG CAGTTCGATC GGCTGAAGCG CGAATTGGAG
ACCACCTATG AGGGCGCCGC CAATGCCGGC CGGCCGCTGC TGCTCGAAGG CGGGCTGGAC
TGGAAGGCGA TGGCGCTGAC GCCGAAGGAT ATGGACTTTC TCGAGGCCAA GCACGCCGCG
GCGCGCGAGA TCGCGCTCGC TTTCGGCGTG CCGCCGATGC TGCTCGGCAT TCCCGGCGAC
AACACCTACG CGAACTATCA GGAAGCCAAC CGCTGCTTCT TCCGCCAGAG CGTGCTGCCG
CTGGCGACCC GCGTCGGCAA TGCGCTGGCG CAGTGGCTCG CGCCGCAATT CGGCGACGGC
GTGCGGCTGG TGATCGACAC CGACCGGATC GACGCGCTGT CCGCCGACCG CGCCGCGCTG
TGGGAGCGCG TCAGCAGCGC GCCGTTCCTG ACGCTCAACG AAAAACGCGA AGCGGTCGGC
TACGCCCCGA TCGCGGGCGG CGACCGGCTG GGGTGA
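
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of ten bases separated by spaces and line breaks) and that GC content means the fraction of G and C bases; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First ten bases of the gene sequence above.
print(gc_fraction("ATGCTCGATC"))  # 0.5
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.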
 
Protein sequence
MLDRLKAFLA PPEAKASRTA QLLAFQGGGQ PRWTLRDYAA LAREGYLSNA IVHRSVRLIA 
ENAAACTFLV FDGAQEKDAH PLAQLIARPN PRQDGAALFE TLYAHLLLAG NAYVEAVALG
DSVHELYALR PDRIKVAPGP DGWAEAYDYS VGGRSVRFDQ HAPGVPPILH LTFFHPLDDH
YGLAPLEAAA VAVDTHNAAA RWNKALLDNS ARPSGALVYS GPEGALLSDA QFDRLKRELE
TTYEGAANAG RPLLLEGGLD WKAMALTPKD MDFLEAKHAA AREIALAFGV PPMLLGIPGD
NTYANYQEAN RCFFRQSVLP LATRVGNALA QWLAPQFGDG VRLVIDTDRI DALSADRAAL
WERVSSAPFL TLNEKREAVG YAPIAGGDRL G
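
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the tool that produced this record: it assumes the gene sequence is given 5' to 3' in the coding orientation, uses the standard amino-acid assignments (which NCBI translation table 11 shares), ignores the special handling of alternative start codons, and stops at the first stop codon. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGCTCGATC GTCTGAAGGC TTTTCTCGCC"))  # MLDRLKAFLA
```

The first ten residues match the start of the protein sequence listed above; the same call on the full gene sequence allows a complete comparison.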