Gene RPC_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1801 
Symbol 
ID3972066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1957412 
End bp1958587 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content68% 
IMG OID637924914 
ProductPhage portal protein, HK97 
Protein accessionYP_531679 
Protein GI90423309 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.595196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0740588 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAATC GGCTGAAACA TCTGCTCGCC ACGCCCGAGA TCAAAGCGTC GCGCACCGCG 
AAGCTGCTGG CGTTCGAGAC CGGCGGGCGG GCGCGGTGGA CGCCGCGGGA TTATGCGGGG
CTGGCGCGCG AGGGTTATCT CGGCAATGCC ATCGTGCATC GCTGCGTGCG GCTGATCGCC
GAGAACGCCG CGGCCTGCCG CTATCTGATC TTCGACGGCG CGCAGGAGCG CGACGGCCAT
CCGTTGGCGC AGCTGCTGGC GCGGCCCAAT CCGCGGCAGG ACGGCGCTGC CTTGTTGGAA
ACGCTGGTGG CGCATCTGTT GCTCGCCGGC AATGGCTATC TTGAAGCGGT GACGCTCGAC
GACGCGGTGC GCGAACTCCA CGCGCTGCGG CCGGACCGCA TGAAAGTGGT GCCCGGGCCG
GACGGCTGGG CCGAGGCCTA CGACTATTCT GTCGGTGGCC GCAGCCTGCG GTTCGATCAG
CAAGCCGGCG GGGTGCCGCC GATCCTGCAT CTGACGTTCT TCCATCCGCT CGACGATCAC
TATGGTCTGG CACCGATCGA AGCCGCCGCA GTCGCGGTCG ACACCCACAA CGCCGCGGCG
CGCTGGAACA AGGCGCTGCT CGACAATTCG GCGCGGCCCT CCGGCGCGCT GGTCTATGCC
GCCGCGGAAG GCGCGGTGCT GTCGGATGCG CAATTCGACC GGCTGAAGCG CGAGTTGGAA
GGCACCTATC AGGGCGCACT CAATGCCGGC CGGCCGCTGC TGCTGGAAGG CGGGCTGGAT
TGGAAGCCGA TGTCGCTGTC GCCGAAGGAT ATGGATTTTC TCGAAGCCAA GCACGCCGCT
GCCCGCGAGA TCGCGCTCGC CTTCGGCGTG CCGCCGATGC TGCTTGGCAT TCCGGGCGAC
AACACCTTCG CCAACTACCA GGAAGCCAAC CGCAATTTCT GGCGGCAGAC CGTGCTGCCG
CTGGCCGACC GGATCGGCGC TGCGTTGGCG CAATGGCTGG CGCCGCAATT CGGCGATCAG
TTGCGCGTGG TGATCGACAC CGACCGCATC GAGGCGCTGG CGTCGGATCG CGCCGCGCTG
TGGGAACGGG TCAGCGCCGC CGAGTTCCTG ACGTTGAACG AAAAGCGCGA GGCGGTCGGC
TACGCGCCGA TCGCGGGCGG CGATCGGCTG AGTTAG
 
Protein sequence
MLNRLKHLLA TPEIKASRTA KLLAFETGGR ARWTPRDYAG LAREGYLGNA IVHRCVRLIA 
ENAAACRYLI FDGAQERDGH PLAQLLARPN PRQDGAALLE TLVAHLLLAG NGYLEAVTLD
DAVRELHALR PDRMKVVPGP DGWAEAYDYS VGGRSLRFDQ QAGGVPPILH LTFFHPLDDH
YGLAPIEAAA VAVDTHNAAA RWNKALLDNS ARPSGALVYA AAEGAVLSDA QFDRLKRELE
GTYQGALNAG RPLLLEGGLD WKPMSLSPKD MDFLEAKHAA AREIALAFGV PPMLLGIPGD
NTFANYQEAN RNFWRQTVLP LADRIGAALA QWLAPQFGDQ LRVVIDTDRI EALASDRAAL
WERVSAAEFL TLNEKREAVG YAPIAGGDRL S