Gene Rsph17025_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1075 
Symbol 
ID5083367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1103575 
End bp1104741 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content68% 
IMG OID640482633 
ProductHK97 family phage portal protein 
Protein accessionYP_001167281 
Protein GI146277122 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.529897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.723665 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTTCG ACTTCCTGAG GAAAAAGGCC GAGCCGCCCG AGCGGAAGGC CTCGGCCACC 
GGCCCGCTGG TGGGCTGGAG CACGGGGCGC GTGGCCTGGA GCCCGCGGGA TACCGTGTCG
CTGACCCGCA ACGGGTTTCT CGGCAACCCG ATCGCCTTCC GTTCGGTCAA GCTGATCTCG
GAGGCGGCGG CGGCGCTTCC GCTTCTGCTG CAGGATCACG AGCGGCGCTA TGACAGCCAC
CCGATCCTGG AGCTGATCGC CCGTCCGAAC CCGCTTCAGG GCCGGGCGGA ACTGCTGGAG
GCGGTCTATG GGCAACTGCT GCTGACCGGC AACGCCTATC TGGAGGCGGT GGCGGGTCTG
TCGCGGCTGC CGGGCGAGTT GCATCTGTTG CGGTCGGACC GGATGAGCCT TGTGCCGGGG
CCGGATGGAT GGCCGGTGGC CTACGATTAT GCGGTGGGGG GGCGTCGCAT CCGGTTTGAC
ATGACCGGGA CGATGCCGAT CTGCCATATC CGCACCTTCC ATCCACAGGA TGACCACTAC
GGCTTTTCGC CGCTTCAGGC GGCGGCGGTG GCGCTGGATG TGCATGTCTC GGCCTCGGCC
TGGTCGAAGG CGCTGCTGGA CAATGCCGCG CGCCCCTCGG GCGCCATCAT CTACAGGGGT
GTGGACGGGC AGGGCGCGCT TTCCGCCGAG CAATATGACC GGCTGGTGAG CGAGATCGAG
GTGAACCATC AGGGAGCGCG CAACGCCGGC CGGCCCATGT TGCTGGAAGG GGGGCTCGAC
TGGAAGCCGA TGGGCTTCTC GCCTTCGGAC ATGGAGTTTC ACACCACCAA GGAGGCGGCC
GCGCGCGAGA TCGCCATTGC CTTCGGCGTG CCGCCGATGC TGCTCGGCAT ACCCGGCGAG
GCGACCTACG CGAATTATCA GGAGGCCCAC CGCGCCTTCT ATCGGCTGAC GGTGCTGCCT
CTGGCGGCAC GGGTCACGGC GGCGATCTCG CACTGGTTGG CCGGCTTCAC CGGAGAGGCG
GTGGAGCTTC GCCCGGATCT CGATCAGGTG CCGGCGCTGG CGGCCGAGCG GGATCAGCAA
TGGGCGCGGG TTTCGGACGC GGGTTTCCTG ACCGAGGCGG AGAAGCGGAT GCTGCTGGGG
CTGCCACGGA TCGCGGAGGA CGAATGA
 
Protein sequence
MLFDFLRKKA EPPERKASAT GPLVGWSTGR VAWSPRDTVS LTRNGFLGNP IAFRSVKLIS 
EAAAALPLLL QDHERRYDSH PILELIARPN PLQGRAELLE AVYGQLLLTG NAYLEAVAGL
SRLPGELHLL RSDRMSLVPG PDGWPVAYDY AVGGRRIRFD MTGTMPICHI RTFHPQDDHY
GFSPLQAAAV ALDVHVSASA WSKALLDNAA RPSGAIIYRG VDGQGALSAE QYDRLVSEIE
VNHQGARNAG RPMLLEGGLD WKPMGFSPSD MEFHTTKEAA AREIAIAFGV PPMLLGIPGE
ATYANYQEAH RAFYRLTVLP LAARVTAAIS HWLAGFTGEA VELRPDLDQV PALAAERDQQ
WARVSDAGFL TEAEKRMLLG LPRIAEDE