Gene Rsph17025_1356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1356 
Symbol 
ID5083127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1389040 
End bp1390221 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID640482913 
ProductHK97 family phage portal protein 
Protein accessionYP_001167558 
Protein GI146277399 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.423386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCT GGCCCTTTTC TCGCAAGTCG CTCGCCGCGC CGTCCGATGA TCTGCTCGGG 
ATCTTCGGGG CACTTCCGAC CGCTGCCGGG GTGCCCCTGT CCGTCACCGA CGCGCTGAAG
GTCCCCGCCG TGGCGTCGGC GATCCGCATC ATCAGCGAGG CCGCTGCCAG CCTTGATGTG
AAGGTTGTCC AGGTGGCGGG GGACGGGGCC GAGACGAACG TGCCTGGGCA TGCCGTGGGC
GCCCTCCTCT CGTCCGAGGC GAACGACTGG ACGACCGGCT TCGAGTTCAT CCGCGACCTG
GTGATTGACG CTCTCACCTG CGACGTGGGC GGTCTGGCTT GGGTGAACCG GGTGGGCGGC
AAGCCCATCG AAGTCATCCA CTACCGGCGC GGTGTGATGG CGGTCGAGTT CGACCAGGCA
ACCGGCGAAC CCCGGTACAC GCTGAACAGC GGACCCGTGG CCTCGGCCGA GGTCATCCAC
CTGCGTTCCC CCTTCGATCG CTGTCCCCTG ACCCTCGCTC GTGAGGCCAT TGGTGTGGCG
GCCGTCATGG AGCGGCACGC CGCCCGCCTC TTCGGCCGCG GTGCCAGACC ATCCGGCGCC
CTGGTGTTCC CGAAGGGCAT GGGTGAAGAG TCGGTGAAGA AGGCCCGCTC GGCGTGGCGG
CAGACGCACG AAGGCGATGA CGCCGGGGGC CGCACGGCGA TCCTCTACGA TGGCGCCGAC
TTCAAGCCCT TCACCCTGGC GAGCACCGAC GCACAGTTCC TCGAGAACCG GATCTTCCAG
ATCCTCGAAA TCGCCCGCGC CTTCAGGGTC CCTCCCTCGA TGCTGTTCGA GCTGAACCGC
GCGACCTGGT CGAACACGGA ACAGATGGGG CGCGAGTTCC TGGTGTACTG CCTGGAGCCG
TGGCTCAAGT CACTTGAGGG GGCACTGGGT CGCGGGCTTC TGACGCAGGA AGAGCGCCGC
TCCGGTCTCG CCGTCCGGTT CGACCGGGAC GACCTGACCC GAGCTGACCT TCAGACGCGG
GCGACCACGA TCAATTCGCT CATTGCTTCC CTGGTGATCA ATCCGAACGA GGGCCGCAGC
TGGCTCGGCC TGCCTCCGCG GGAGGGAGGC GACATGTTCC AGAACCCGAA CATCACCACC
GCGGCCGGGG CGCCGAAGGA GGACACCGCC AATGCTGAAT GA
 
Protein sequence
MKLWPFSRKS LAAPSDDLLG IFGALPTAAG VPLSVTDALK VPAVASAIRI ISEAAASLDV 
KVVQVAGDGA ETNVPGHAVG ALLSSEANDW TTGFEFIRDL VIDALTCDVG GLAWVNRVGG
KPIEVIHYRR GVMAVEFDQA TGEPRYTLNS GPVASAEVIH LRSPFDRCPL TLAREAIGVA
AVMERHAARL FGRGARPSGA LVFPKGMGEE SVKKARSAWR QTHEGDDAGG RTAILYDGAD
FKPFTLASTD AQFLENRIFQ ILEIARAFRV PPSMLFELNR ATWSNTEQMG REFLVYCLEP
WLKSLEGALG RGLLTQEERR SGLAVRFDRD DLTRADLQTR ATTINSLIAS LVINPNEGRS
WLGLPPREGG DMFQNPNITT AAGAPKEDTA NAE