Gene Rsph17029_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0644 
Symbol 
ID4897021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp658831 
End bp660036 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content68% 
IMG OID640111227 
ProductHK97 family phage portal protein 
Protein accessionYP_001042529 
Protein GI126461415 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGATGT GGCACAAGCT GGTCAACAAG ATGCTGACCG CGCGCGATGG CGATCTCTAC 
GAGGCGGTCG GCGCCGCCGA GACGTGGGCG GGCGAGCCTG TCTCGGCGCA GGGGGCCATG
AACCTCTCGG CGTTCTTCGC CTGCGCGCGG GTGACGGCCG AGACGGTCGC CAGCCTCTCG
CTCGAGGTCA TGGAGCGGAA AGAGGACGGG ACGAAGGTTC GCGTGGCCCA TGGCCATCCG
CTGCAGGAGT TGCTGGGTGG CTCCCCGAAT GCGGACCAGA CGCCCATGGA GTTCTGGGAG
GGTCGAATCC TCGGCCTCTG CACCACCGGC AACGCCTTTG CGGAGAAGGT CTATCAGGGG
AACCGCGTCG TGGCGCTCCT GCCCATGCCC GCGACGACTG CGGTGGAGCG GCGGGGGGAC
GACCTGCTCT ATCGCTTCAA TGACCGCGGG CGGGCGGTCG TCCTGCCAGC CGACAAGGTC
TTCCACGTCA AGGCGTTCGG GGACGGCGAT GTCGGCTTGT CGCCGGTGGA ATATGCGCGC
CAGACGCTCG GGATCGCCAT CGCCTCGGAG CGCGCGGCCG GGCAGGTCTA CTCCCGCGGG
CTGCGGGCGA AGGGCTTCTT CCTCATTCCG GGGGCGCTCA CTCCGGAGCA GCGCGAGGCC
GCCCGGAAGA ACCTGGCTGA TCGCTACTCG GCCAAGGACG CGCCGGGGGT GGGCATCCTT
GAGGGCGGGG TGAAGTTCGA GGGGGTCAAC ATCACGCCGC GCGATGCGGA GTTGATCCTG
AATCGGCGCT TCAACGTCGA GGAGGTCTGC CGCTGGATGG GATGCCCTCC GATCCTCGTG
GGCCACGCCG CGCAGGGCCA GACGATGTGG GGCACGGGCG TCGAAGCCGT CATGCAGCAA
TGGCTGAACC TGTCGCTGCG GGCGCTCCTG AAGCGGATCG AGCAAGCATC GGCAAAGCGG
GTGCTGTCGG TGTCCGAGCG CGGCCGGTTC TCGGCGAAGT TCAATTACGA GGATCTGCTC
CGCAGCAACT CGGCGGCGCG GGCGGCCTAC TACACCTCGC TTCTCAACTG CGGGGTGCTG
ACCATCAACG AGGCCCGGCG GCTTGAGGGC CTGCCTCCCG TCGAGGGCGG CGATGTGCCG
CGAATGCAGA TGCAGAACGT TCCCATTACG GAGGCCGGCG CGGAACCGCC GGGTGAGCAG
CCATGA
 
Protein sequence
MGMWHKLVNK MLTARDGDLY EAVGAAETWA GEPVSAQGAM NLSAFFACAR VTAETVASLS 
LEVMERKEDG TKVRVAHGHP LQELLGGSPN ADQTPMEFWE GRILGLCTTG NAFAEKVYQG
NRVVALLPMP ATTAVERRGD DLLYRFNDRG RAVVLPADKV FHVKAFGDGD VGLSPVEYAR
QTLGIAIASE RAAGQVYSRG LRAKGFFLIP GALTPEQREA ARKNLADRYS AKDAPGVGIL
EGGVKFEGVN ITPRDAELIL NRRFNVEEVC RWMGCPPILV GHAAQGQTMW GTGVEAVMQQ
WLNLSLRALL KRIEQASAKR VLSVSERGRF SAKFNYEDLL RSNSAARAAY YTSLLNCGVL
TINEARRLEG LPPVEGGDVP RMQMQNVPIT EAGAEPPGEQ P