Gene RSP_1645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_1645 
Symbol 
ID3718502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp238152 
End bp239486 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content73% 
IMG OID640069798 
Productputative phage-related protein 
Protein accessionYP_351691 
Protein GI77462187 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGAC AGAATCTCGA CGACCTGCGC CGCGCCCGGA AGGCCGCGGC CGACACGATG 
GCCGCGGTGG CCGCCCGCAT CGGCGCGCTC GAGGCGGCAG AGACACCGGA CGCCGCCGCG
CTCGAGGCCG AGACCGCGGC CTTTGCCGCC GCCGAAGCCG CCTTCGCCAG GGCCGATGCC
GCCGTGACGC GCGCGGCCGC TGTGGAGGCC GCGCAGGCGG CTGCAGCTCA GGGCGACGGT
GCGGGCGGCG GGAGTGGAAC GGGTGCCGCC GGCACTGACG CCGTGCCGGC GGTGGCCACC
GATCCGGCGC ATCGCGGGGT GGCAGCGGGC TTCATGGTCC AGGCGCTCGC GCGCACGAAG
GGCGACCGGG ACAAGGCCGC CCGTCTCCTC GAAGCCGAGG GCCATGGCGC GATCTCGGCC
GCGCTCTCGG GCGCGAGCGA AGGCGCGGGC GGCGTCACCA TCCCCCGTCC CCAGGCGGCC
GAGCTGATCG AGATGCTGCG CGCCCGGGTC GTCGTGCGCG CCTCGGGCGC CCGCACCCTG
CCGATGCCCG CGGGCGAGAT GCGGCACGCC AAGCAGGTGG GCTCGGCGGT CGCCGCCTAT
GCCGCCGAGA ATGCCGCCAT CGCGCCGAGC CAGCCCAGCT TCGACAAGAT CGACCAGAGC
TTCAAGAAGC TCGTCGGCAT GGTGCCCATC GGCAACTCGC TCCTGCGGCA CTCGGGCGTG
GCGATGGCGC AGCTCGTGCG CGACGATCTC CTGAAGGTCA TGGCGCTCCG CGAGGATCTG
GCCTTCCTGC GCGGCGACGG CAGCGCCGAC ACGCCGAAGG GTCTGCGTCA CTGGATGCTG
CCCGCGAACT GGTCCGCCGC ACCGGTCGCG GCCACGCCGG CGGCGGCCGA GGCGGCGATC
CGGCGGGCGG TCTCGCTCGT GGAGGATGCC GACGTGGGCA TGGTCTCGCC CGGCTGGATC
ATGCGGGCCT CGACGAAGAA CTGGCTCGCG AGCCTGAAGG ACGCGAACGG CAACCCGCTC
TTTCCCTCCA TCGGCGCGTC GGCCCAGCTC ATGGGCTTCC CGATCCGCAC GAGCTCGCAG
ATCCCCGACA ACTTGGGCGC GGGCGGCGAC GAGACCGAGA TCTACTTCGG CGACTTCGAC
GAGGCGATGA TCGGCGACAG CATGGCGCTG GTGGTGGGCT CCTCCACCGA CGCCTCCTTC
GTCGACGGCA ACGGGGCGAC CGTCTCGGCC TTCCAGAACG ACCTCACGCT GATGCGGGCG
ATCTCCGAGC ACGACTTCGC GCCGGCGCAT GACGAGGCCT TTGCCGGCTT CAACGCCTCG
GGCTGGACGC TCTGA
 
Protein sequence
MARQNLDDLR RARKAAADTM AAVAARIGAL EAAETPDAAA LEAETAAFAA AEAAFARADA 
AVTRAAAVEA AQAAAAQGDG AGGGSGTGAA GTDAVPAVAT DPAHRGVAAG FMVQALARTK
GDRDKAARLL EAEGHGAISA ALSGASEGAG GVTIPRPQAA ELIEMLRARV VVRASGARTL
PMPAGEMRHA KQVGSAVAAY AAENAAIAPS QPSFDKIDQS FKKLVGMVPI GNSLLRHSGV
AMAQLVRDDL LKVMALREDL AFLRGDGSAD TPKGLRHWML PANWSAAPVA ATPAAAEAAI
RRAVSLVEDA DVGMVSPGWI MRASTKNWLA SLKDANGNPL FPSIGASAQL MGFPIRTSSQ
IPDNLGAGGD ETEIYFGDFD EAMIGDSMAL VVGSSTDASF VDGNGATVSA FQNDLTLMRA
ISEHDFAPAH DEAFAGFNAS GWTL