Gene Rsph17029_0277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0277 
Symbol 
ID4896429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp300012 
End bp301169 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content68% 
IMG OID640110860 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001042167 
Protein GI126461053 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCACA TGACGAAGGA CCAGCTGCTC GGCAGCACCA AGCTCGACCG GAAGGGCGAG 
GAGGACGATC CGGCCGGGAT CGTCACGAAG GCGCTCGAGG ATCTGTCGAA GACGCTCAAT
GAGCGGATCG GCGACCTCGA GAAGAAAGCC GACACGTCGC CGCTCGTCGC GCGGCTCGAC
AAGCTCGAGG CGAAGGTGAA CCGCCCCGGC ACGACCGAGC CGAAGCCCGA GGCTGAGGTC
GAGCGCAAGG CGTTCGGCGC CTATCTGCGC TCCGGCCCCG CGGCCCCGGC CGAGGAGCTG
AAGGCGCTGA CAGTCTCCAG CGATCCGCAG GGCGGCTATC TGGCGCCCGC CGAAATGTCG
ACCGAGTTCA TCCGCGACCT GGTCGAGTTC TCGCCCGTCC GCGGCGTGGC GGCGATCCGC
GGCACGGCCG CGCCCTCGGT GATCTACCCG ACCCGTACCG GCATCACGAA TGCGAAGTGG
AAGGGCGAGA CGCAGGCGCA GGAGGCCTCC GAGCCGGGCT TCGGTCAGGC CGAGGTCGTG
GTGAAGGAGG TCAACACCTA CGTCGATATC TCGAACCAGC TCCTCGCGGA CAGCGCCGGG
CAGGCCGAGG CCGAGGTTCG CCTCGCGCTC GCCGAGGACT TCGGCCAGAA GGAGGGCCTC
GCCTTCGTGT CCGGTGACGG CGTGCTCGCG CCGGAAGGCT TCATGAACGC GGCCGGCATC
TCCTACACCG CCAACGGCCA CGCGACCGAT CTCAAGGCCG ACGCGCTCAT CACCATGCTC
TATGCGATCC CGGCGACCCA CCGGAACCGC GGCGCGTGGG CCATGAACGG CACCACGCTC
GGCGTCCTGC GGAAGCTGAA GGACGGACAG GGCAACTTCC TGTGGCAGCC GTCCTATCAG
GCGGGCCAGC CCGAGACGAT CCTCGGCCGC CCGGTGGTCG AGATGGTGGA CATGCCCGAC
CTCGAATCCG GCTCGTTCCC CATCGCCTAT GCGGACTGGT CGGGCTACCG GATCGTGGAC
CGCACGAGCC TGAGCATCCT GGTCAACCCC TACATCAAGG CGACCGAGGG CCTGACCCGC
ATCCATGCGA CCCGCCGTGT CGGCGGCCGC GTCCTGCAGC CTGCGAAGTT CCGCAAGCTG
AAGATGGCCA CCTCGTAA
 
Protein sequence
MRHMTKDQLL GSTKLDRKGE EDDPAGIVTK ALEDLSKTLN ERIGDLEKKA DTSPLVARLD 
KLEAKVNRPG TTEPKPEAEV ERKAFGAYLR SGPAAPAEEL KALTVSSDPQ GGYLAPAEMS
TEFIRDLVEF SPVRGVAAIR GTAAPSVIYP TRTGITNAKW KGETQAQEAS EPGFGQAEVV
VKEVNTYVDI SNQLLADSAG QAEAEVRLAL AEDFGQKEGL AFVSGDGVLA PEGFMNAAGI
SYTANGHATD LKADALITML YAIPATHRNR GAWAMNGTTL GVLRKLKDGQ GNFLWQPSYQ
AGQPETILGR PVVEMVDMPD LESGSFPIAY ADWSGYRIVD RTSLSILVNP YIKATEGLTR
IHATRRVGGR VLQPAKFRKL KMATS