Gene Rsph17025_1359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1359 
Symbol 
ID5083130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1391121 
End bp1392278 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content65% 
IMG OID640482916 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001167561 
Protein GI146277402 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.504084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.308637 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCACA TGAACAAGAG GCAACTGTTT GGCGGCACGA TGCTCGTCGT GAAGGGCGAC 
GATGACGAAC CGGCCGAGCT GGTGACGAAG GCCATTGCCG ACCTCACGAA GACGGTGAAT
GATCGCCTGG ATGCGCTCGA ACAGAAGGCC GATACGACGC AGATCGTGGC CCGGCTCGAC
AAGGTAGAAG CGAAGGTCAA CCGTCCTGGT GGGGCGGATC CGAAGCCCGA GGCGTCGATC
GAGCGCAAGG CCTTCGGTAC CTACCTTCGC GCTGGCAATG CCGCACCTGC CGACGAGCTG
AAGGCACTGA ACGTGTCGAG CGATCCGCAG GGGGGGTATC TCGCGCCGGC CGAGATGAGC
ACCGAGTTCA TCCGCGACCT TGTCGAGTTC TCCCCCGTGC GGGCCGTTGC GAGCGTTCGG
CAGACCGGCT CCCCGAGCAT CATCTATCCC GCGCGAACCG GCATCACGAA CGCACGATGG
AAAGGGGAGG CTCAGGCGCA GGAAGGGTCT GAGCCCGGCT TCGGCCAGGC CGAGGTGGTG
GTCAAGGAGG TCAACACGTT CGTTGACATC TCGAACCAGC TCCTTGCCGA CAGTGCGGGG
CAGGCGGAGG CGGAAGTGCG CATGGCCTTG GCTGAGGACT TCGGCCAGAA GGAAGGAGCC
GCCTTCGTAT CCGGCGACGG CATCCTTGAG CCGGCAGGCT TCATGACCCA TGCAGGCATC
GCCCATACGG TGAGCGGCGC CGCTGCCGGG ATCACGGCCG ACGCCCTGGT GAAGCTGCTC
TATGCGCTTC CCGCAACCTA TCGCGGCCGC GGTGCCTGGG CGATGAACGG CACCACTCTC
GGCGCTGTGC GTCTCCTGAA GGATGGTGAC GGGCGCTTCC TCTGGCAGCC TTCCTATCAG
GCCGGCCAGC CCGAAACGCT CCTTGGGCGT CCTGTTGTGG AGATGGTGGA CATGCCCGAC
GTAGAGGCCG GCGCGTTTCC GATCATCTAC GGCGACTGGT CGGGATACCG AATCGTGGAC
CGCATTGCGC TGAGCGTCCT GGTGAACCCC TACATCCGGG CGACCGAGGG TATCACCCGC
ATCCATGCGA CGCGGCGGGT CGGTGGCCGG GTCTTGCAGG CTGCGAAGTT CCGCAAGCTC
AAGATCGCCG GGGCCTGA
 
Protein sequence
MRHMNKRQLF GGTMLVVKGD DDEPAELVTK AIADLTKTVN DRLDALEQKA DTTQIVARLD 
KVEAKVNRPG GADPKPEASI ERKAFGTYLR AGNAAPADEL KALNVSSDPQ GGYLAPAEMS
TEFIRDLVEF SPVRAVASVR QTGSPSIIYP ARTGITNARW KGEAQAQEGS EPGFGQAEVV
VKEVNTFVDI SNQLLADSAG QAEAEVRMAL AEDFGQKEGA AFVSGDGILE PAGFMTHAGI
AHTVSGAAAG ITADALVKLL YALPATYRGR GAWAMNGTTL GAVRLLKDGD GRFLWQPSYQ
AGQPETLLGR PVVEMVDMPD VEAGAFPIIY GDWSGYRIVD RIALSVLVNP YIRATEGITR
IHATRRVGGR VLQAAKFRKL KIAGA