Gene Rsph17029_3318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3318 
Symbol 
ID4898538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp376476 
End bp377777 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content68% 
IMG OID640113917 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001045186 
Protein GI126464073 
COG category 
COG ID 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0235436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATC TGAACATGCC GCTGGTGGCG TCCGCGCTCC TCGCGGCGAC CCAGCCCCAT 
GCCGTGCTCT GTGCCCCCCG GGCAGAGGGT GGCGCGGGCA ACCTCGAAGC CCTGCTGAAG
GAGGTCAAGC AGGAGCTCGA CCGCATCGGC AATGACGTCC GCAAGACGGC CGACACCGCC
TTCCAGGAGG CGAAGAACGC GGGCAAGCTC TCGGACGAGA CGAAGGCCAA GGCCGACAGT
CTGCTGACGG CGCAGAACGC CCTGCAGGAT TCGGTCGCCA AGCTGCAGCA GCGGCTGGAG
GACATGGACG CGCGCAACCT CGACATCGAG CAGCGCATGT CCGGTCGCCG GGGCGGGGGC
ACCGCGCGCC AGACCCTCGG GCAGGCGATC TCGATGGACG CCCAGGTGAA GGCCTTCAAC
GGCAAGGGCA CCATCACTCT CATCGTGCAG AACGCGATCA CCTCGGGTTC GGCCTCGGCC
GGCCCGCTGA TCGCGCCCCA GCGCGAAACC GAGATCGTGG GTCTCCCGCG CCGGCAGGTG
TTCGTCCGTG ACCTTCTGAG CCGGTCCACC ACCAACTCGA ACCTCGTGCA GTATGCCCGC
ATGAAGGCCC GCACCAATGC CGCCGGCGTC GTGGCGGAAG GCGCGCTGAA GCCCGAGAGC
GGGCTGGAGT ATGAGGCCGC TGATGCTCCG GTGCGCACCA TCGCGCACTG GATCCCGGTC
TCGCGGCAGG CGCTGGAAGA TGCCGACCAG CTGCAGGGCG AGATCGACGG CGAGCTTCGC
TACGGTCTCG ACCTGACCGA GGAGGCGGAG ATCCTCTCGG GCGACGGCGA GGGTCAGCAC
CTGTCGGGCC TGATCACCAA CGCCAGCGCC TATTCCGGCG CCTACGAGCC TGCCGGTGCC
ACGGCGATCG ACAAGCTGCG CTTCGCGCTG CTGGAGGCGA GCCTTGCTCT CTATCCGGCG
GATGGGATGG TGCTCAACGA GATCGACTGG GCGCTGATCG AGACGGCCAA GGATTCCGAG
AACCGCTACA TCTTCGCGAA CCCCCTGCAG CTGGCCGGCC CCGTGCTCTG GGGCCGCCCC
GTCGTGCCGA CGACCGAGAT CGACGAGGAC AAGTTCCTGG TGGGCGCATT CCGTGCGGCC
GCCACGATCT ACGACCGCAT GGACACCGAG GTGCTGATCT CGTCCGAGGA CCGGGACAAC
TTCGTGAAGA ACATGCTGAC CGTGCGGGCC GAGAAGCGGC TGGCGCTGGC CATCAAGCGC
GCGGCCGCGC TGATCTACGG CGACTTCGGC CGCGTCGCCT GA
 
Protein sequence
MKHLNMPLVA SALLAATQPH AVLCAPRAEG GAGNLEALLK EVKQELDRIG NDVRKTADTA 
FQEAKNAGKL SDETKAKADS LLTAQNALQD SVAKLQQRLE DMDARNLDIE QRMSGRRGGG
TARQTLGQAI SMDAQVKAFN GKGTITLIVQ NAITSGSASA GPLIAPQRET EIVGLPRRQV
FVRDLLSRST TNSNLVQYAR MKARTNAAGV VAEGALKPES GLEYEAADAP VRTIAHWIPV
SRQALEDADQ LQGEIDGELR YGLDLTEEAE ILSGDGEGQH LSGLITNASA YSGAYEPAGA
TAIDKLRFAL LEASLALYPA DGMVLNEIDW ALIETAKDSE NRYIFANPLQ LAGPVLWGRP
VVPTTEIDED KFLVGAFRAA ATIYDRMDTE VLISSEDRDN FVKNMLTVRA EKRLALAIKR
AAALIYGDFG RVA