Gene RSP_2067 details


Gene Information

Locus tag: RSP_2067
Symbol:
ID: 3719457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 660237
End bp: 661538
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 68%
IMG OID: 640070231
Product: prophage LambdaSo, HK97 family major capsid protein
Protein accession: YP_352119
Protein GI: 77462615
COG category:
COG ID:
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
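
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(660237, 661538)
print(length, protein_length_aa(length))  # 1302 433
```

Both values match the Gene Length and Protein Length fields of the record.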


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.023752
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACATC TGAACATGCC GCTGGTGGCG TCCGCGCTCC TCGCGGCGAC CCAGCCCCAT 
GCCGTGCTCT GTGCCCCCCG GGCAGAGGGT GGCGCGGGCA ACCTCGAAGC CCTGCTGAAG
GAGGTCAAGC AGGAGCTCGA CCGCATCGGC AATGACGTCC GCAAGACGGC CGACACCGCC
TTCCAGGAGG CGAAGAACGC GGGCAAGCTC TCGGACGAGA CCAAGGTCAA GGCCGACAGT
CTGCTGACCG CGCAGAACGC CCTGCAGGAT TCGGTCGCCA AGCTGCAGCA GCGGCTGGAG
GACATGGATG CGCGCAACCT CGACATCGAG CAGCGCATGT CCGGTCGCCG GGGCGGGGGC
ACTGCGCGCC AGACCCTCGG GCAGGCAATC TCGATGGACG CCCAGGTCAA GGCCTTCAAC
GGCAAGGGCA CCATCACTCT CATCGTGCAG AACGCGATTA CCTCGGGTTC GGCCTCGGCC
GGCCCGCTGA TCGCGCCCCA GCGCGAGACC GAGATCGTGG GTCTCCCCCG CCGGCAGGTG
TTCGTCCGCG ATCTTCTGAG CCGGTCCACC ACCAACTCGA ACCTCGTCCA GTATGCCCGG
ATGAAGGCCC GCACCAATGC CGCCGGCGTC GTGGCGGAAG GCGCGCTGAA GCCCGAGAGC
GGGCTGGAAT ACGAGGCCGC TGACGCTCCG GTGCGAACCA TCGCGCACTG GATCCCGGTT
TCGCGGCAGG CTCTGGAAGA TGCCGACCAG CTGCAGGGCG AGATCGACGG CGAGCTTCGC
TACGGTCTCG ACCTGACCGA GGAGGCGGAG ATCCTCTCGG GCGACGGCGA GGGTCAGCAC
CTGTCGGGCC TGATCACCAA CGCCAGCGCC TATTCCGGCG TCTACGAGCC CGCGGGCGCC
ACGGCGATCG ACAAGCTGCG CTTCGCGCTG CTGGAGGCGA GCCTCGCTCT CTATCCGGCG
GACGGGATGG TGCTCAACGA GATCGACTGG GCGCTGATCG AGACGGCCAA GGATTCCGAG
AACCGCTACA TCTTTGCGAA CCCGCTGCAG CTGGCCGGTC CCGTGCTCTG GGGCCGCCCC
GTGGTGCCGA CGACCGAGAT CGACGAGGAC AAGTTCCTCG TGGGGGCCTT CCGCGCGGCC
GCCACGATCT ACGACCGCAT GGACACCGAG GTGCTGATCT CGTCCGAGGA CCGGGACAAC
TTCGTGAAGA ACATGCTGAC CGTGCGGGCC GAGAAGCGGC TGGCGCTGGC CATCAAGCGT
GCGGCCGCGC TGATCTACGG CGACTTCGGC CGCGTCGCCT GA
 
Protein sequence
MKHLNMPLVA SALLAATQPH AVLCAPRAEG GAGNLEALLK EVKQELDRIG NDVRKTADTA 
FQEAKNAGKL SDETKVKADS LLTAQNALQD SVAKLQQRLE DMDARNLDIE QRMSGRRGGG
TARQTLGQAI SMDAQVKAFN GKGTITLIVQ NAITSGSASA GPLIAPQRET EIVGLPRRQV
FVRDLLSRST TNSNLVQYAR MKARTNAAGV VAEGALKPES GLEYEAADAP VRTIAHWIPV
SRQALEDADQ LQGEIDGELR YGLDLTEEAE ILSGDGEGQH LSGLITNASA YSGVYEPAGA
TAIDKLRFAL LEASLALYPA DGMVLNEIDW ALIETAKDSE NRYIFANPLQ LAGPVLWGRP
VVPTTEIDED KFLVGAFRAA ATIYDRMDTE VLISSEDRDN FVKNMLTVRA EKRLALAIKR
AAALIYGDFG RVA
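
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch, not the database's own tooling, of how the GC content and the protein sequence could be recomputed from the gene sequence. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled, which is assumed to be acceptable here because this gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAACATC TGAACATGCC GCTGGTGGCG"))  # MKHLNMPLVA
```

Passing the full gene sequence to `gc_content` and `translate` should reproduce the GC content field (to rounding) and the protein sequence listed above.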