Gene Rsph17029_0274 details


Gene Information

Locus tag: Rsph17029_0274
Symbol:
ID: 4896380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 297943
End bp: 299121
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 640110857
Product: HK97 family phage portal protein
Protein accession: YP_001042164
Protein GI: 126461050
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
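
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 297943
end_bp = 299121


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the reported gene length (1179 bp) and protein length (392 aa).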


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCT GGCCCCTCAC CCGCAAGTCG CTCGCCACCC CGTCCGAGGA TCTCTCGGCC 
ATCTTCGGGG TGACGCCGAC CATCTCCGGC GCCTCGGTCA CGCCGCTTGA GGCGCTGAAG
GTGCCTGCGG TCTCGGCGGC CGTGCGCACC ATCTCGGAGG CCGCCGCCAC GCTCGACGTG
AAGGTGGTCG AGATCGCCGC GAATGGCCGC GAGACCGACA CGCCGGCGCA CCCGATCCTG
CCCCTCCTGC GCGACCGGGC GAACGACTGG ACCTCCGGCT CCGAGCTGAT CCGGGATCTC
GTGATCGACG CCCTCCTGAC CGATATCGGC GCGCTGGCGT GGGTGAACCG CATCGACGGC
CGCCCTGTCG AGGTCATCCA CTACCGGCGC GGGGTGATGG CGGTCGAGTT CGACCAGGCC
ACGGGCGAAC CCCGCTACAG CCTGAACAGC ACCCCCCTGC GCTCCTCCGA CGTGATTCAC
CTGCGCGAGC CCTTCGGCCG CTGCCCGGTG ACGCTGGCGC GCGAGGCCAT CGCCGCTGCC
ATCGTGATGG AGCGCCACGC GGCTCGCCTC TTCGGCCGGG GCGCCCGCCC CTCCGGGGTC
CTATCCTTCC CGAAGGGCAT GGGCGACGAG GCAGTGAAGA AGGCGCGGAT CGCGTGGCGC
TCGACGCACG AGGGGCAGGA TGCGGGCGGC GCCACGGCGA TCCTCTACGA CGGCGCGACG
TTCCAGCCGC TCACCCTCGC CAGCACGGAT GCGCAGTTCC TCGAGAACCG CAAGTTCCAG
ATCACCGAGA TTGCGCGCGC CTTCAACATC CCGGCGCCGA TGATCGGCGA CCTCGAGCGC
GCGACCTGGG GCAACGCCGA GCAGAAGGCG AAGGAGTTCC TGAGCTACTG CCTCGAGCCG
CGCCTCAAGG CGCTGGAGGG CGCCCTCGGC CGGGCGCTTC TGACGGAGGA GGAGCGCGGG
CGCTTCGCCA TCCGCTTCGA CCGCGACGAC ATCAGCCGCG CCGATCTCGC GACCCGCTCC
ACCACGATCA ACTCGCTCAT CACAAGTCAG GTGCTGAACC CGAACGAGGG CCGCGCCTGG
CTGGGCATGG AGCCGCGTCA GGGCGGCGAC GAATTCCGCA ATCCGAACAT CACGGCCGCC
TCCGAGCCGC CGCAACAGGA GCCGCCGAAT GCTGAATGA
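
The GC content reported in the Gene Information fields is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows. It assumes that the spaces and line breaks in the listing are layout only and that any character other than A, C, G, or T should be ignored. `gc_content` is a hypothetical helper name, not part of any tool referenced by this record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C among the A/C/G/T bases of a sequence.

    Whitespace and any other non-ACGT characters are skipped (assumption).
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# Illustration on the first row of the gene sequence only.
first_row = "ATGAAACTCT GGCCCCTCAC CCGCAAGTCG CTCGCCACCC CGTCCGAGGA TCTCTCGGCC"
print(f"GC fraction of the first row: {gc_content(first_row):.2f}")
```

Passing the full gene sequence to the same function gives the value to compare against the reported GC content.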
 
Protein sequence
MKLWPLTRKS LATPSEDLSA IFGVTPTISG ASVTPLEALK VPAVSAAVRT ISEAAATLDV 
KVVEIAANGR ETDTPAHPIL PLLRDRANDW TSGSELIRDL VIDALLTDIG ALAWVNRIDG
RPVEVIHYRR GVMAVEFDQA TGEPRYSLNS TPLRSSDVIH LREPFGRCPV TLAREAIAAA
IVMERHAARL FGRGARPSGV LSFPKGMGDE AVKKARIAWR STHEGQDAGG ATAILYDGAT
FQPLTLASTD AQFLENRKFQ ITEIARAFNI PAPMIGDLER ATWGNAEQKA KEFLSYCLEP
RLKALEGALG RALLTEEERG RFAIRFDRDD ISRADLATRS TTINSLITSQ VLNPNEGRAW
LGMEPRQGGD EFRNPNITAA SEPPQQEPPN AE
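
The protein sequence corresponds to the gene sequence translated codon by codon with translation table 11. The sketch below builds the codon table from the standard 64-letter amino-acid string, which table 11 shares with the standard code for internal codons. It is an illustration only: it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and `translate` is a hypothetical helper.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence; prints MKLWPLTRKS, which matches
# the first ten residues of the protein sequence.
print(translate("ATGAAACTCT GGCCCCTCAC CCGCAAGTCG"))
```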