Gene Rsph17029_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4038 
Symbol 
ID4898977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1185206 
End bp1186561 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content69% 
IMG OID640114641 
Producthypothetical protein 
Protein accessionYP_001045888 
Protein GI126464775 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCCT CCCGTCCCGA CCCCCGCGCC CTGCCGGTCC CGAGTGGCCG AATTTCCCGT 
CTTGCGCGAT TTGGCAGCCT CGCCTCGGGC GTAGCCGGCA ACGTGGCGTT GCAGGGCGCG
CGCCAGCTCG CCCAGGGCAG GCGGCCCACG ATGAGCGAGC TGATCCTGAC TCCGGCCAAT
GTCGCGCGGC TGGCGGAAGA GCTCGCCCGG ATGCGCGGCG CGGCCATGAA GATGGGTCAG
CTCCTGTCGA TGGATGCCGG CGAGATGCTG CCGCCCGAGC TTGCCGCAAT TCTGGCGCGG
CTCCGGGCGG ATGCGCATTA CATGCCGCCC CAGCAACTGC GCAGCGTCCT GACCGCAGCC
TGGGGACCCG ACTGGCAGCG CCGCTTCCGC AGCTTCAACG TCCGCCCGAT TGCCGCCGCC
TCCATCGGGC AGGTCCATCG CGCGATGACG AAGGACGGCG AGGACCTCGC CATCAAGGTG
CAATATCCCG GCGTGCGGCG CTCGATCGAC AGCGACGTGG ACAATGTGGC CGCCCTGCTG
CGCCTGTCGG GTCTGGTGCC GAAGGGGCTC GACGTGGCGG CGATGCTGGC CGAGGCCAAG
CGGCAACTCC ACGAGGAGGC CGATTACGGG CGGGAGGGGC GCTGTCTCGC GCGGTTCGGC
GCCCTGCTGG CGGAATCGCC TGACTTCTGC GTGCCCCGGC TGCACGAGGC GCTGACCACG
CCCGATGTGC TGGCGATGAG CCATGTGGCG GGGCAACCGG TCGAGGATCT GGCGGAGGCG
CCGCAGGACC TGCGCGACCG CGTGATGACG CTTCTGATCG GCCTCATGTT CCGCGAGCTG
TTCGACTTCG GCCTGATGCA GACCGACCCG AACTTCGCCA ACTACCGCCA CGATGCCGCG
ACCGGGCAGG TGGTTCTGCT CGACTTCGGC GCCACCCGCG ATATCGACCC CGGCATGGCA
GACGGTTACC GGCGGCTCCT GCGCGCGGGC CTCGCGGGCG ACCTGCCGGC ATCGGAGGCG
GAGATGCGGG CGCTCGGCTT CCTGTCGGAT GCGGTGCCGC CGGACCTCCG CGCCCTGATG
ATGCGGATGT TCGAGATCTC GATGGAACCT CTGCGGGCGC CGATCTTCGA CTTCGGCGCG
AACGACATGG CGCTGCGGTT GCGCGACCTC GGGATGGAGA TCGGAGAGCG GCGCGAGATC
CATCATCTGC CGCCGGTCGG GACCTTCTAT ATCCAGCGCA AGTTCGGCGG CATGTATCTG
CTGGCCAGCC GCCTTCGCGC CCGGGTCGCC ATCCGCGATC TGATTTATCC TCATATAAAC
GATGGGTTGA ACGAAGTAAC TTCAGGCAGC GCGTGA
 
Protein sequence
MSSSRPDPRA LPVPSGRISR LARFGSLASG VAGNVALQGA RQLAQGRRPT MSELILTPAN 
VARLAEELAR MRGAAMKMGQ LLSMDAGEML PPELAAILAR LRADAHYMPP QQLRSVLTAA
WGPDWQRRFR SFNVRPIAAA SIGQVHRAMT KDGEDLAIKV QYPGVRRSID SDVDNVAALL
RLSGLVPKGL DVAAMLAEAK RQLHEEADYG REGRCLARFG ALLAESPDFC VPRLHEALTT
PDVLAMSHVA GQPVEDLAEA PQDLRDRVMT LLIGLMFREL FDFGLMQTDP NFANYRHDAA
TGQVVLLDFG ATRDIDPGMA DGYRRLLRAG LAGDLPASEA EMRALGFLSD AVPPDLRALM
MRMFEISMEP LRAPIFDFGA NDMALRLRDL GMEIGERREI HHLPPVGTFY IQRKFGGMYL
LASRLRARVA IRDLIYPHIN DGLNEVTSGS A