Gene Rsph17025_0443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0443 
Symbol 
ID5082242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp437598 
End bp440435 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content70% 
IMG OID640481996 
Producthypothetical protein 
Protein accessionYP_001166654 
Protein GI146276495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.616672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0723383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACG AGCTGCCCCC GCCGGAAACG GGCCTGCCCG TCCCGCCTGA CCTGGACGCG 
GACAGCTGGG ACGTGATGGG TGCCGGATGG AAGGCCGAGA CGATCCGGAC CGACGCCTGG
TCCTACAGCC AGAAGAAGCG CCAGGCGCTG GCCTCGGAGA TCTACGACCG GCTGACGCCC
GACGCGAAAC AGCGCATCGC CGACCAGCGC TGGGACTATG AGAACAACTG GACGGACTTC
GAGGACATGG TGCTGGGCGA GGCGGCAACC GCCGCGCAGA CCGCGCCCGA GAGCTTCGCG
GGGCTTCCCC TTTCGCGGGA TCTCTTCGAC CAGCGGATCG ACGCGGAGCG CCGCGCGGAA
ATGGACGAGG CGCAGGCAAT CCTCGACCAG CCGGGCGGCC TCATCGCCGA GTTCGTCGGC
GCCGGAGCTC GCGCCATGAC CGACCAGACC AGCCTGATGA TGGCGCCCTT CGGCGTGACC
GGGTCGGCAT GGAAGACGAT CCTGGGCGAA GCGGTCATGG GAGGCCTCGG CGAGGCCGCC
GTGCTGCCCC GCGAATACCA GGTGGCGAAG GAGCTGGGCC TGCCGGAGCC CGACCCCCTC
ATGCGGATCG GCGCGGGTGC CGTTCTGGGC GGCGGCCTCG CCGCCGGCGT CATCGGCATC
GGCCGCGGGA TCAGCCATCT GCGCAGCCGT CGCGCCGGCA TCGACACGGC TCGAGCCTTG
GGCGCCGACG ATCTGGACGC CACGGTCGCG ATTGAGGAGG CGGAGGCTGC ACTGCGCGGC
GACCGCACCG TCCAGGAGGT GGTGAAGCCG GCCGCGACGA CGCCGGAGCC CGGCACGCTG
GGGGACGTCC TGGCTGCACC TGGCGGGTTG CCACCCATTG CGCCCGATGC GCCCGAAGGC
TGGGGGCAGA TCCGCAACGG GATATTCGCG GGCGAGAGCA GCGGTGACTA TGACGCACTC
TTCGGCTTCT CGAACAGGAA GGGTGGCGAG TTCTTCCGGG TGCGTCTGAC CCAGATGACC
GTTGACCAGG CGATCGCCTT TTCCGATCCG CGCGGCCGCT ATGCGCAATG GGTGAAGAGC
AAGATCGGCC GCGTGGCCAC GCCCATGGGC GCCTATCAGA TTGTCGGCTC AACCCTGCGC
GACGCCAAGC GCGCCCTTGG CCTGCAGGGC GACGAGCTGA TGACGAAGGC CCTGCAGGAG
CGTCTCGGCC AGTGGATCTA CCGCACCCAA GGCACTGGTG CTTGGGTGGG CTACCGCGGC
CCGCGGGAGA GCTACACGCC CGACGTGGGC GGCGATGCGC CGAGCTTCGC CACCTCGCGC
GGATATACCG GCAGCGGTCA AGTGACCGCC GGCGACGCCT TCCGGATCGA CGTGGGCTAC
GAGGTGGTGG AACTGTCCAG CCTGAGCCGC GCCACCGGCG GCCTGCAGCC GCGGGACCGG
AGCCGTGTGG CTTCGGACGC CTGGATCGCC GATACGGCCG CGCGCCTTGA CCCCGCCCAG
CTGATGCCTT CGCCCACGGC CGACCGTGGC GCTCCGATCG TCGGGCCGGA TGGCGTGATC
GAGAGCGGCA ATGGCCGGAC GGCGGCCATC GCGCGGGCCT ATGAGCGGCA CCCCGACCGC
GCTCTGGCCT ATCGCCAGCA GATCGAGGCG GCCGGTTTCC AGATCCCGGC CGGAATGCGG
CAGCCGGTGC TGATCGCCCG TCGGCAGACA GAGCTTTCCC CTGCCGATCG CAGTCGCTTC
GCGATCGAGG CACAGGACAG CGGCGTGGCG GCGATGACGC CGACCGAGGT GGCCCGGGCA
TCGAGCCGCG CCATGACGCC CGAGGTGCTG GCGCGCTTCG ACCCGTTGCA GGCGCTGACG
GCCGACGCGA ATGGCGAGTT CGTTCGCTCG GCCCTGGCGG GCCTGCCGCG GTCCGCGCGC
AATGCCATGT TCGGCAGCAG CGGGATGCTG AACAAGGAGG GTCAGCGCCG CCTGCGCGAG
GCCCTCTTTG CTCGGGCATG GCCCGATCCC GAGATCCTCG CCCGGTTCAC GGAGACCGAT
GCCGGCGAGC TGAAGAGCTT GCTCGAGGCG CTCGACAGGG CGGCGCCGGC ATGGGCTGCG
CTGCGCGCCG ATATCGAGGC TGGCCGCATC CGTCCCGAAA TGGACATCGG CCCCTATGTT
CTGGACGCCA TGCGCCTGAT CGGTGCCGCG CGCGATTTGG CAAGCCGCGA GGGACTGCCG
ATCGCCCGCG CGCTCGAGGA GCTCCTGGAC GAGATCGACC TGTTGGACGG TGCCGTCGCG
CCTCTCACCG CGGCCCTCGT CCGGAAGTTC TGGAAGAACG GCCGCGCGGC CTCGGCCGAC
GAAGTGGCGA GCTTCCTGAC CCGCTTCGCC GATGACGCCC GCAAGGCCGG CGGCACGGCG
ACTCTCTTCG AGGCCCCGGG CCCGCGCGAG ATCCTGCTGG CGATCGATCG CAAGGCCTTC
GGTGAGCTTC CCGAGGATCT GGGGGCGCCG CGGCGGGCAG TGCCGGCGCA GCCGATCGAG
CTGCCTGCGC GGGGCTTCGA CGATGCGGCA GACCCCGAGG CCGTGGCGGC CGATGTCGCC
GCGCTCGAGG AGCTGGGCGG CGCCGACGTT AGACCGCCGT CTAAAGAAGC GCCCGATCAG
GTTGCCGATG CCGGCAAGGC GATGGCAGAT CGTGAGGTCG TCGATGCCAT CGCCGCCGCT
CGTTCTGAAC TGGGGGACAT GGAAATCGAG ATGCCGGACG GCACCACGCG CAGCGCGGCC
GAGCTGCTGG ATGATCTGGA CGCGGACGCG CAGGCCGATG CCGTCCTTCA GGCTTGTGCC
ATAGGAGGTG CCGCATGA
 
Protein sequence
MTDELPPPET GLPVPPDLDA DSWDVMGAGW KAETIRTDAW SYSQKKRQAL ASEIYDRLTP 
DAKQRIADQR WDYENNWTDF EDMVLGEAAT AAQTAPESFA GLPLSRDLFD QRIDAERRAE
MDEAQAILDQ PGGLIAEFVG AGARAMTDQT SLMMAPFGVT GSAWKTILGE AVMGGLGEAA
VLPREYQVAK ELGLPEPDPL MRIGAGAVLG GGLAAGVIGI GRGISHLRSR RAGIDTARAL
GADDLDATVA IEEAEAALRG DRTVQEVVKP AATTPEPGTL GDVLAAPGGL PPIAPDAPEG
WGQIRNGIFA GESSGDYDAL FGFSNRKGGE FFRVRLTQMT VDQAIAFSDP RGRYAQWVKS
KIGRVATPMG AYQIVGSTLR DAKRALGLQG DELMTKALQE RLGQWIYRTQ GTGAWVGYRG
PRESYTPDVG GDAPSFATSR GYTGSGQVTA GDAFRIDVGY EVVELSSLSR ATGGLQPRDR
SRVASDAWIA DTAARLDPAQ LMPSPTADRG APIVGPDGVI ESGNGRTAAI ARAYERHPDR
ALAYRQQIEA AGFQIPAGMR QPVLIARRQT ELSPADRSRF AIEAQDSGVA AMTPTEVARA
SSRAMTPEVL ARFDPLQALT ADANGEFVRS ALAGLPRSAR NAMFGSSGML NKEGQRRLRE
ALFARAWPDP EILARFTETD AGELKSLLEA LDRAAPAWAA LRADIEAGRI RPEMDIGPYV
LDAMRLIGAA RDLASREGLP IARALEELLD EIDLLDGAVA PLTAALVRKF WKNGRAASAD
EVASFLTRFA DDARKAGGTA TLFEAPGPRE ILLAIDRKAF GELPEDLGAP RRAVPAQPIE
LPARGFDDAA DPEAVAADVA ALEELGGADV RPPSKEAPDQ VADAGKAMAD REVVDAIAAA
RSELGDMEIE MPDGTTRSAA ELLDDLDADA QADAVLQACA IGGAA