Gene Rsph17029_3404 details


Gene Information

Locus tag: Rsph17029_3404
Symbol:
ID: 4898816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 468685
End bp: 472212
Gene Length: 3528 bp
Protein Length: 1175 aa
Translation table: 11
GC content: 71%
IMG OID: 640114001
Product: hypothetical protein
Protein accession: YP_001045269
Protein GI: 126464156
COG category:
COG ID:
TIGRFAM ID:
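
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(468685, 472212)
print(length)                     # 3528
print(protein_length_aa(length))  # 1175
```

Both values match the Gene Length and Protein Length fields listed in the record.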


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.24888
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAA TGTTCGGAAT GAACGTGCGC GCGCTCGCGG GCCTCTCGGT CTCGCTGCTG 
GCTCTCGGCG CAGGCAGCGC GCTGGCGGAG GACTGCGCGG GCAGGCCCGG CCCATCGGCG
ATGGAAGCCT GCGATCAGGT GGCGCTGCCC GTGGGCGAGA ATACCGAGGC CGCGGGCCCG
GTGGTGCAGC CTCCGCTGGG CGGCGACGGC TTCCTGATCT CGGTGGACGG CGCGCCGGCC
GCGGGCGATC CCGCAACGGC CGACGGCCAG CGGCAGGCGG ACCTCCGGCT CTCGGACGCC
GATGTGGATC TGCGCTTCGA CGGTTTCGGC GTGGCGCCGC GGCTCGATGC GGGCCGCGCC
GATCCGGGAC CCGCCGGGCC GGGCGACCGC GTGGTGCTGC GGAGCCGGAC CAACTATCCG
GCCTTCCTCG CCCGGGCCGA GTTCCGCATC TATGACATGG ACGCCTCGGG CGGCAGCACG
CGGCTCCTGA CCACCGTTCC GGTCGCGCCG AACGGCGAGG CGGCGCTCAC GCTGCCCGAG
GGGCGCGACC TGCAATATGT GCTGCGCGTC TACGACCGCG CGGGCCGCTA TGACGAGACG
GCCCCCCTGC CCTTCGGCGC CCCCGACCGC CGCGGCCTCG CCTTCGGCGC GGGCGAGGAG
GAGGGGTCTG ACAATGCGAC CCAGCGCAAC ATCCCGATCC GGGGCGGCGC GGTGACGGTG
CATGTGCGCA ATCTCGCACC GGGCGCGCGG CTCGAGACGC TGGGCACGAG CGTGGTCCCC
GATCCGTCGG GCGCGGCCGT CGTGCAGCGC ATCCTGCCGC CGGGCCATCA CGGGATCGAC
GTGCGGGTGC CGGGCGCGCT CGACATCACG CGCGACCTGA CGATCCCCGC CTCGGACTGG
TTCACCGTGG GCGTGGCCGA CCTCAGCTTC GGCCGCCGCA TGGGCTCGCT GCCCGAGGGC
ACCGACAAGA CGTGGCAACG CGGGCGGCTG CAGTTCTATG CCAAAGGCAA GACTGCCCGC
GGCTACGAGA TCACCGCCTC GGCCGACACG CGCGAGGACG ATCTGTCGGA CCTTCTGGGC
AACGTGCTCG ACAAGGATCC GCGCGCGGTG CTCGGCCGGC TCGATCCCGA TCTCTACTAT
CCGACCTACG GCGACGACTC GATCCTCACC GACGACACGC CCACCTCCTC GGGTCTCTAT
GTGAAGGTGG AGAAGGACGG CAGCTTCGGC CTCTGGGGCG ATTTCAAGTC GAAGAGCCGC
GGCACCGAGC TTCTGCGCAA CGAGCGCAGC CTCTATGGCG GCCAGCTCGT GCTGCAGAGC
CGGGCGACCA CGCCCGAGGG CGAGGCGCGG CTGCGCTTCG AGGGCTACGC GGCCGAACCC
GACCAGCTGC CGCAGCGCGA CCTGCTGCGC GGCACCGGCG GCTCGGTCTA TTTCCTGTCG
CGTCAGGACC TTCTGGAAGG ATCCGAGACG CTGACCGTTC AGGTCCGCGA TCCCGATACC
GGCCGGGTGA TCTCGACCCG TGGCCTCGTG AACGGGCAGG ATTACTCGGT CAATTACGCG
CAGGGGGTGG TGACGCTCTA TGCGCCACTC TCCTCCTATG CGGGCACCTC CGGGCTTCTG
TCCGGCAGCG CGGTGGGCGA GGACGATCTC TATCTCGTGG CGCAATATGA ATGGGCGCCG
GTGACGGGCG ACGTCGACGG CATGGCCTTC GGCGGGCGCG TCGAGGCCTG GGCCACCGAC
CGGCTGCGGC TGGGGATGAG CGGTCAGGTG GACCGCACCG GCCTTGCCGA CCAGACCTCG
ACCGGGGCCG ACCTGCTCTG GAAACTGTCC GAGGGCACCT ATCTCGAGGC CGAAGCCGCG
CGCTCGGAGG GGCCGGGCTT CGGCTTCACC AGCTCGATCG ACGGCGGCCT CACGCTCGAG
ACGACCGATC CGGTGGACGG CACCGGCGAG GGCTACCGGC TGAAGGCGCG GGCCGATCTG
GCGGACCTCC GGCCCGGCAC AGAGGGACAT GTCGAGGTCT GGGCCGAGCG GCGCACGGCG
GGCTTCTCCT CCATCGACCA TCAGACGACC GAGGACGAAG AGCTCTGGGG CCTCGAGACC
GAGGTCGCGA CGTCGGAGCG CGGAAGGCTC GCCTTCCGCT ACGAACATTA CCGCAAGGAT
CCCGACGAGA AGCTCGACGA GGCGCGGCTG GGCTATGCCC ACCGGCTGAA CGACCGCGAC
ACGGTGGAGC TGGCCTTCGG CCATCTCGAC CGCGAGGATC CGGGCCGCGC CGACCGTACG
GGGCGCAGGC AGGATGCGGG CGCGGGCTTC CGCCGCGAGG TCTCGGACCT GCTGAGCTGG
GAGCTCTGGG GACGCACCAC CGTGGCCCGG TCGGGCGGGA TCGAGCGCGC GGACCGCGCG
GGCGTCAAGC TCGACACGGC ACTCGGGCAG GACTGGCGGC TGCAGACCGG GATCTCGGCC
GGCCACACCG GCTGGGGCGG AGAGATCATG CTGCGCCGCG AGAAGGACGC GGCCGAGAGC
ACCTATCTCG GCTATGTGCT CGACCCCGAC CGGACGCTCG ACGACGTGAC CCTGACCGGG
CGCGACCACG GCAAGTTCGT GGGCGGCGCG CGGCGCAAGC TGGGCGAGAG CACCTCGGTC
TTCGGCGAGA ACAGCTACGA CCTCTTCGGC CAGCGGCGCA CCCTCGCCTC CTCCTACGGC
GTGGAATATG CCGCGAGCGA GCGGACGGTC TGGACCGGGG CCGCCGAGTT CGGCCGCGTG
GCCGACGATG CGACGGGCGA TCTCGACCGC GTGGCGCTCT CGTTCGGGGT GCGCTACGAC
GATGGCGAGC GGCTCTCGAT GAAGGGCCGG CTGGAACTGC GCCGCGACAG CGGGACCTAC
GAGGGGCGCG ACCGCGATGC GGACACGATC CTCGGCACGG GCACGGTGCG CTATGCGCTG
AACGAGGCCG AGCGGATCGT GTCGAGCCTG CAGTTCGTGC TGCCCGACAA CGGCACCGCC
ACCCTGCCCG AGGGCGACTA CATCCGCTAC GATCTGGGCT ATGCGCTGCG GCCCACCGAC
AACGACCGGC TGAACCTGCT CGCGAAATAC CAGTATCTCT ACGACCTCTA CGGTCAGGAG
ACGGACGGGG TGCAGAGCCG GAGCCCGCGG CAGAAGACCC ATGTGGTGAG CCTCGATGCC
GAATATGACG TGACCGAGCG CTGGACGCTG GGCGGCAAGC TGGGGATGCG GTTCGGTGCG
AGCTCGGCCG CCGAGGGCGA GGCCTTCGTG GACAATGACG CCTGGCTGGG CGTGGTCAGC
GCGCGCTATC ACCTCGTGCA CAACTGGGAC CTGCTGGCCG AGGTGCGGCA GCTGCATGCC
GAACAGGCGG GCACGACCGA GACCGGCGTG CTGGTGGGCG GCTACCGGCA GGTCAACCGG
AACATGTCGA TGGGGCTGAT CTACAATTTC GGCCGGTTCT CGGACGATCT GACCGACCTC
GTGCAGGACG ACAAGGGGCT GGCGCTGAAC CTGATCGCGC AGTTCTGA
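
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. A minimal sketch of that cleanup and of a GC-fraction calculation follows; the helper names are hypothetical, and the example uses only the first line of the sequence, so its result is not the 71% reported for the whole gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGACCAGAA TGTTCGGAAT GAACGTGCGC GCGCTCGCGG GCCTCTCGGT CTCGCTGCTG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # 0.633 (38 of 60 bases)
```

Passing the full sequence text to the same two functions gives the GC fraction of the entire gene.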
 
Protein sequence
MTRMFGMNVR ALAGLSVSLL ALGAGSALAE DCAGRPGPSA MEACDQVALP VGENTEAAGP 
VVQPPLGGDG FLISVDGAPA AGDPATADGQ RQADLRLSDA DVDLRFDGFG VAPRLDAGRA
DPGPAGPGDR VVLRSRTNYP AFLARAEFRI YDMDASGGST RLLTTVPVAP NGEAALTLPE
GRDLQYVLRV YDRAGRYDET APLPFGAPDR RGLAFGAGEE EGSDNATQRN IPIRGGAVTV
HVRNLAPGAR LETLGTSVVP DPSGAAVVQR ILPPGHHGID VRVPGALDIT RDLTIPASDW
FTVGVADLSF GRRMGSLPEG TDKTWQRGRL QFYAKGKTAR GYEITASADT REDDLSDLLG
NVLDKDPRAV LGRLDPDLYY PTYGDDSILT DDTPTSSGLY VKVEKDGSFG LWGDFKSKSR
GTELLRNERS LYGGQLVLQS RATTPEGEAR LRFEGYAAEP DQLPQRDLLR GTGGSVYFLS
RQDLLEGSET LTVQVRDPDT GRVISTRGLV NGQDYSVNYA QGVVTLYAPL SSYAGTSGLL
SGSAVGEDDL YLVAQYEWAP VTGDVDGMAF GGRVEAWATD RLRLGMSGQV DRTGLADQTS
TGADLLWKLS EGTYLEAEAA RSEGPGFGFT SSIDGGLTLE TTDPVDGTGE GYRLKARADL
ADLRPGTEGH VEVWAERRTA GFSSIDHQTT EDEELWGLET EVATSERGRL AFRYEHYRKD
PDEKLDEARL GYAHRLNDRD TVELAFGHLD REDPGRADRT GRRQDAGAGF RREVSDLLSW
ELWGRTTVAR SGGIERADRA GVKLDTALGQ DWRLQTGISA GHTGWGGEIM LRREKDAAES
TYLGYVLDPD RTLDDVTLTG RDHGKFVGGA RRKLGESTSV FGENSYDLFG QRRTLASSYG
VEYAASERTV WTGAAEFGRV ADDATGDLDR VALSFGVRYD DGERLSMKGR LELRRDSGTY
EGRDRDADTI LGTGTVRYAL NEAERIVSSL QFVLPDNGTA TLPEGDYIRY DLGYALRPTD
NDRLNLLAKY QYLYDLYGQE TDGVQSRSPR QKTHVVSLDA EYDVTERWTL GGKLGMRFGA
SSAAEGEAFV DNDAWLGVVS ARYHLVHNWD LLAEVRQLHA EQAGTTETGV LVGGYRQVNR
NMSMGLIYNF GRFSDDLTDL VQDDKGLALN LIAQF