Gene Rsph17029_4111 details

Gene Information

Locus tag: Rsph17029_4111
Symbol:
ID: 4894949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 51665
End bp: 56728
Gene Length: 5064 bp
Protein Length: 1687 aa
Translation table: 11
GC content: 67%
IMG OID: 640110510
Product: YD repeat-containing protein
Protein accession: YP_001041822
Protein GI: 126464846
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
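
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the gene length includes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 51665
END_BP = 56728
GENE_LENGTH_BP = 5064
PROTEIN_LENGTH_AA = 1687


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under those assumptions both derived values agree with the reported fields, which suggests the record is internally consistent.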


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 99
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCGT CGCAATATCA GGGCTCGCAG GCCTATAACC AGTCGCTGCA AAGCAACGAA 
TACATCAACA GGAATACCGG GTCGCTGGTG GTGTCGCTGC CGCTGGTGCA GTTGCGCGGG
ATCACCGACG CCATCGGCCT GTCGCTGACG CTGGGCTATT CCGCGGGGGC GTCGGGCCAG
CTGGGCCTGC CGCAGGGCTG GGGCTGGGGG CTGCCCTTCG TCGCCGCCGG GCAATCGCTG
ACGGTCGAGG GTAAGACCTT TGTCATTGAC CCCAGCTGGA CCGACAGTTC GGGCTATCAA
TCCGGCCTGC GTTATCTGAA CGACCATGGG CGCCTGTTCC AGACCGTCGT CCCGCCGCAG
CCCGTTCCGG GCGGCGGCGG GACATACGGG TTCATGCTGC GCTATGACGA CGGATCGATC
AGCTATTTCG ACGCCACGGG CAAGCTGATC GCGCATGCCG ACCTTTACGG CAACATGCTG
CGCTATGCCT ATACCAACCC GCTGGGCGAC GTGTTCCAGA ACAGCCTAGC CAGCATCACC
GACAGTTTCG GCCAGGTTGT CACGTTCGGC TATTCCGGCG GCACGATCGT GCTGACCCTG
CCCGACGGAT CGAGCGTGAC CGTGGCGTTT TCGAGCCAGG GTGTACAGCA TGTCACCAAC
CAGATCGGCG CGGTCACCGC CTTCAGCTAT GCCTCGGCCT CGGGCACGAC GGTCGTCGGC
GCGATCACCT ATCCCACCGG GATGACAACG ACGCTGGCCT ACACCGGCCT CCGCTACATC
GACGCCGGAG GGAATTCCGG CACCCGCCCG GCCGTGCAGT CGATGACCCG GACCGGACCG
GGCGGCGCGT TCCTGGACCG GAGCGACTAC AGCTATGGCA CCGCATCGGG CGGCAATACC
TATACCGGGG CAACCGCGGG GTACCGGATG GGGACGGCCT CGGACGGGCT CATGGACAGC
AACAACACCG CCTACCAGTA CGACGTGCTG GAACAGCGCC GCGACGCGGG CGGCGCACTC
ATTGCCGCCA ACCGGGTGTT CTTCAACTAC CTGCACTGCC CGCTGACGGA GCATGCCTAT
CTGGTGGACG CAAACGGCCA TGCGCAAGAG GCGCACCGCA CCTCCTATAC CTACGACATC
GTCACCGACG CCCATGCGCG AAGCGTGAAC ATGAACAAAC CGGTGACGAC GGTGCACAGC
GTCTATGACG CGGCGGGCGG CACGTGGTCC GACAATTCGC AGGCAGATGT CGCCTATGAT
CTTTATGGAA AGATCACGTC GAGCAGCCAG TACGACCTGA GCAGCGGCAG ACCCCGGCTG
ACCGGCCAGC GGACGCACAG CTTTCTCCAG GTCGCGTGGG GAGGCGAGAT GCCGCAGCGA
ACCGACTACG TCGATGCGGT CACCGGCCAG ACGACGCGGA TCGACTATGG GCTGACGGCC
GACCAGAAAT CCATCGCCGC CACCACCGTC TCTACCCAGG CGGCGGGCGC GGGCAGTTTC
GCCCCCTACA AGACCAAGAC CTTCCAGTTC GATCCCCGGG GCTGCCAGAC CGGCTGGACC
CTGACCTGGG CCCGGGGCTA CAGCGGTCCC GAGGGGTCGG TCGCTTCGGT CGGCGAACAG
ACCTCCTACA GCTACGATGC CGCGTCGCAT CAGCTGACCG TCGACACGAC TGACGCCAAC
GGCCACACCC GGGTCGCGGT CTTCGACATG CGCCTGCCGG GCGGACCCTG CCTGAGCCAG
ACCAAACCCG GCGGCGCGCG GCGCAGCTAT GCCTACGACC CGCTGGCGCG CCTGATGCAG
GAAACCGACC CGCTGGGCGA TGTCACGACC CATGCCTACA CTCTGGCCGG AAACGGTGGC
GGCGGCACCA ACACCAGCAC CGTGACCCAG TCCAACGGCT ATGTCCTCTG CACGACCTAC
GATGCAAAGG GGCGCGCCGT CCTGATGATG GACAATGGCG ATCCAACCCA ATCGACGCCC
AGCCTGTCGC GTACCCTGCG CCGCGTGAGT TTCAATGCGC TGGATCTGAA GGCGAGCGAG
ACCGACGCCA CCGAACAGAC CCTGACGATC AGCTATGACG CGCTGGGACG TCAGATCGCC
CTGGCCGATG CCCTGGGCAA TCAGAGCACG ACCATCTTCG ATGACGCGGC CCGAACGGTC
AGCACCTCGA TGAACGGCGA TCTTCGCACG GTCATCACCC GTGACGGGCT GGGCAACACG
ATCCAAACCG ACACCCATGC CGACAGCGGC TCACCCCAGG CGGGCCAGGT GCAGCGCGTG
ACCTCGGCCT ATGACGGCTT TGGCCAAGTG GTGACCGAAA CGCACTTCAG CATCACCGGC
GGCAACGTTG TGCAGAATTC GGTCAAGACC TTCACCTATA CGCCGGACGG CAATCCCGAA
GTTGTGGATT TCACCGGCAC CCCCGCGACC GCCGCGTTGC AGGCGACGCG CTGCACCACC
ACGCAGCGCG ACCTGAACGG CAATCCGATC CTTGTCACCC GCAACGTCAG CTATAACGGC
GCGGCGCAGC CCCTCGTTGT CAGCGAGACG CTGACCTATG ATCCGGTCGG CAACCTGATC
GAGATCCGGA ACCAGGCCGG GCAGATCGAA CGGCTGAGCT ACACGACCGA CAGCGCCCTG
CAAAGCCGCA CCCGCTATGA CGGCACCCAG ACGACCTATG CCTACGACGC CGCAGGCCGG
GTGCTGAGCG AGACCACAAA CGGGCAGACG CGCACCTTCG CCTATCTTTC GAATGGCAGG
ATCGACCAGA TCACCGATCC GAGCGGCACC TTGCGCTATG CCTACTCTCT GGACGGCACG
GCCTCGTCCG TGACCTATCC CGATGGCAAG ACGCTGTCGC TGACCAAGGA CGCCACCAGC
CGGGTGGTGT CGATGACCCT GCCCGATGGC ACCGCGGCAA GCTATTCGTA CAACACCCTG
AACCAGATCA TCGCGCAGAC CATGGGCGGG GTCACACTGA GCAACACCTG GGGCACGGCC
AACCACGCCA ACGGCGTGTT GCTGAAACAG GTGCTGGGCG GCGCGACGGC GCAGACCACG
CAGTTCGGCT ATGACGGATT CGGGGCAAAC GACAGCGTCG CGGTCACCGA CGGCGCGGGC
ATCGGCGTCC TGAGCGCCGC CGCCACCCGC GACGGCTGGC GGAACCTGGT GTCGCTGACG
CTCGCCTCGG CTGTGAACAC CGATCCTTCG GTGAACGTCG CGAAGACGAT GAGCTATGAC
GGCCTGAAGC AGCTGGTGGG GATCACGCTG GCCCCGTCCG GCGGCGGGTC GCCGACCCAG
GTAAGCTATG CCTACGATGG CGCCGCCAAT GTGCTGACCC GCACCCGGAA CGGCCAGCAG
GAGAGCTTTG CCTACAACGC TCTGAACCAG ATCACCTCGG GCGCTGCCGC CTATGACGCC
AATGGCCGCA TGGTCAGGGA CGTGGACGGG TCGACCTATG GCTTCGACCC GCTCGACCGG
CTGACCAATG TGGGCATGGC CTCGGGGCCC TCGATGTCCA ACAGCTATGG TCCGCAGGGC
GCGCTGGCCT CGGTGAACGA CGGCAGCACC GAGGACAGGT TCTATCCGCT GGCCGGGACC
ATGGTCAGCG TGGCGGCGAA TGCCCAGTCG GGCACGCCGG AATGGCACGG GCTCATGTGG
GCGGGACAGA TGCCCGTCGC GCGGGTCAGC GCCGGATCGG TCACCGCCTA TGCGGCGGCG
TCGAAATCGG TCTATGTGCA CCGCACCTCG GCCAGCAGCA ACGCTCTGGC CATCTCGGCC
TATGGCACGG TCACACCGCA ATCCGCGCTC GACCGCGCCA ACAGCTTCAA CTGGAATTCG
CAGTTCACCG ACCCGGTCAG CAACCTGACC TATCTCAGGG CGCGCTGGTA CAACCCGGAA
ACCATGCGCT TCTTGTCGCT CGATCCGCGC ATCACCATGA ACCGGTACGC CTATGCCATG
GGCAACCCGA TCGCGAACTC CGACCCGCTG GGCCAGAGCT GGGAAGAGAT CGTGGGGCTG
ATCGCCGGGG CCATCGTCGG CATCGGCGCA ACCGTGCTGA CCGGCGGGGT CGCGGGCGCA
GCCGCGGCGG CGGTCTTTGG CACCGAATGC GTGGCCGCCA GCATCGGCGC GGGCGCGCTG
GCGGGTGCCG TGGGCTCGGT CGCGGGGGAT CTGACCAGTG CCGCGATCTC GGGCCAGAAG
ATCACCGGCG CGCGGGTCGG CATCGACCTG CTGAGCGGCG CGGTGGGGGG CGCGGTCGGC
GCGGGCCTGG GCGGGGCGGC GGGCCGGGTC GCGATGCGCG GGGCGCTGAA CGCGGGCTGG
TCGCAGGCGG CGATCACCCG CGTCGGCCTC ATCACCTCGG GCGCCATCGG CGGACTGACC
GGGGCCGCCG CCTCGGCCGG GGTCACCTCG GTCGCCTATC AGCAGCCGTT CTTCTCGACC
GGGAACATCG TCAGCATGGC GGTGGGCTTT GGTGCCGGCG CGGGTGGCGG GATCCTGATG
TCGGGCGCCT ATCTGGGCAA GATAAACGCC AAGATCATTC CGGTGCCCAT CGGCGAGGAC
GAACTGCACC TGATCACCCC GGCGGTCGAC ACGCGCGGCG CGGTGGGCGA GAACGAGCGC
CTTCTGGTGA TGGCGCCGCA GCCCGAGGCC GAAACCAGCG CCAACGGATT CCAGAGGCGC
CCGGGCGGGT ACAAATACGC GATGCGGCTG GATTTCGGCG AGGGCGAAGG CGAGGGCCGC
CCGCTGATGG CGCCGGGGCG CGAGGAATCG GTGGACACCA TCGCCGGGCA TGGCGCGGGC
AACACCATCT TCGCCAGTGT CGATGTGAGT GGCGACGGAG CCCCCGATTT CGTACGCCCG
ATCTCGGGCC GCAACTTTGC CCGGTATCTC GTGGACGAGG GCTGGCGCGA GCGTGAGGGG
CCGATCAAGC TCATGTCCTG CTTTGGCGCC TTCCGCAATG CACGGGTCAT CGCCGACACG
TTGGGCCGCG ATGTCTGGGC GGGTTATCCC GAGCTCGACC GCTATTCCTT CGCCGGCTGG
GTCCGCTTTC CGGCGCCGCA TTAG
 
Protein sequence
MTSSQYQGSQ AYNQSLQSNE YINRNTGSLV VSLPLVQLRG ITDAIGLSLT LGYSAGASGQ 
LGLPQGWGWG LPFVAAGQSL TVEGKTFVID PSWTDSSGYQ SGLRYLNDHG RLFQTVVPPQ
PVPGGGGTYG FMLRYDDGSI SYFDATGKLI AHADLYGNML RYAYTNPLGD VFQNSLASIT
DSFGQVVTFG YSGGTIVLTL PDGSSVTVAF SSQGVQHVTN QIGAVTAFSY ASASGTTVVG
AITYPTGMTT TLAYTGLRYI DAGGNSGTRP AVQSMTRTGP GGAFLDRSDY SYGTASGGNT
YTGATAGYRM GTASDGLMDS NNTAYQYDVL EQRRDAGGAL IAANRVFFNY LHCPLTEHAY
LVDANGHAQE AHRTSYTYDI VTDAHARSVN MNKPVTTVHS VYDAAGGTWS DNSQADVAYD
LYGKITSSSQ YDLSSGRPRL TGQRTHSFLQ VAWGGEMPQR TDYVDAVTGQ TTRIDYGLTA
DQKSIAATTV STQAAGAGSF APYKTKTFQF DPRGCQTGWT LTWARGYSGP EGSVASVGEQ
TSYSYDAASH QLTVDTTDAN GHTRVAVFDM RLPGGPCLSQ TKPGGARRSY AYDPLARLMQ
ETDPLGDVTT HAYTLAGNGG GGTNTSTVTQ SNGYVLCTTY DAKGRAVLMM DNGDPTQSTP
SLSRTLRRVS FNALDLKASE TDATEQTLTI SYDALGRQIA LADALGNQST TIFDDAARTV
STSMNGDLRT VITRDGLGNT IQTDTHADSG SPQAGQVQRV TSAYDGFGQV VTETHFSITG
GNVVQNSVKT FTYTPDGNPE VVDFTGTPAT AALQATRCTT TQRDLNGNPI LVTRNVSYNG
AAQPLVVSET LTYDPVGNLI EIRNQAGQIE RLSYTTDSAL QSRTRYDGTQ TTYAYDAAGR
VLSETTNGQT RTFAYLSNGR IDQITDPSGT LRYAYSLDGT ASSVTYPDGK TLSLTKDATS
RVVSMTLPDG TAASYSYNTL NQIIAQTMGG VTLSNTWGTA NHANGVLLKQ VLGGATAQTT
QFGYDGFGAN DSVAVTDGAG IGVLSAAATR DGWRNLVSLT LASAVNTDPS VNVAKTMSYD
GLKQLVGITL APSGGGSPTQ VSYAYDGAAN VLTRTRNGQQ ESFAYNALNQ ITSGAAAYDA
NGRMVRDVDG STYGFDPLDR LTNVGMASGP SMSNSYGPQG ALASVNDGST EDRFYPLAGT
MVSVAANAQS GTPEWHGLMW AGQMPVARVS AGSVTAYAAA SKSVYVHRTS ASSNALAISA
YGTVTPQSAL DRANSFNWNS QFTDPVSNLT YLRARWYNPE TMRFLSLDPR ITMNRYAYAM
GNPIANSDPL GQSWEEIVGL IAGAIVGIGA TVLTGGVAGA AAAAVFGTEC VAASIGAGAL
AGAVGSVAGD LTSAAISGQK ITGARVGIDL LSGAVGGAVG AGLGGAAGRV AMRGALNAGW
SQAAITRVGL ITSGAIGGLT GAAASAGVTS VAYQQPFFST GNIVSMAVGF GAGAGGGILM
SGAYLGKINA KIIPVPIGED ELHLITPAVD TRGAVGENER LLVMAPQPEA ETSANGFQRR
PGGYKYAMRL DFGEGEGEGR PLMAPGREES VDTIAGHGAG NTIFASVDVS GDGAPDFVRP
ISGRNFARYL VDEGWREREG PIKLMSCFGA FRNARVIADT LGRDVWAGYP ELDRYSFAGW
VRFPAPH
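
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: it builds a codon table from the standard NCBI ordering, whose amino-acid assignments translation table 11 shares (table 11 differs only in which start codons it permits), and translates the first and last blocks of the gene sequence as printed in this record. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard NCBI codon ordering (bases T, C, A, G at each position).
# Translation table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 24 nucleotides of the gene sequence in this record.
FIRST_BLOCK = "ATGACGTCGT CGCAATATCA GGGCTCGCAG"
LAST_BLOCK = "GTCCGCTTTC CGGCGCCGCA TTAG"

if __name__ == "__main__":
    print(translate(FIRST_BLOCK))  # compare with the start of the protein sequence
    print(translate(LAST_BLOCK))   # compare with the end of the protein sequence
```

The last block begins at nucleotide 5041, so it is in frame with the start codon. Passing the full gene sequence to `translate` in the same way should reproduce the complete protein sequence.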