Gene Rsph17029_3954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3954 
Symbol 
ID4899128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1088884 
End bp1092264 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content76% 
IMG OID640114557 
Producthypothetical protein 
Protein accessionYP_001045804 
Protein GI126464691 
COG category[S] Function unknown 
COG ID[COG4717] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.573007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTCG ACCGGCTCGA CCTCACGCGC TACGGCCATT TCACGGACCG GCGGCTCGCA 
TTTCCCGCGC CCACGCCGGG CGAGGCGGAC CTGCATGTGG TCTACGGGCC GAACGAGGCG
GGCAAGTCCA CGCTCTTCTC GGCCTGGCTC GACCTCCTCT TCGGCATCCC GCTCCGCACC
CGCTACGACT TCCGCCACCC CGGCCCCACG ATGCGCGTGG GCGCGCGGCT CAGCCATGCG
GGCGGCGCGC TCGATCTGGC CCGGGTGAAG CGGAACACCG GCAGCCTCCT CGACGCGCAC
GACCAGCCGG TGCCCGAGGC GCTGCTGCAA TCGGCGCTGG GCGGCCTCAC GCGCGAGGGC
TATTCGGCCA TGTTCTCGCT CGACGACGAC ACGCTGGAAA AGGGCGGCGA CAGCATCCTC
GCGAGCCGCG GCGATCTGGG CGAGATGCTC TTCTCGGCGA GCGCGGGGCT GGCCCAGCTG
AGCCCCCGGC TAGAAACGAT CCGCACGGCC CTCGACGGCT TTCACCGCAG CGGCAAGCGC
AGCGGCTGGC TTTGGGACAC GAAGAAGCGG CTGGCCGAAC TCGATGCGGA GCGGCGGCGG
CTCGACGTGT CTGCCGGCAC GATCCACAGG CTCGCGCGCG AGGCGGAGGC GGCCGAGGCC
GCCTGGCGGG CGGCGCGCGG TGCAGAGGAT GCGGCGCAGG CGGATCTGGG GCGGCTGCAG
GATCTGGCGG CGACCCTTCC GATCCGGGCG CGGCTCGAAG GGCTGCGGGC GCGGCTCGCG
CCGCTGGCCC ATCTGCCCGA AGCCGGGGGG GCCGAGCGCG ACCGGCTTGA CCGGCTCGAC
CGCGAGACCG AAGCGCTGCG CGCGCGTCGC GCCGACCGGG CCCGCCGCCT TGCGGATCTC
GCGGAAGAGG CCGATGCCTT GCCGCTCGAT CCGGCGGTGC TGGCTGCCGC CGGCGACATC
GAGGCGGCCG AGGCGCTGCG CCCCGAGCAC GAGACGGCGC AGAAGGACCT GCCCCGCCGC
GAAGCCGAGG CGTTCGAGGC CCGCGCCGAG GTGACGGCGC TTCTGGCCGA ACTCGGCCAT
CCGGGGGCAA AGCCGGAGGG GCTGGTGCTT CCGGCAACCA CGCTCGCACG GCTCCGCGCC
CTTGCCGCAG AGCGCTCCGG CCTCGAAGCG ACCGCGGCCG CCTCGGAGGC CGAGCGGCAC
GCGGCAGCCG AACGGCTCGC GCGCGAGCGC GACCGTCTGG GCGATCCGGG CCCCGAGGGC
GAGGAGGCCA CGCTGGTCGC GCTGCTCGCG CGGCTGCGGG CGCAGGATCC GGCCGAGGCC
CATGCCCGGG CGCGGCTCGA CCGCAACCTG CATCAGGCGC GCCTCACCGC CGCGCTCGAG
GCGCTTGCCC CCTGGCAGGG CGATGCCTCG GCCCTCGCGG CCCTGCCCGT GCCCTCCGCC
GCCCTGCTCG ACGGCTGGGA GCGCGGCCTC GAAGAGACCC GCCAGCGCGC GGCCGACGCC
CAACGGACGG CCGAGGCAAT CCGGGCCGAT CTCGACAGGC TGCGCCACGA TGCAGCGGCA
GAGCGGGGCG CCGCCTCGGC CACCGGCCTC ACCCTCACCG AGGCCGCCGC CGCCCGCAGC
CGGCGCGAGG CCGCCTGGGC GCGCCATCGC CGGAGCCTCG ATGCGGCCAG TGCCACCGAG
TTCGAACAGG CGCTCCGCGA AGACGACCGC ATCTCCGCCC TCCTGGCCGA GGCGCTGGCC
GAGGCCCGCC GCGCCGCCGG AGCCGAAGCC GAGGAGGCGC GGCTTGCCCG GGCGCTGGCC
GAGGCGGAGG CGGCCCGTGA CGCGGCCCGG ACCGGTCAGG CGCAGATCCG CGCCGCCCTT
GCCGAGGCGG GAGGGGCGCT CGGCCTCTGC GATGCCGACC TCTCCGGCCT GCGGCACTGG
CTGGCGCTGC GCGACGAGGC GGCGGCCCGG CAGGCGGCGC TCCGCGAGGC CGAAGCCCAC
TGCACCCGCC AGTCGGAGGC GCTGGACGCG GCCAGCCTTG CGCTCGCCGC AGCCCTCGGC
GCGCCGGAGG GCACGCCCTT CGAAACCCTC CTCTCGACCG CCATCGCCCG CACCGAAGCC
GCCGAGCGCC GGCGCGAGGC GCGGCGGCAG CTGGCCGGGC TGGCCGCCGA TCTCAAGGCC
CGCGAGGCCG CCGAGGCGCA GGCACAGCAG GCGCTCGCGC GCTGGCGCGA AAGCTGGCAC
GAGGCGAGCC GGGGCACGAT CCTCGCCGAC GGCCCTTCCG AGGGTCCGGT GCTCGATCTC
CTCGATGCGC TCGGCGCGGC CGCCCGCAGC CTCGCCGCGC TCGAGGACCG GATCGCCAAG
ATGGAGGCGA ACCGCGCCCG GTTCGAGGCC GCCCGGACAG CCCTCCTCAC GCGGCTCGGC
CTCGATCCCG ACACGGGCTG GGAGGCGCTG CGGTCCCGGC TGCGCCGCGC GCAGGATGCG
GCGCGGGACG CGGAACGGCT CGCCCAGCAG CGCACCACCG AAGACCGTCA GGAGGCCGAG
GACCGCCGCA CCCTCGCCGC GCTGGACGAA GACCGGGCCG CGCTCGCCCA AGCGCTCGGC
TGGTCCGAGG CGGACGGGCC GCTCGCGGCC CACCTCGCCT GCTGCCTCGA GGCGGCCGAG
CTGCGCCGTC AGGTGGCAGC CCTTCTGTCG GACCTCTCAG GGCGGCCCGA ACCGCAGGAG
ACCGACGATC CCGCGACGCT CACTTCCCGG ATCGAGAAAC TGCGCACCGA CCTCCAGCTC
CTGCGGAGCG AGGCCGAAAG CGGCCTCACC GCCCATCTGG ACGCCCGGCG CAGGCTCGAG
GCGGTCGGCG GCGACGATGC GCTGGCCCGG ATCGCCTCGG ACCGCGAGAC CCTGCTGGTG
GAACTGCGCG ACCGCGCCCG CGCCCATCTC GCCGCCCGCT TCGGGCTGAT GGCCTTCGAG
ACCGGGCTCC GGCGCTACCG CGACCGGCAC CGCAGCGCGA TGCTGGCCCG CGCCTCGGAC
GCCTTCTGCC GCCTCAGCCG CGGCGCCTAT GGAGGCCTCA CCGCCCAGCC CGACGGCGCG
CAGGAGGTGC TGGTGGCGCT GGCCGCCGAA GGCGGGGCGA AACTGGCGGC GGATCTCTCC
AAGGGCACGC GGTTTCAGCT CTATCTCGCG CTGCGCATCG CGGGCTTCCA CGAGCTCGCC
CAGAGCCGCC CGCCCGTGCC CTTCATCGCC GACGACATCA TGGAGACCTT CGACGACGAC
CGCTCGGCCG AGGCCTTCGC CCTGCTGGCC GACATGTCCC GCGTGGGGCA GGTGATCTAT
CTGACGCACC ACCGCCACCT CTGCGACATC GCCCGTGCCG CCTGCCCCGG CGCCTCGCTG
ATCGACCTCA CGGCACCCTG A
 
Protein sequence
MRLDRLDLTR YGHFTDRRLA FPAPTPGEAD LHVVYGPNEA GKSTLFSAWL DLLFGIPLRT 
RYDFRHPGPT MRVGARLSHA GGALDLARVK RNTGSLLDAH DQPVPEALLQ SALGGLTREG
YSAMFSLDDD TLEKGGDSIL ASRGDLGEML FSASAGLAQL SPRLETIRTA LDGFHRSGKR
SGWLWDTKKR LAELDAERRR LDVSAGTIHR LAREAEAAEA AWRAARGAED AAQADLGRLQ
DLAATLPIRA RLEGLRARLA PLAHLPEAGG AERDRLDRLD RETEALRARR ADRARRLADL
AEEADALPLD PAVLAAAGDI EAAEALRPEH ETAQKDLPRR EAEAFEARAE VTALLAELGH
PGAKPEGLVL PATTLARLRA LAAERSGLEA TAAASEAERH AAAERLARER DRLGDPGPEG
EEATLVALLA RLRAQDPAEA HARARLDRNL HQARLTAALE ALAPWQGDAS ALAALPVPSA
ALLDGWERGL EETRQRAADA QRTAEAIRAD LDRLRHDAAA ERGAASATGL TLTEAAAARS
RREAAWARHR RSLDAASATE FEQALREDDR ISALLAEALA EARRAAGAEA EEARLARALA
EAEAARDAAR TGQAQIRAAL AEAGGALGLC DADLSGLRHW LALRDEAAAR QAALREAEAH
CTRQSEALDA ASLALAAALG APEGTPFETL LSTAIARTEA AERRREARRQ LAGLAADLKA
REAAEAQAQQ ALARWRESWH EASRGTILAD GPSEGPVLDL LDALGAAARS LAALEDRIAK
MEANRARFEA ARTALLTRLG LDPDTGWEAL RSRLRRAQDA ARDAERLAQQ RTTEDRQEAE
DRRTLAALDE DRAALAQALG WSEADGPLAA HLACCLEAAE LRRQVAALLS DLSGRPEPQE
TDDPATLTSR IEKLRTDLQL LRSEAESGLT AHLDARRRLE AVGGDDALAR IASDRETLLV
ELRDRARAHL AARFGLMAFE TGLRRYRDRH RSAMLARASD AFCRLSRGAY GGLTAQPDGA
QEVLVALAAE GGAKLAADLS KGTRFQLYLA LRIAGFHELA QSRPPVPFIA DDIMETFDDD
RSAEAFALLA DMSRVGQVIY LTHHRHLCDI ARAACPGASL IDLTAP