Gene Rsph17029_3898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3898 
Symbol 
ID4899146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1032046 
End bp1034661 
Gene Length2616 bp 
Protein Length871 aa 
Translation table11 
GC content69% 
IMG OID640114502 
Producthypothetical protein 
Protein accessionYP_001045749 
Protein GI126464636 
COG category[S] Function unknown 
COG ID[COG0392] Predicted integral membrane protein
[COG2898] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.129662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.427904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTGGC GCAGCGAGAC GCCCGAAGAC GCGACGCCTC CGGACGAGGC CCTCCCGCCC 
CGGTCGCGCC CCGTGCGCGA CTGGCTCAGC CGCAACCGCA CCGTGCTGCT CGCGCTGGTG
ACATGGGTCG TGTTCGCGGC AGTCGCCTAC ACGACCTACC GGATCACGGG CGACATCCGC
TACAAGGACA TCCTCCATGC GCTCGAGGCG ACGACCTGGA CCGACATCCT GATCGCGGGC
TTCTTCACCG TGGTGAGCTT CGTCCTGCTC GCGGGCTACG ATGTCAACGC GCTGCGCCAT
CTGGGCAAGC AGGCCAATCT GGTGCAGGTG GGCATGATCG CCTTCAGCGC CTATGCGATC
GGCAACACCG TGGGCTTCGG CCCGCTGTCG GCCGGGGCGG TGCGCTACCG CGGCTACAGC
CGGCTCGGCC TCTCGGGCGA GCAGATCGCC GGCGTGATCG CCTTCGTGAC CCTCTCCTTC
GGCCTCGGTC TGACGGTGAC CACCGCGCTC GCGGCGCTGG TGGCGGCCGA TCAGGTGGCG
GGCTTTGCCG GTCTCACGCC GCAGATGCTG CGGCTTCTGT CGGCGGCCCT CCTGCTCGCG
CTGACGGTGG CGGCCGTGAT CCTCTGGCGC AGCGACGGGT GGCTGCGGCC GCACATGCCG
CGCCCGGGCA TCGCACTGGG CCAGCTTGCC ATCACCGCGG CCGATCTCAT GGTCTGCGCC
ACTGTCCTCT GGGTGCTCCT GCCGCAGGAT CTGGAGGTCA GCTGGATCTC CTTCGTCATC
ATCTATGCCA TCGCCATCGG CCTCGGCGTG CTGAGCCACG TCCCGGCGGG TCTGGGCGTC
CTCGAGGCGG TGATCCTGAC CACGCTCGGC GGCTCGACGG GGACGGATGC GCTGCTGGGG
TCGCTCGTCC TCTACCGGGT CATCTACCAT GTCGTGCCCC TCCTGATCGC GGTGGTCGTG
GTCGCCTGGA CCGAGGCGCT CGAGGCGTTC CATTCGCCCC GCCTCGAATG GGCGCAGCGG
ATGGGCACGC TGCTCGCGCC CTCGCTCCTC GGGTCGCTCG CGGTGATCTG CGGCGTCATG
CTGATCTTCT CGAGCGTGAT CCCCGCGCGC GAGGCGAACC TCGTCTGGCT CGCGGGCTAC
GTCCCCGCCC TGCTCATCGA AGGGGCGCAT TTCCTGTCGA GCCTGATCGG CCTCGTGCTG
TTCGTCGCGG CCCGGGGCCT CACGCAACGG CTGGACGGCG CCTACTGGCT GACGCTAGGT
GCGGCGAGCG CGGCCTTCCT CTTCACCTTC GTCAAGGCGC TGGCGCCTTA CGAGGCGGTG
ATGCTGGCCG CCCTGATCGG CTTTCTCCTC CTGAGCCGGC CGCTCTTCGA CCGGCCCGCC
TCGCTCTTCT CGCAGACGCT GACGCCGCCC TGGATCGCGG GGATCGCCAC CGTGGCCATT
TCGGCCATCA CGATCCTGCT CTTCGTTCAG AAGGACGTGG CATACAGCCA CGACCTATGG
TGGCAGTTCG AGATCTCGGC CGAGGCGCCG CGCGGGCTGC GCGCCCTTCT GGGCGTAGTG
GTGCTGTCGG CGCTGATCGC AATCCGCAGC CTGCTGCAGC CCTCCCGCCC CGAGCCCGGG
ATGCCCGACG AGGCCGAGCT GCAGAAGGCG CTCGCCATCG TCGAGCGTCA GGACATGGGC
GAGGCCAATC TCGTGCGGAT GCGCGACAAG AGCCTGATCT TCTCGGACGC GGGCGACGCC
TTCCTCATGT ATGCGGTGCA AGGCCAGTCC TGGATCTCGC TCTTCGGGCC CATCGGCGCC
CCGCGCGCGC AGGCCGAACT GATCTGGCGC TTCATCGAGA CCGCGCGCGC CAAGGGCGGC
CGGCCGGTCT TCTATCAGGT GCCGCCCTCG CTTCTGCCGC TCTGCGCCGA CGCGGGCCTG
CGCGGGCTGA AGCTGGGCGA GCGGGCGGTG GTCGATCTCG AGGCGATGGA TCTGCAATCG
AGCCAGTGGG CCGAGCAGCG GCAGGCCCTG CGCAAGGGCG AGCGGATGGG GCTGGCCTTC
GAGCTGCTGG AGCCCGCCGA CCTCGGCCCG ATCCTCGACG AGCTCCAGCA GGTCTCGGAC
GCATGGCTCG CGCATCACGA CACCCGCGAG AAGGGCTTCG CCCTCGGCCG GTTCGAGCGC
GACTATGTGG CCGAGCAGCC GGTGGCGGTG CTGCGCGCCG AGGGACGCAT CGTGGCCTTC
GCCACGGTCA TGCAGACGGG GACGAAGGCC GAGGCCACGC TCGATCTGAT GCGCTTTGCC
CGCAGCGCGC CGCCGGGCTC GATGGATGTG CTGCTGTGCA ACCTGCTGGT CGAGATGAAG
CGGCAGGGCT TCCGCAGCTT CAACCTCGGG ATGGCGCCGC TGTCGGGCAT CACCGCGCAT
CAGGCCGCGC CGTTCTGGAA CCATCTCGGC CAGTCCGTCT TCGAACATGG CGAGCGGTTC
TACAATTTCC GCGGCCTTCG GTCCTTCAAG GCCAAATACC GTCCCGACTG GCAGTCGCGC
TACCTCGTGA CGCCGGGCGG GGTCTCGCCT CTGGCGGCGC TGGTCGACGT CACGCTGCTG
ATCGGCGGCG GCCTCCGGGG CGTGATGCGG AAGTGA
 
Protein sequence
MSWRSETPED ATPPDEALPP RSRPVRDWLS RNRTVLLALV TWVVFAAVAY TTYRITGDIR 
YKDILHALEA TTWTDILIAG FFTVVSFVLL AGYDVNALRH LGKQANLVQV GMIAFSAYAI
GNTVGFGPLS AGAVRYRGYS RLGLSGEQIA GVIAFVTLSF GLGLTVTTAL AALVAADQVA
GFAGLTPQML RLLSAALLLA LTVAAVILWR SDGWLRPHMP RPGIALGQLA ITAADLMVCA
TVLWVLLPQD LEVSWISFVI IYAIAIGLGV LSHVPAGLGV LEAVILTTLG GSTGTDALLG
SLVLYRVIYH VVPLLIAVVV VAWTEALEAF HSPRLEWAQR MGTLLAPSLL GSLAVICGVM
LIFSSVIPAR EANLVWLAGY VPALLIEGAH FLSSLIGLVL FVAARGLTQR LDGAYWLTLG
AASAAFLFTF VKALAPYEAV MLAALIGFLL LSRPLFDRPA SLFSQTLTPP WIAGIATVAI
SAITILLFVQ KDVAYSHDLW WQFEISAEAP RGLRALLGVV VLSALIAIRS LLQPSRPEPG
MPDEAELQKA LAIVERQDMG EANLVRMRDK SLIFSDAGDA FLMYAVQGQS WISLFGPIGA
PRAQAELIWR FIETARAKGG RPVFYQVPPS LLPLCADAGL RGLKLGERAV VDLEAMDLQS
SQWAEQRQAL RKGERMGLAF ELLEPADLGP ILDELQQVSD AWLAHHDTRE KGFALGRFER
DYVAEQPVAV LRAEGRIVAF ATVMQTGTKA EATLDLMRFA RSAPPGSMDV LLCNLLVEMK
RQGFRSFNLG MAPLSGITAH QAAPFWNHLG QSVFEHGERF YNFRGLRSFK AKYRPDWQSR
YLVTPGGVSP LAALVDVTLL IGGGLRGVMR K