Gene Rsph17025_3400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3400 
Symbol 
ID5086061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp274239 
End bp275450 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content69% 
IMG OID640484966 
ProductO-antigen polymerase 
Protein accessionYP_001169582 
Protein GI146279424 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGATCC TCGTGGTGGT CAGCGAGTTT CCGAAGCTGA CCGAGACGTT CGTCTATCGC 
AACATCGCCG AGTATCGCCG AGCGGGACAT GGGGTCCGGC TCTTCTACGC CAAGAAGCAC
TTTCCGCAGG AACTGGTCCA CGGCTTCGCC CGCGAGACCG CCGACAGCGC CTTCACCTTC
GGCTTCCTCG CACCGCAAAG CCTTCTGGCG CTCGGGCGCG AGGTGGTGCG CCACCCGGTC
CGACAGATAC GGCTCTGGAA GATCCTCGCC CGCAGCCACC GCCACGAGAT CGGGCGGGGG
CTGCGCAGCT TCGCGGTGCT GCCGAAGTCG GTGGCGCTGG GGCACTGGTG CCGGTCGCAG
GGGATCGACC ATATCCATGC CGAGTTCGCG GGCTTCCCCG CGACGGTCGC CATGATCGCC
GCGCGGGTGT CGGGTGTGCC GTTCAGCTTC TCGGCGCATG CCAACGACAT CTTCGTGTCG
CAGGCGCTCT TGCCTGAGAA GGCGGCCGAG GCGCGTTTCG TGCGCGCCAT CAGCCGCTAC
AACATCGACT GGCTGGGCCG CCTGCCGAGC TTTCCGGCCG ACCGGCTGCG GCTGATCCAT
TGCGGCGTTC CGCGCGCGCT GCTCGAAGCA CCCGAGCCCG ACCCGCCGGG CTCGGGCCCG
CTCAATGTGC TCTACGTCGG CTCGCTGATC GAGAAGAAGG GGGTCTTCCA TCTTCTCGAA
GCCCTCGCGC AACTGCGGGA CCGGATGCCC CTGCGCTGCC GGATCATTGG CCGCGGCGAT
CTGGAAGGTG CGATGCGGCA AGCGGCGGAG CGGCTCGGTC TGAACGGGAT CGTCACCTTC
GACGGACCAA AGGATGCGGA GGAGGTGCGC GCCGCCTACG GCTGGGCTCA TGCCGTGGTG
GTGCCCTCGG TCGTGGGCGC CGGGGGGCGG GTGGAAGGCA TCCCCGTCGT GGCGATGGAG
GCGCTGGCTC ACGCCCGGCC GCTGGTCGCC TCGCGCCTGT CGGGGATCCC GGAGCTGGTC
GAGGATGGTG TGACCGGCTG GCTTACGGAA CCCGGGGATG CTGCCGGGAT CGCCCGGGCG
CTGGCGGCCA TCCGGGAGGA CTGGCCACGG GCCGTCACTC TTGCGCGCGC CGGGCGCGAC
CGCGTGCGCG CCGAATATCT GATCGAGGAC AATGCCGCAG AGCTTCTGCG CGCGATCGAG
GAGGCCGCAT GA
 
Protein sequence
MKILVVVSEF PKLTETFVYR NIAEYRRAGH GVRLFYAKKH FPQELVHGFA RETADSAFTF 
GFLAPQSLLA LGREVVRHPV RQIRLWKILA RSHRHEIGRG LRSFAVLPKS VALGHWCRSQ
GIDHIHAEFA GFPATVAMIA ARVSGVPFSF SAHANDIFVS QALLPEKAAE ARFVRAISRY
NIDWLGRLPS FPADRLRLIH CGVPRALLEA PEPDPPGSGP LNVLYVGSLI EKKGVFHLLE
ALAQLRDRMP LRCRIIGRGD LEGAMRQAAE RLGLNGIVTF DGPKDAEEVR AAYGWAHAVV
VPSVVGAGGR VEGIPVVAME ALAHARPLVA SRLSGIPELV EDGVTGWLTE PGDAAGIARA
LAAIREDWPR AVTLARAGRD RVRAEYLIED NAAELLRAIE EAA