Gene Rsph17029_0190 details

Gene Information

Locus tag: Rsph17029_0190
Symbol:
ID: 4896593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 208585
End bp: 209592
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 69%
IMG OID: 640110773
Product: glycosyl transferase family protein
Protein accession: YP_001042081
Protein GI: 126460967
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
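
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(208585, 209592)
print(length)                  # 1008
print(protein_length(length))  # 335
```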


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.801609
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAACCG CAGTCATCAT CCCCTTCTAT CAGCGGGAGG CGGGGATCCT CTCTCGCGCG 
CTCGACTCGG TCGATGGGCA GATCCTTCCC GAAGGCCATA GCCTCACCGT CTTCGTCATC
GACGACGAGT CCCCCGTGCC CGCCCGGTCC GAGGTCGAGG GGCGGCAGGG CAAGGTTCCC
GTGCGGCTGA TCGCCCAGAA GAACGGCGGC CCCGGTGCCG CGCGGAACGC GGGGCTCGAT
GCCGTGGCGG CGGAGGGCTT CGACCATGTG GCCTTCCTCG ATTCCGACGA CATCTGGCAA
CCGACCCATC TCGCGGATGC GCTCGATCTG CTCGCGCGGG GCTACGACTT CCATTTCTGC
GACCACCAGC GCACCGACGA CGACATCACC TATTTCGAGC GCACCCCCGC CCTGCGCCGG
ATGCGCGAGG AGCGGCACGC GGGCGTCACC GTGCTCGATG CCGAGGCACC GATCCTCGCC
TTCGACCAGC CCTCGATCAT GGCGGCGTCG GTCGATACCT ACCTCAGCCA GACCTCGACG
GTCGTGGTGC GGCAGAGCTT CGTCGAGACG CTGCGCTTCG ACCCGCGGCT GCGGAACGCC
GGCGAGGACC AGCTCTTCTG GCTGTCGCTG ATCGCGGCCG GGGCGCGCAC CGTCGTTTCG
TGGAAGATGA ACGTGCTCTG CGGCCGGGGC GTGAACGTCT ATTTCGACGC GTTCGACTGG
AAATCCACCA AGGTGGTGGA CCGCACGGGC TACATGCTGA TGTTCTTCCA CACGGTCGGC
CGGCGGCTCT CGCTGACGGC GTCCGACCGC CGGACGGTGG CCGACCGCAT CCGCCGCTAC
CGCCGCGCCT ACAGCTACCT CTTCCTGCGC GCGCTCCTGC AGGGCCGGGT GCCGACGCTC
TCGCTCACCT GGAAGCTCGC GGCGCTGGAC CCGGGGCTCG TGCCCGCCAT GCCGCTGCGG
TTCCTGGCGG TGCTGCCCAA CCGCGAGGCC GAGAGCCAGC AGTGGTAG
 
Protein sequence
MRTAVIIPFY QREAGILSRA LDSVDGQILP EGHSLTVFVI DDESPVPARS EVEGRQGKVP 
VRLIAQKNGG PGAARNAGLD AVAAEGFDHV AFLDSDDIWQ PTHLADALDL LARGYDFHFC
DHQRTDDDIT YFERTPALRR MREERHAGVT VLDAEAPILA FDQPSIMAAS VDTYLSQTST
VVVRQSFVET LRFDPRLRNA GEDQLFWLSL IAAGARTVVS WKMNVLCGRG VNVYFDAFDW
KSTKVVDRTG YMLMFFHTVG RRLSLTASDR RTVADRIRRY RRAYSYLFLR ALLQGRVPTL
SLTWKLAALD PGLVPAMPLR FLAVLPNREA ESQQW
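
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal Python sketch of that cleanup together with a GC percentage calculation. The helper names are hypothetical, and the short input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, as an illustration only.
fragment = clean_sequence("ATGAGAACCG CAGTCATCAT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 45.0
```

Passing the full gene sequence through the same two functions gives a value to compare against the GC content listed in the gene information.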