Gene Rsph17029_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1764 
Symbol 
ID4896422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1860604 
End bp1862391 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content69% 
IMG OID640112358 
Productglucosyltransferase MdoH 
Protein accessionYP_001043646 
Protein GI126462532 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.834244 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCGG AGCGGCGCCG TCGCGCCGTC ACGCTGGCCT CGCGTCTCGT TGCGGCGGCG 
ATCAGCCTCA CCGCAGCAGC CGGGGCCTTC TTTCTTTTTC TGCAGTTCGG CTCGACCGAC
GGGCTCGATT CGATGGACAT CACGCGGAGC GTGCTGATCC TCGTCTCGAC CTCGTGGCTC
GGGTGGGGTG CCGCCCATGC GGTGCTCGGC CTCTTCTCGC GCCCGCAGCG GCCCGCGAAC
GTGTCGCCCG ACGCGCCGAT CTCGACGCGT ACGGTCATTC TCGTCCCCGT CTATAACGAG
GATCCGGTCG CGACCTTCTC GCGCATCGCG GCGATGGACG CCTCGCTCGC CGCCACGCCC
TGGCGCGACC TCTTCCATTT CGCGATCCTC TCGGACACAA GGGACGAGGC CATCGCCGCG
CGCGAGCGGT TCTGGTTCCT GCGCCTTCTC CGCGAGCGCG ATGCCGAGGG CCGCATCTTC
TACCGCCGCC GCGCGGTGAA CCGCGGCCGC AAGGCGGGCA ATATCGAGGA TTTCATCCAG
AAGTCCGGCT CCGCCTATCC GTTCGCCGTG ATCCTCGATG CCGACAGCCT GATGGAAGGC
GAAACGCTGG TCGACATGGT GCGCCGGATG GAGGCCGAGC CGCGTCTCGG GCTGCTCCAG
ACGCTGCCGG TGGTGACGAA GGCGCGCGCC CGCTTCGGGC GGTCAATGCA GTTCTCGGCC
GCGCTCCATG CGCCGGTCTT CGCGCGCGGG CTGGCGATGA TGCAGGGCCG CACCGGCCCG
TTCTGGGGCC ACAATGCCAT CGTGCGGGTG CAGGCTTTCG CGGAAAGCTG CGGCCTGCCC
GAGCTGTCGG GTCCGCCCCC CTTCGGCGGC CATGTCATGA GCCACGATTA TGTCGAGGCG
GCGCTCCTTG CACGCGCAGG CTGGATCGTC CGGTTCGACG ACGACATCCG CGGCTCCTAC
GAGGAAGGCC CGGAAAATCT GGTGGACCAT GCGAAGCGCG ACCGGCGCTG GTGTCAGGGC
AACCTCCAGC ACGGGCGCAT CCTGTTCGCG CCCGGCCTCT GCGGCTGGAA CCGCTTCGTG
TTCCTGCAGG GCATCATGGC CTATATCGCG CCACTCTTCT GGCTGGGCTT CATCATGGCC
TCGATCGCGG CGCCCTTCTT CGCCCCGCCG CTCGATTATT TCCCGGTGCC CTACTGGCCC
TTCCCGGTCT TCCCCTCGGA CGAGACCTGG AAAGCCATCG GCCTCGCCGT GGGCATCTTC
GGCCTCCTGC TGCTGCCCAA GCTCATGATC GCCATCGAAG CCATCGTGAC CGGCCGGGCG
GCGGGCTTCG GCGGCGCCGG GCGCGTCCTC ATCTCGACGC TGGCCGAGCT TGTCTTCTCC
AGCATCATCG CGCCGATCCT CATGGCCTTT CAGACCCGTT CGGTGCTGCA GGTGCTGCTC
GGCCGCGATG GCGGCTGGCC CACCAACAAT CGCGGCGACG GCAGCCTCTC CGTGGCGCAG
GCCTGGTCGG CCAGCCACTG GATCGTGACC TGGGGCCTTA TCGGCATCGG CGCCACCTAT
TACTTCGCTC CGGGCCTCGT GCCCTGGCTG CTTCCGGTGG CCCTTCCCAT GATCTTCTCG
CCGCTCGTGA TCGCGGTCAC CTCGAAGCGC AGCCGCTCGG CGCTCTTCAC CATGCCGCTC
GAAGTCGCGC CGACGCCGGT GCTTCTGGCG CATGACGCAA TCCTCGCCGA CTGGGAGCGC
AGCCCCGCGC CCGAGGCTGT TCCGGCGCTG GCGGTGAGCC ATGCCTGA
 
Protein sequence
MPAERRRRAV TLASRLVAAA ISLTAAAGAF FLFLQFGSTD GLDSMDITRS VLILVSTSWL 
GWGAAHAVLG LFSRPQRPAN VSPDAPISTR TVILVPVYNE DPVATFSRIA AMDASLAATP
WRDLFHFAIL SDTRDEAIAA RERFWFLRLL RERDAEGRIF YRRRAVNRGR KAGNIEDFIQ
KSGSAYPFAV ILDADSLMEG ETLVDMVRRM EAEPRLGLLQ TLPVVTKARA RFGRSMQFSA
ALHAPVFARG LAMMQGRTGP FWGHNAIVRV QAFAESCGLP ELSGPPPFGG HVMSHDYVEA
ALLARAGWIV RFDDDIRGSY EEGPENLVDH AKRDRRWCQG NLQHGRILFA PGLCGWNRFV
FLQGIMAYIA PLFWLGFIMA SIAAPFFAPP LDYFPVPYWP FPVFPSDETW KAIGLAVGIF
GLLLLPKLMI AIEAIVTGRA AGFGGAGRVL ISTLAELVFS SIIAPILMAF QTRSVLQVLL
GRDGGWPTNN RGDGSLSVAQ AWSASHWIVT WGLIGIGATY YFAPGLVPWL LPVALPMIFS
PLVIAVTSKR SRSALFTMPL EVAPTPVLLA HDAILADWER SPAPEAVPAL AVSHA