Gene RSP_1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_1872 
Symbol 
ID3719140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp469645 
End bp471459 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content72% 
IMG OID640070032 
Productglycosyl transferase family protein 
Protein accessionYP_351923 
Protein GI77462419 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTGA TGCTGCTGCG CGAAGGGCAT CTCGCGCCGC ACCGGATCAT GGCGGCCCTC 
AGTCACGGCG GGCGGCCGTC CGCACCCCTC GCCGATCTGC TGCTCGCCGA AGGCGCCCTG
TCCGAGGACG AGATCCTCGC CATGATGGCG CGGCGGAGCG GGCTGCCGGT GCTCGACCCC
GCGGCCGAGC GGCCCGATCC CCGGCTCATC GACCGGCTGG GGGTGCGGGA CTGTCTGCGC
GAGGGCCTCC TGCCCCTCCG CGACACGGGC AGCGCCGTCC TGCTGGCGGC GGCGGCCCCC
GAGAGCTTCC GCCGCCACCG GCCGCGGCTC GAGGAGCTGT TCGGCACCGT GATCCCCGCC
CTCGCCAGCC GCTCGTCCAT CGAGGACGCG CTGCAGGAGC TGCGCGCGGA CGCCATCGGA
GCCGCGGCCG AACTTCGGGT CGCACCGGAG GAAAGCTGCC GCGACTGGCG CACGGGGCGG
ATGACTCGGC TCGCGGCGCT GGCGGGCCTC GCCCTCGCCG CGGGCCTCGC CTTGGCACCG
GGCCTTGTGC TGCTCGCCCT GACCGCCTGG GCGCTTCTGG CGCTAGCCTG CGGCACAGCG
CTGCGGCTGG CAACCGCGGT GGCGAGCCTG CGCCGCCCTC CGCCCGAGCC CGAAAGCCCG
CCGCTCCTGC ATCTGCCGAT GGTCTCGATC ATCGTGGCGC TCTATCGCGA AGAGGATATC
GCGGGCCGTC TCGTGGCGCG CCTCGGCCGC CTCGACTATC CCCACGACCG GCTCGAGATC
CTGCTTGTGG TGGAAGAGGC CGACCGACGG ACACGGCGGG CGCTGCTCGA GGCGCGCCTG
CCGCCCTGGA TGCGGATCGT GGTCTCGCCC AAAGGCGCGA TCCGCACCAA GCCGCGGGCG
CTCAACGTGG CGCTCGACCA TTGCCGGGGC TCCATCGTGG GCGTCTACGA CGCCGAGGAC
GCGCCCGAGC CCGACCAGAT CCGCCGCGTG GTCGAGGGCT TCAGCCGGCG CGGCTCGCAC
GTCGCCTGCC TGCAGGGACG GCTCGACTAT TACAACCCGC GCACCAACTG GCTGTCGCGC
TGCTTCACCA TCGAATATGC GGCCTGGTTC CGGCTGATGC TGCCGGGGCT CGACCGGCTG
GGGCTCGTGG TCCCGCTCGG AGGCACCACC CTCTTCTTCC GCCGCGCGGC GCTCGAGGAG
CTGGGCGCCT GGGACGCGCA TAACGTGACC GAGGATGCGG ATCTCGGCAT CCGCCTCGCG
CGGCACGGCT ACCGCACCGA CCTCATCGAC ACGGTGACGG CCGAGGAAGC CAACTGCCGC
GCCATCCCCT GGATCAAGCA GAGATCGCGC TGGATCAAGG GCTTCATGAT GACATGGGCC
GTCCATATGC GCGCGCCGCG GCTGCTCTGG CGGCAACTCG GCCCCTGGCG CTTTGCAGGC
TTCCAGGTGA TGTTCCTCGG CTCGATCTCG CAGACCCTGC TCGCGCCGGT GCTCTGGTCC
TTCTGGCTGC TGGCGCTCGG CCTGCCGCAT CCGGTGGCGC CGCTCGTGCC CGAGCCGCTG
CTCTGGTCGA TGATCGGGCT TCTCATCGGA TCGGAGGGCA CCGCCATTGC CATGGGCATC
CTCGCCCTGC GGCAGACCCG GCACCGCCTG AACCCGCTCT GGGTGCCGAC CCTGCATCTC
TACAACCCGC TCGCCACCTT CGCGGCCTAC AAGGCGCTGT GGGAGCTCCT GCGCGCGCCC
TTCTACTGGG ACAAGACCTG CCACGGGGTC TTCGACGCCC AGGCCCGCGG CCGCCCTCTC
CTGCAGCCCG CCTGA
 
Protein sequence
MGVMLLREGH LAPHRIMAAL SHGGRPSAPL ADLLLAEGAL SEDEILAMMA RRSGLPVLDP 
AAERPDPRLI DRLGVRDCLR EGLLPLRDTG SAVLLAAAAP ESFRRHRPRL EELFGTVIPA
LASRSSIEDA LQELRADAIG AAAELRVAPE ESCRDWRTGR MTRLAALAGL ALAAGLALAP
GLVLLALTAW ALLALACGTA LRLATAVASL RRPPPEPESP PLLHLPMVSI IVALYREEDI
AGRLVARLGR LDYPHDRLEI LLVVEEADRR TRRALLEARL PPWMRIVVSP KGAIRTKPRA
LNVALDHCRG SIVGVYDAED APEPDQIRRV VEGFSRRGSH VACLQGRLDY YNPRTNWLSR
CFTIEYAAWF RLMLPGLDRL GLVVPLGGTT LFFRRAALEE LGAWDAHNVT EDADLGIRLA
RHGYRTDLID TVTAEEANCR AIPWIKQRSR WIKGFMMTWA VHMRAPRLLW RQLGPWRFAG
FQVMFLGSIS QTLLAPVLWS FWLLALGLPH PVAPLVPEPL LWSMIGLLIG SEGTAIAMGI
LALRQTRHRL NPLWVPTLHL YNPLATFAAY KALWELLRAP FYWDKTCHGV FDAQARGRPL
LQPA