Gene Rsph17029_1978 details

Gene Information

Locus tag: Rsph17029_1978
Symbol:
ID: 4895183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2094120
End bp: 2096486
Gene Length: 2367 bp
Protein Length: 788 aa
Translation table: 11
GC content: 69%
IMG OID: 640112572
Product: cellulose synthase (UDP-forming)
Protein accession: YP_001043854
Protein GI: 126462740
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03030] cellulose synthase catalytic subunit (UDP-forming)
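
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. Both are assumptions, since the record does not state its conventions. A minimal sketch of that check, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2094120, 2096486)
print(length_bp, protein_length(length_bp))  # 2367 788
```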


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.693808
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTTC GAGCCAAGGC CCGCTCCCCG ATAAGGGTCG TTCCCGTCCT GCTGTTCCTG 
CTGTGGGTGG CTCTCCTCGT GCCGTTCGGG CTGCTGGCCG CCGCGCCGGT CGCGCCCTCG
GCGCAGGGCC TCATCGCTTT GTCGGCGGTG GTGCTGGTGG CGCTGCTCAA GCCCTTCGCC
GACAAGATGG TGCCGCGCTT CCTGCTTCTG TCCGCGGCCT CGATGCTGGT GATGCGCTAC
TGGTTCTGGC GCCTGTTCGA AACGCTGCCG CCGCCCGCGC TCGACGCCTC GTTCCTCTTC
GCTCTGCTGC TCTTCGCGGT CGAGACCTTC TCGATCTCCA TCTTCTTCCT CAACGGCTTT
CTCAGCGCCG ACCCGACCGA CCGGCCCTTC CCGCGGCCGC TGCAGCCCGA GGAGCTGCCG
ACGGTCGACA TTCTCGTGCC CTCCTACAAC GAGCCTGCCG ACATGCTGAG CGTGACGCTC
GCGGCGGCCA AGAACATGAT CTATCCGGCG CGGCTGCGCA CGGTGGTGCT CTGCGACGAC
GGGGGCACCG ACCAGCGCTG CATGTCGCCC GACCCGGAGC TTGCGCAGAA GGCGCAGGAG
CGGCGGCGCG AGTTGCAGCA GCTCTGCCGC GAGCTGGGCG TGGTCTATTC GACGCGCGAG
CGGAACGAAC ATGCCAAGGC CGGCAACATG TCGGCCGCGC TCGAGCGGCT GAAGGGCGAG
CTCGTGGTGG TGTTCGATGC CGACCACGTC CCGAGCCGCG ACTTCCTGGC CCGGACGGTG
GGCTATTTCG TCGAGGATCC CGACCTCTTC CTCGTCCAGA CGCCGCACTT CTTCATCAAC
CCCGACCCGA TCCAGCGCAA CCTCGCGCTC GGCGATCGCT GCCCGCCCGA GAACGAGATG
TTCTACGGCA AGATCCACCG CGGCCTCGAC CGCTGGGGCG GCGCCTTCTT CTGCGGGTCG
GCCGCGGTCC TGCGCCGCCG CGCGCTGGAC GAGGCGGGCG GCTTTGCCGG CGAGACCATT
ACCGAGGATG CCGAGACCGC GCTCGAGATC CATTCCCGCG GCTGGAAGAG CCTCTATATC
GACCGCGCCA TGATCGCGGG GCTCCAGCCC GAGACCTTCG CCTCCTTCAT CCAGCAGCGC
GGCCGCTGGG CCACCGGCAT GATGCAGATG CTGCTGCTGA AGAACCCGCT CTTCCGCCGC
GGCCTCGGGA TTGCGCAACG CCTGTGCTAC CTCAACTCGA TGAGCTTCTG GTTCTTCCCG
CTGGTGCGGA TGATGTTCCT CGTGGCGCCG CTCATCTATC TGTTCTTCGG CATCGAGATC
TTCGTCGCCA CCTTCGAGGA GGTGCTGGCC TACATGCCGG GCTATCTGGC GGTGAGCTTC
CTCGTGCAGA ACGCGCTGTT TGCGCGGCAG CGATGGCCGC TCGTCTCCGA AGTCTACGAG
GTGGCACAGG CGCCCTATCT GGCGCGCGCC ATCGTGACCA CGCTGCTGCG GCCGCGCAGT
GCCCGCTTTG CGGTGACCGC GAAGGACGAG ACGCTGAGCG AGAATTACAT TTCCCCCATC
TACCGTCCGC TCCTCTTCAC CTTCCTGCTC TGCCTGTCCG GGGTGCTCGC CACGCTGGTG
CGCTGGGTGG CCTTCCCCGG CGACCGGTCG GTCCTCCTCG TCGTGGGCGG CTGGGCGGTG
CTCAACGTGC TTCTCGTGGG CTTCGCTTTG CGGGCGGTGG CCGAGAAGCA GCAGCGGCGC
GCGGCCCCCC GTGTGCAGAT GGAGGTGCCG GCCGAGGCGC AGATCCCCGC CTTCGGCAAC
CGCCCGCTGA CCGCGACCGT GCTCGACGCC TCGACCAGCG GCGTGCGCCT TCTGGTCCGG
CTGCCCGGCG TGGGCGATCC GCACCCGGCG CTCGAGGCGG GGGGGCTCAT CCAGTTCCAG
CCGAAGTTCC CCGACGCGCC GCAGCTCGAG CGCATGGTGC GCGGCCGCAT CCGCTCGGCG
CGCCGCGAGG GCGGAACGGT GATGGTGGGC GTGATCTTCG AGGCGGGCCA ACCGATCGCT
GTGCGCGAGA CGGTGGCCTA TCTCATCTTC GGCGAGAGCG CGCACTGGCG CACGATGCGC
GAGGCCACGA TGCGGCCCAT CGGGCTCCTG CACGGGATGG CGCGGATCCT GTGGATGGCG
GCCGCCAGCC TGCCCAAGAC CGCGCGCGAC TTCATGGACG AACCGGCCCG CCGCCGGCGC
CGCCACGAGG AACCGAAGGA GAAGCAGGCG CATCTTCTGG CCTTCGGCAC CGACTTCAGC
ACCGAACCCG ACTGGGCGGG CGAGCTGCTC GATCCGACGG CGCAGGTCTC CGCGCGTCCC
AACACGGTCG CCTGGGGGTC GAACTGA
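
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content is computed. The sketch below shows one way to do that. The helper names are illustrative, and the example runs on the first two blocks only, not the full gene, so its result is not the record's 69% figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_blocks = "ATGACCGTTC GAGCCAAGGC"  # first 20 bases of the gene sequence
print(gc_fraction(clean_sequence(first_blocks)))  # 0.6
```

Passing the full sequence as one multi-line string works the same way, because `str.split()` with no arguments splits on any whitespace.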
 
Protein sequence
MTVRAKARSP IRVVPVLLFL LWVALLVPFG LLAAAPVAPS AQGLIALSAV VLVALLKPFA 
DKMVPRFLLL SAASMLVMRY WFWRLFETLP PPALDASFLF ALLLFAVETF SISIFFLNGF
LSADPTDRPF PRPLQPEELP TVDILVPSYN EPADMLSVTL AAAKNMIYPA RLRTVVLCDD
GGTDQRCMSP DPELAQKAQE RRRELQQLCR ELGVVYSTRE RNEHAKAGNM SAALERLKGE
LVVVFDADHV PSRDFLARTV GYFVEDPDLF LVQTPHFFIN PDPIQRNLAL GDRCPPENEM
FYGKIHRGLD RWGGAFFCGS AAVLRRRALD EAGGFAGETI TEDAETALEI HSRGWKSLYI
DRAMIAGLQP ETFASFIQQR GRWATGMMQM LLLKNPLFRR GLGIAQRLCY LNSMSFWFFP
LVRMMFLVAP LIYLFFGIEI FVATFEEVLA YMPGYLAVSF LVQNALFARQ RWPLVSEVYE
VAQAPYLARA IVTTLLRPRS ARFAVTAKDE TLSENYISPI YRPLLFTFLL CLSGVLATLV
RWVAFPGDRS VLLVVGGWAV LNVLLVGFAL RAVAEKQQRR AAPRVQMEVP AEAQIPAFGN
RPLTATVLDA STSGVRLLVR LPGVGDPHPA LEAGGLIQFQ PKFPDAPQLE RMVRGRIRSA
RREGGTVMVG VIFEAGQPIA VRETVAYLIF GESAHWRTMR EATMRPIGLL HGMARILWMA
AASLPKTARD FMDEPARRRR RHEEPKEKQA HLLAFGTDFS TEPDWAGELL DPTAQVSARP
NTVAWGSN
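
The protein sequence can be related back to the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so a standard codon table is enough for a gene that begins with ATG. The following is a minimal sketch with illustrative names, not the method used to generate this record. It stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA string up to the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence give the first 7 residues of the protein.
print(translate("ATGACCGTTC GAGCCAAGGC C"))  # MTVRAKA
```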