Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1978 |
Symbol | |
ID | 4895183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2094120 |
End bp | 2096486 |
Gene Length | 2367 bp |
Protein Length | 788 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640112572 |
Product | cellulose synthase (UDP-forming) |
Protein accession | YP_001043854 |
Protein GI | 126462740 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03030] cellulose synthase catalytic subunit (UDP-forming) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.693808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTC GAGCCAAGGC CCGCTCCCCG ATAAGGGTCG TTCCCGTCCT GCTGTTCCTG CTGTGGGTGG CTCTCCTCGT GCCGTTCGGG CTGCTGGCCG CCGCGCCGGT CGCGCCCTCG GCGCAGGGCC TCATCGCTTT GTCGGCGGTG GTGCTGGTGG CGCTGCTCAA GCCCTTCGCC GACAAGATGG TGCCGCGCTT CCTGCTTCTG TCCGCGGCCT CGATGCTGGT GATGCGCTAC TGGTTCTGGC GCCTGTTCGA AACGCTGCCG CCGCCCGCGC TCGACGCCTC GTTCCTCTTC GCTCTGCTGC TCTTCGCGGT CGAGACCTTC TCGATCTCCA TCTTCTTCCT CAACGGCTTT CTCAGCGCCG ACCCGACCGA CCGGCCCTTC CCGCGGCCGC TGCAGCCCGA GGAGCTGCCG ACGGTCGACA TTCTCGTGCC CTCCTACAAC GAGCCTGCCG ACATGCTGAG CGTGACGCTC GCGGCGGCCA AGAACATGAT CTATCCGGCG CGGCTGCGCA CGGTGGTGCT CTGCGACGAC GGGGGCACCG ACCAGCGCTG CATGTCGCCC GACCCGGAGC TTGCGCAGAA GGCGCAGGAG CGGCGGCGCG AGTTGCAGCA GCTCTGCCGC GAGCTGGGCG TGGTCTATTC GACGCGCGAG CGGAACGAAC ATGCCAAGGC CGGCAACATG TCGGCCGCGC TCGAGCGGCT GAAGGGCGAG CTCGTGGTGG TGTTCGATGC CGACCACGTC CCGAGCCGCG ACTTCCTGGC CCGGACGGTG GGCTATTTCG TCGAGGATCC CGACCTCTTC CTCGTCCAGA CGCCGCACTT CTTCATCAAC CCCGACCCGA TCCAGCGCAA CCTCGCGCTC GGCGATCGCT GCCCGCCCGA GAACGAGATG TTCTACGGCA AGATCCACCG CGGCCTCGAC CGCTGGGGCG GCGCCTTCTT CTGCGGGTCG GCCGCGGTCC TGCGCCGCCG CGCGCTGGAC GAGGCGGGCG GCTTTGCCGG CGAGACCATT ACCGAGGATG CCGAGACCGC GCTCGAGATC CATTCCCGCG GCTGGAAGAG CCTCTATATC GACCGCGCCA TGATCGCGGG GCTCCAGCCC GAGACCTTCG CCTCCTTCAT CCAGCAGCGC GGCCGCTGGG CCACCGGCAT GATGCAGATG CTGCTGCTGA AGAACCCGCT CTTCCGCCGC GGCCTCGGGA TTGCGCAACG CCTGTGCTAC CTCAACTCGA TGAGCTTCTG GTTCTTCCCG CTGGTGCGGA TGATGTTCCT CGTGGCGCCG CTCATCTATC TGTTCTTCGG CATCGAGATC TTCGTCGCCA CCTTCGAGGA GGTGCTGGCC TACATGCCGG GCTATCTGGC GGTGAGCTTC CTCGTGCAGA ACGCGCTGTT TGCGCGGCAG CGATGGCCGC TCGTCTCCGA AGTCTACGAG GTGGCACAGG CGCCCTATCT GGCGCGCGCC ATCGTGACCA CGCTGCTGCG GCCGCGCAGT GCCCGCTTTG CGGTGACCGC GAAGGACGAG ACGCTGAGCG AGAATTACAT TTCCCCCATC TACCGTCCGC TCCTCTTCAC CTTCCTGCTC TGCCTGTCCG GGGTGCTCGC CACGCTGGTG CGCTGGGTGG CCTTCCCCGG CGACCGGTCG GTCCTCCTCG TCGTGGGCGG CTGGGCGGTG CTCAACGTGC TTCTCGTGGG CTTCGCTTTG CGGGCGGTGG CCGAGAAGCA GCAGCGGCGC GCGGCCCCCC GTGTGCAGAT GGAGGTGCCG GCCGAGGCGC AGATCCCCGC CTTCGGCAAC CGCCCGCTGA CCGCGACCGT GCTCGACGCC TCGACCAGCG GCGTGCGCCT TCTGGTCCGG CTGCCCGGCG TGGGCGATCC GCACCCGGCG CTCGAGGCGG GGGGGCTCAT CCAGTTCCAG CCGAAGTTCC CCGACGCGCC GCAGCTCGAG CGCATGGTGC GCGGCCGCAT CCGCTCGGCG CGCCGCGAGG GCGGAACGGT GATGGTGGGC GTGATCTTCG AGGCGGGCCA ACCGATCGCT GTGCGCGAGA CGGTGGCCTA TCTCATCTTC GGCGAGAGCG CGCACTGGCG CACGATGCGC GAGGCCACGA TGCGGCCCAT CGGGCTCCTG CACGGGATGG CGCGGATCCT GTGGATGGCG GCCGCCAGCC TGCCCAAGAC CGCGCGCGAC TTCATGGACG AACCGGCCCG CCGCCGGCGC CGCCACGAGG AACCGAAGGA GAAGCAGGCG CATCTTCTGG CCTTCGGCAC CGACTTCAGC ACCGAACCCG ACTGGGCGGG CGAGCTGCTC GATCCGACGG CGCAGGTCTC CGCGCGTCCC AACACGGTCG CCTGGGGGTC GAACTGA
|
Protein sequence | MTVRAKARSP IRVVPVLLFL LWVALLVPFG LLAAAPVAPS AQGLIALSAV VLVALLKPFA DKMVPRFLLL SAASMLVMRY WFWRLFETLP PPALDASFLF ALLLFAVETF SISIFFLNGF LSADPTDRPF PRPLQPEELP TVDILVPSYN EPADMLSVTL AAAKNMIYPA RLRTVVLCDD GGTDQRCMSP DPELAQKAQE RRRELQQLCR ELGVVYSTRE RNEHAKAGNM SAALERLKGE LVVVFDADHV PSRDFLARTV GYFVEDPDLF LVQTPHFFIN PDPIQRNLAL GDRCPPENEM FYGKIHRGLD RWGGAFFCGS AAVLRRRALD EAGGFAGETI TEDAETALEI HSRGWKSLYI DRAMIAGLQP ETFASFIQQR GRWATGMMQM LLLKNPLFRR GLGIAQRLCY LNSMSFWFFP LVRMMFLVAP LIYLFFGIEI FVATFEEVLA YMPGYLAVSF LVQNALFARQ RWPLVSEVYE VAQAPYLARA IVTTLLRPRS ARFAVTAKDE TLSENYISPI YRPLLFTFLL CLSGVLATLV RWVAFPGDRS VLLVVGGWAV LNVLLVGFAL RAVAEKQQRR AAPRVQMEVP AEAQIPAFGN RPLTATVLDA STSGVRLLVR LPGVGDPHPA LEAGGLIQFQ PKFPDAPQLE RMVRGRIRSA RREGGTVMVG VIFEAGQPIA VRETVAYLIF GESAHWRTMR EATMRPIGLL HGMARILWMA AASLPKTARD FMDEPARRRR RHEEPKEKQA HLLAFGTDFS TEPDWAGELL DPTAQVSARP NTVAWGSN
|
| |