Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_2627 |
Symbol | |
ID | 5085051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 2669273 |
End bp | 2671159 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640484190 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001168819 |
Protein GI | 146278660 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGAAAC TCCTGTTCGG TCTGGTGGAC CGACTGACCC GAGGTCAGAA GCGCGCCGCG ATGCTTCTGT CGGATCTTCT GGTGGCGCCG CTCGCGCTGC TTCTGACCGG GATCTTCATC CGGGGTCAGG CCTCCGCATT CGAGTGGGTG CTCTTCCCGG CCTCCGTGCT GATCGCGGGC GGGATTTCCA TGCTGCTCGG GATGCCGCGG ATCAAGCTCA ATGCCTATGA GTCGGTGGCG ATCCTGAAGA CCGCCGCCTT CGCCTCGATC CTGACCATGG CGCTGGCGCT GGTGGCCACG CTTTCGGGTG TGGATGTTCC TGCGGCGGCA GCCATCCTCC ATGGCTTGCT GTTCTTCATC CTTGCCGTGG GCACCCGGAT GGCGATGCTG CACGCCCTGC TCTGGGTGCT TCAGGTCGAG CAGAAGGGGT GCCGGGTCAT CATCTATGGC GCGGGCAACA CGGGCACGCA GCTTGCGGCC GCGCTCCGGT CGAGGGGGGC CATCCGTCCC ATCGCCTTCG TCGATGACAA CCGGACGCTT CACGGCATGA TCATCGGCGG CCTCAAGGTC CATCCGGCCG AGCGGCTCGA GCGGCTGGTG CGCGAGCGCG ACGTCACGCG TGTCCTCCTG GCGATGCCAT CGGAGTCGCC TGCCCGCCTT GCGAAGATCG TCCAGCGGCT TCAGGCGGTG GGTCTCGACG TCCAGACGGT TCCGTCCTTC GCGCAACTCG TGGGCGAGGA GAAGCTTGTC GACAACCTCT CTCCCTTCAC CTTCGGGCGG TTCCTGGGGC GTCAGCAGAT CGAGGATGCG CTGCCGCAGG GCGCCGAAGC CTATGTCGGG CGCTCGGTCC TTGTCTCCGG CGCGGGTGGC TCGGTCGGGT CGGAACTTTG CCGTCAGCTT CTGCTGATCC GCCCCCGGTG CATCGTGCTG TTCGAGATCA GCGAGATCGC GCTCTACAAC ATCGACCGCG AACTGCGAGA GATGGCTGAG GGCACCGGAG TGGAGATCGT GCCCGTCCTC GGCTCGGTGA CGGATTCGCG CCTCGCGCGC ATGGTGATGC AGGATCACGG GATCGAGGTC GTCTTCCATG CGGCCGCCTA CAAGCATGTT CCGCTGGTCG AGCACAATCC GATTGCCGGT CTGGCCAACA ACGTGCTTGG CACCCGCACG CTCGCCGATG CGGCACATGA GGCCAGGGTG GCGCGCTTCA TCCTGATCTC GACCGACAAG GCGGTGCGTC CCACCAATGT CATGGGCGCC TCCAAGCGTC TGGCCGAACT GGTGATCCAG GATCTGGCCA AGAGGTCGAA GGGCACGATC TTCTCGATGG TGCGTTTCGG GAACGTGCTT GGATCCTCGG GCTCCGTCAT CCCGCTCTTC AAGGAGCAGA TCGCGCGCGG CGGCCCCGTG ACGCTCACAC ACGAGGATGT CACCCGCTTC TTCATGACGA TTTCCGAGGC GGCGCGTCTG GTGCTGCTGG CGGGATCCTT CGCCCATCAG GCTGAAAGCT GCGGGGGCGA CGTCTTCGTG CTCGACATGG GCAAGCCGGT GCGCATCCGC GATCTGGCCG TGCAGATGAT CGAGGCCGCG GGCAAGTCGG TTCGGGATGA GCGCCACCCC TTTGGCGATA TCGAGATCGT GGTGACGGGC CTTCGTCCCG GCGAGAAGCT GCATGAGGAA CTGCTCATCG GCGAGGGACT GCTCACGACG CCGCATTCCA AGATCCTCCG GGCGCAGGAG GACAGCCTCT CCGAACTCGA GATGGCGACG GCCCTTCGGG CGCTGCGAAG CGCAATGGCG ACCGGCGATG CAGAGGCCGC GCGCCGGGTG ATTCTCTCGT GGGTTGAGGG CTACAGACTC CCCGAGGTTG TGGCGGCAAA GCGATAG
|
Protein sequence | MKKLLFGLVD RLTRGQKRAA MLLSDLLVAP LALLLTGIFI RGQASAFEWV LFPASVLIAG GISMLLGMPR IKLNAYESVA ILKTAAFASI LTMALALVAT LSGVDVPAAA AILHGLLFFI LAVGTRMAML HALLWVLQVE QKGCRVIIYG AGNTGTQLAA ALRSRGAIRP IAFVDDNRTL HGMIIGGLKV HPAERLERLV RERDVTRVLL AMPSESPARL AKIVQRLQAV GLDVQTVPSF AQLVGEEKLV DNLSPFTFGR FLGRQQIEDA LPQGAEAYVG RSVLVSGAGG SVGSELCRQL LLIRPRCIVL FEISEIALYN IDRELREMAE GTGVEIVPVL GSVTDSRLAR MVMQDHGIEV VFHAAAYKHV PLVEHNPIAG LANNVLGTRT LADAAHEARV ARFILISTDK AVRPTNVMGA SKRLAELVIQ DLAKRSKGTI FSMVRFGNVL GSSGSVIPLF KEQIARGGPV TLTHEDVTRF FMTISEAARL VLLAGSFAHQ AESCGGDVFV LDMGKPVRIR DLAVQMIEAA GKSVRDERHP FGDIEIVVTG LRPGEKLHEE LLIGEGLLTT PHSKILRAQE DSLSELEMAT ALRALRSAMA TGDAEAARRV ILSWVEGYRL PEVVAAKR
|
| |