Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2075 |
Symbol | |
ID | 3909890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2357683 |
End bp | 2358594 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637883967 |
Product | glycosyl transferase family protein |
Protein accession | YP_485692 |
Protein GI | 86749196 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00350475 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATGGCGA TGTCGCAGCC GCCGGCGATC TCGATCATCA TCCCGACCCG CGACAAGCCG GAGCGGTTGG TGCTGATGCT GTATGCACTG CTGTGCCAGC GCAGCGGCAA TGCACGCCGC ATCGAGACCA TCCTGGTCGA TGACGGCAGC GCCGCGCCGA TCGAACCCTT GCTGGCGCCG CTGCGCGCGC AGGGTCTCGA GGTCGAATTG ATCCGGACCG CAGGCATCGG CCAGGCGGCG GCGCGCAATC GCGGCGCGGC GGCGGCGCGC GGCGAGTTGC TGCTGTTCGT CGACGACGAC GTGCTGCTGT CGCCGGACTA TGTCGCGCGT TGCGTGACGT TGTGCGGCGG CCGTCCCGAT CGCGTCGTCC GCGCGCCGGT GTATCAGCTG CGCTATCTCG CGGCGTTTCG CGATCCCGAG CGCGGCCTGC GCTACGACGG CCGCGCGGCG GACGCGCGGC TTTTCGGCGA GCGCATCTCC CGCGCGATGA TCACCGATGA CTGGCCCGCC ATCACGCGAA AATGCCGGCA TCGCAATCGC TTCGAGCGGC TGGTGTCGGC GGCGCTGGCG CAGCGGCCGC CGCGGTTTCC ATGGCTCGGC TATTCCGGCT CCGGCGTCGC GCTGTCGCGC GCGCTGTTTA TGCAGAGCGG CGGTTATGAC GAGGCATTCG GCCTGCGCTG GGGCGCCGAG GCGATCGAGC TCGGCTACCG GCTGTGGCGC GGCGGCGCGC GCTTCGTCGA GGCCGACGGG ATCTACTCGG CGCATCTGGA TCATCCGCGT GGCGCCTCGC TGTCGAGCTT CGACGACAGT TTCGATCTGT TTTTCGCCAA GCACCGCGAC CCCGCGATCC GCGAGGTGCA GGCGCTGATC CTCGCCGACG ACCGGCGCCC GGCGCTGTCA GCGGCGGGCT GA
|
Protein sequence | MMAMSQPPAI SIIIPTRDKP ERLVLMLYAL LCQRSGNARR IETILVDDGS AAPIEPLLAP LRAQGLEVEL IRTAGIGQAA ARNRGAAAAR GELLLFVDDD VLLSPDYVAR CVTLCGGRPD RVVRAPVYQL RYLAAFRDPE RGLRYDGRAA DARLFGERIS RAMITDDWPA ITRKCRHRNR FERLVSAALA QRPPRFPWLG YSGSGVALSR ALFMQSGGYD EAFGLRWGAE AIELGYRLWR GGARFVEADG IYSAHLDHPR GASLSSFDDS FDLFFAKHRD PAIREVQALI LADDRRPALS AAG
|
| |