Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4237 |
Symbol | |
ID | 3912045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4812559 |
End bp | 4813644 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637886140 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_487839 |
Protein GI | 86751343 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.905319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTCA TGCTGGCGAT CTTCAAACTC GACAGACTGG GCGGCAAGGA ACGCGATTGC ATGGCGATCG CGCGGCACCT TGCGGCGCGT GGTCACGACG TCACCGTGCT GACGACGTCG GCCGACGTCG CGGCCATCGA CGATCTGCGG ATCGAGAGCC TGCGCGCCCG CGGGCTCGCC AATCACGTGC TGCTGCGCAA TTTTGCGCGC GACGTGATCG ACCGCAGGCA GCGCGAGCGG CCCGATGCGC TGCTGTCGTT CGAGCGAATC CCCGACGCCG ATTATCACTA CGTCGCCGAC GGCGCCGCGA TCCTGCGCGC CTGGCAGCTG CTGGCGTGGC CGCCGCGCCG CCGCGCCAAG CTGGCGCTGG AGCGCGCGGT GTTCGCGGCG CCGGCCGCGA CCCGGTTGTT CTTCCTCACC GAGCGCCAGC GCGACGAATA CATCATCGCC TATGATTTCG AGCCGGCCCG CGCCAGCGTG CTGCCGATGG TGCTGCACGA CGACCGCTAC GCCGCCGCGC GCAAGCTCGG CGCCTCCCGC TGGCGCAGCG AGCTCGGCAT TCCCGGCGAC GCGCTGATGG CGGTGTCGGT CGCGGTCGAT CCGAAGCTCA AGGGCGTCGA TCGTAGCCTC GCCGCGCTGG CGTCCTATCC GAAGCTTCAC CTCGTCGTCG CCGGTTCGGA TTCGCCGTGG CTGCATCGCG GCGTGGTGCG GCGCGATCTC GAGCGGCGCG TGCACATCGT GCCTTACGTC GCCGAGGTGA TGGAGCTGAT CGCGGCGGCC GATTTCATGC TGCACCCCGC GCGCTCCGAA GCGGCCGGGC AAGTGATCGG CGAGGCGCTG CTCGCCGGCG TGCCGGTGCT CGCCTCGGCC GCCTGCGGCT ATGCCGGCGA GATCGAGCGC AGCGGCGCCG GCCTGGTGCT GCCGGAGCCG TTCCAGCAGG AGGCGCTGGT CGCCGGCATC GCCGCGATGA TCGACGCACT GCCGGCGATG CGCAAGCAAG CGGCGGCGCG CGCGAAGAGC CTGCAGCAGC AGCGCGGCGC GTGGCTGTTG GCGATCGCCG AACGGATCGA ACAGCGCGAC GTCTGA
|
Protein sequence | MKVMLAIFKL DRLGGKERDC MAIARHLAAR GHDVTVLTTS ADVAAIDDLR IESLRARGLA NHVLLRNFAR DVIDRRQRER PDALLSFERI PDADYHYVAD GAAILRAWQL LAWPPRRRAK LALERAVFAA PAATRLFFLT ERQRDEYIIA YDFEPARASV LPMVLHDDRY AAARKLGASR WRSELGIPGD ALMAVSVAVD PKLKGVDRSL AALASYPKLH LVVAGSDSPW LHRGVVRRDL ERRVHIVPYV AEVMELIAAA DFMLHPARSE AAGQVIGEAL LAGVPVLASA ACGYAGEIER SGAGLVLPEP FQQEALVAGI AAMIDALPAM RKQAAARAKS LQQQRGAWLL AIAERIEQRD V
|
| |