Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4579 |
Symbol | |
ID | 6412263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4934873 |
End bp | 4936003 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714459 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001993548 |
Protein GI | 192292943 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0140485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACGTCA TTCTGGTCAA TGTCTGGCTC GATGCGGAGC GCGGCGGCGG CACGGCTGAG CGAACCCGGC GACTCGCGGT GCATCTGTCG CGACTCGGCT GCCGATGTAC AATCGTCACC ATGGGGCCGA CCCCATGGGG CGACGAGTTC GGCCGGGCTG GCGTCACCGT CATCAGCGTG CCCTTCATCG GCCATCGCTT CCCCGCGCCG TTGGTCAATC CGTTCGCCCT CTACCGTCTG TTTCGCGACG CCGACATCGT CCATGTCATG GGCTTCTGGT TTCTCCTGGC GTCGTTCAGC TCTGCGATCG CCTGCGCCGC CGGCACGCCG CTGCTGCTGT GCCCGGCCGG CTCCCTCACC CAGTACGGTA GAAGCGCCGC GATCAAGCGC GTCTTTACCG CGCTCGCCGG GCGACCGATG CTGCGCAGCG CCGCTGCGAT CATTGCGACG ACGCGTCAGG AAGAAGCGCT GCTGGTCTCG GATTTCGCGA TTCCGGCAGA CTCGATCCTG ATCGCGCCGA ACGGCATCGA GCTTCCCGGA GAAGGACGAC CGGGAGGTAT GGTGATCCCG GACAAACGAT TCGTCCTGTT CGTTGGCCGG CTGACCGCGA TCAAGGCGCC GGACCTGCTG CTCGAAGCGT TCGCGCGGAT CGCCCCGGAA ATAGCGGATG TGAGCCTTGT GATCGCCGGC CCCGATCTCG GGATGCGGCC TCAGCTCGAA CGCCGGACCG CAGAACTGGG GCTTCAGGCG CGGGTGCATT TTGCGGGCTT CGCCGATGAG GCGCAGCGGA CGGCGCTGCT GGCCCGGGCG TCGCTGCTCG CGGTTCCGTC GCATTCCGAA GTGATGTCGA TGGTGGCGCT CGAGGCCGGC GCGATGGGCG TCCCGGTCCT GCTCACCGAC CGCTGCGGCT TCGACGAGGT CGAACAGATC GGCGGCGGCC GCGTGGTGCC GGTCGACGTT GGAGCCATTG CGGAAGGTTT GCGTCAAATG TTGTCGGACG ACGACGCACT GCGGCAATCG GGGCAAGCGC TGCGTGGTTT CGTGCTCGAG CACTACGAAT GGTCGCGGGT GGCGGCGGCG TTGCTCCGCG ATTTCCGCCG CTTGGCAGCG CAACGTCACG GACCCCGCTG A
|
Protein sequence | MNVILVNVWL DAERGGGTAE RTRRLAVHLS RLGCRCTIVT MGPTPWGDEF GRAGVTVISV PFIGHRFPAP LVNPFALYRL FRDADIVHVM GFWFLLASFS SAIACAAGTP LLLCPAGSLT QYGRSAAIKR VFTALAGRPM LRSAAAIIAT TRQEEALLVS DFAIPADSIL IAPNGIELPG EGRPGGMVIP DKRFVLFVGR LTAIKAPDLL LEAFARIAPE IADVSLVIAG PDLGMRPQLE RRTAELGLQA RVHFAGFADE AQRTALLARA SLLAVPSHSE VMSMVALEAG AMGVPVLLTD RCGFDEVEQI GGGRVVPVDV GAIAEGLRQM LSDDDALRQS GQALRGFVLE HYEWSRVAAA LLRDFRRLAA QRHGPR
|
| |