Gene Information |
Locus tag | Rpal_2928 |
Symbol | |
ID | 6410598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3192962 |
End bp | 3195796 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642712809 |
Product | glycosyl transferase family 2 |
Protein accession | YP_001991911 |
Protein GI | 192291306 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
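The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3192962
end_bp = 3195796

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "2835 944"
```

Both results agree with the Gene Length and Protein Length rows of the record.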
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.904629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCTG TCGTCGCCGT TCTGCTTTTC GTCACCGCCG CTCACGCCGG GCTCTGGGGC CTCCTTCGGG AGAAACAACA AGCCGCCGAC TTCACCGGCA TCCTGCCGAG CGTGTCCTAC GCGCCGTTCG ATGGCTCCGG CCATCCGGAC GTCGACAACT TCCCGACCGC TGAACGGATC CGCTCCGACC TGAAAAAGCT GTCCGCGCAG ACCCGCGCCA TCCGTCTGTA CTCGTCGACC GGCGGCCCCG AGATGGTACC GCCGATCGCC AATGAGTTCG GCCTGAAGGT CAATGTCGGC GCCTGGATCG ACAAGGACGT CACCCGCAAC GAGCGCGAGA TCCAGGCCGC GATCGACCTC GCCAAGCACA ACGCCAATGT CAGCGGCATC GTGGTCGGCA ACGAAACGGT GTATCGCGGC GATCAGATCC CGTTGGAAAA TCTCGGGCTG AGCGAGGAAG AGCGCTACCG GCTGGTGGCC GAAGAGAACC AGCGGGTTCG CGACGCTGAA GCCCAGCCGG CCGACAAGCG CGACGAAGCG GTGCGCTGGG CCACGGCCGA AAACAACGTC CGCCGGTTGA CCCGGCTAAT CCAGCGCGTG AAGTCTCAGG TCAAAGCGCC GGTCACCTCT GGCGAAATCT GGAACATATG GCTCGAGCAT CCCGAGCTCG CCTCATCGGT CGACTTCATC GCCGCGCACG TCCTGCCTTA CTGGGAAGGC TTCTCCGCCA AGCAGGCGGT CGATCAGGCG ATGATCATCT ACCAGAAGCT GCGCGAGGCC TTCCCGGGCA AGCGCATCGT GATCGCCGAA TTCGGCTGGC CGAGCGCCGG CTATAACCGC AAGGCTGCGG TGCCCGGTCA GTTCGAACAG GCGGTGACGC TGCGCAACTT CGTCAGCCGC GCCGACTCGA TCGGCATGGA ATACAACATC GTTGAAGCGA TCGATCAGCC GTGGAAGTTC TTCGAAGGCG GCGTCGGTCC GTATTGGGGC TTCCTCGACG CCTCTCGCCA GCCGAAGTTC GCCTGGACCG GCCCGGTGGT CGATCCGAAC TACTGGAAGC TCGCCGGCAT CGCGTTGCTG GTCGGCATCC TGCTATCGCT GCCAATCCTG CAGCTCGCCG CCCCGACCGC GATGCAGACG CTGCTGCTGT CGGCCGCGGC GCACGGCGCC GGCGCCTGGG CCGCGACGGT GTTCGCCTAT TGGAACGGGC ACTACTTCCT GTTCGGCTCG GCCTTCGCGC TGACGCTTGG CATGATCCTG CTGATCCCGC TGGTGGCAAT CGCCTTGGCG CGGATCGAGG AAATCTCGGC CGTGGCCTTC GGCCGCAAAC CGCGCCGGCT GATCACCCGT GCCCTCACCG ACGCTCAGGA AACCAAGCGT GCCGCAGCGA TCGCCAGCGG CGAGCCGGTC AATGTTCCGA AGGTCTCGAT CCACGTTCCG GCCTATTTCG AACCGCCGGA GATGCTGAAG CAGACCCTGG ATGCGCTGGC TCGGCTCGAT TACCCGAATT TCGAAGTCGT GGTGATCATC AACAATACAC CTGATCCGGC TTTCACCCAG CCGATCCAGG ATCACTGCCG CGAGCTCGGC GAACGCTTCA AGTTCATCAA CGCCGAGAAG GTCAAAGGCT TCAAGGCTGG CGCGCTGCGG ATCGCGATGG AGCGCACCGC GGTCGATGCA GAGATCATCG GCATCATCGA CGCCGACTAT GTGGTGACAC CGGACTGGCT GAAGGACCTG GTGCCTGCGT TCGACGATCC GCGCGTCGGC CTGGTGCAGG CGCCGCAGGA ACACCGCGAC 
GGCGACCGCT CGTTGATGCA CTACATCATG AACGGCGAAT ATGCCGGGTT CTTCGACATC GGCATGGTGC AGCGCAACGA ATACAACGGC ATCATCGTGC ACGGCACGAT GTGCCTGATC CGCCGCGCAG CGATGGACAT GGCCGGCGGC TGGTCGAGCG ACACCATCTG CGAAGACTCC GATCTCGGCC TCGAGATCAT GGAGCACGGC TGGCTCACCC ACTACACCAA CACCCGCTAC GGCTACGGCC TGCTGCCGGA CACCTACGAG GCGTTCAAGA AGCAGCGGCA TCGCTGGGCC TATGGCGGCT TCCAGATCAT CAAGAAGCAC TGGCGCCGAT TCCTGCCCGG CAACAGCCGC CTCAGCCGCG ACCAGCGCCG CGAATTTGGC CTCGGCTGGC TGAACTGGCT CGGCGCCGAA AGCCTCGGCG TGGTGGTGGC GATCCTCAAC CTGATCTGGG TCCCGATCGT CGCGTTCGCC GACATCGCGA TTCCCGACAA GATCCTGACC TTGCCGATCA TCGCCTCCTT CATCGTCACG CTGGCGCACT TCCTGGTGCT GTACCGGCTG CGGGTGAAGA TCGGCGTACC GCAGATGCTC GGCGCGATGA TCGCGGCGAT GTCGGTGCAA TGGACTGTGT CGCGCGCCGT CGCCCAGGGC CTGATCACCG AGCACCTCGC CTTCGCCCGC ACCTCCAAGG GCGGCCTGAC GATGATGTCG GTCGAATTCC AGGCGTTCTG GGAAGCGGTC ATCGGCGTGC TGCTGCTGAT CGGTGCCGGG ATCCTGGTGG TCTCCAACAG CAACATCCAG ATCACCGAGA TCTACATCTT CGCCGGCGTC TTGGTGCTGC AGAGCCTTCC GTTCCTGGCG GCGGTAGCGA TCGCGATTCT GGAGAACTCC CGAATCAATC AGTTCGGCTG GTGGACCGCG ACCGCGGTGC GGACTGCCGA ATTGATCGGC CTGCGCCCGG TGGCGCTGCC GACCCCGATC CCGGCGCCGC AGAAGGTCGC CTCGGAGCTT CACCGCGACG CGTAA
|
Protein sequence | MRAVVAVLLF VTAAHAGLWG LLREKQQAAD FTGILPSVSY APFDGSGHPD VDNFPTAERI RSDLKKLSAQ TRAIRLYSST GGPEMVPPIA NEFGLKVNVG AWIDKDVTRN EREIQAAIDL AKHNANVSGI VVGNETVYRG DQIPLENLGL SEEERYRLVA EENQRVRDAE AQPADKRDEA VRWATAENNV RRLTRLIQRV KSQVKAPVTS GEIWNIWLEH PELASSVDFI AAHVLPYWEG FSAKQAVDQA MIIYQKLREA FPGKRIVIAE FGWPSAGYNR KAAVPGQFEQ AVTLRNFVSR ADSIGMEYNI VEAIDQPWKF FEGGVGPYWG FLDASRQPKF AWTGPVVDPN YWKLAGIALL VGILLSLPIL QLAAPTAMQT LLLSAAAHGA GAWAATVFAY WNGHYFLFGS AFALTLGMIL LIPLVAIALA RIEEISAVAF GRKPRRLITR ALTDAQETKR AAAIASGEPV NVPKVSIHVP AYFEPPEMLK QTLDALARLD YPNFEVVVII NNTPDPAFTQ PIQDHCRELG ERFKFINAEK VKGFKAGALR IAMERTAVDA EIIGIIDADY VVTPDWLKDL VPAFDDPRVG LVQAPQEHRD GDRSLMHYIM NGEYAGFFDI GMVQRNEYNG IIVHGTMCLI RRAAMDMAGG WSSDTICEDS DLGLEIMEHG WLTHYTNTRY GYGLLPDTYE AFKKQRHRWA YGGFQIIKKH WRRFLPGNSR LSRDQRREFG LGWLNWLGAE SLGVVVAILN LIWVPIVAFA DIAIPDKILT LPIIASFIVT LAHFLVLYRL RVKIGVPQML GAMIAAMSVQ WTVSRAVAQG LITEHLAFAR TSKGGLTMMS VEFQAFWEAV IGVLLLIGAG ILVVSNSNIQ ITEIYIFAGV LVLQSLPFLA AVAIAILENS RINQFGWWTA TAVRTAELIG LRPVALPTPI PAPQKVASEL HRDA
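The GC content figure in the Gene Information table could in principle be recomputed from the gene sequence. This is a minimal sketch, assuming that GC content means the fraction of G and C bases over the gene sequence and that the spaces between 10-base blocks are display grouping only. `gc_fraction` is a hypothetical helper, not part of any tool referenced by the record, and the sample input is only the first three blocks of the sequence, so its result is not expected to match the record's 64% value.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    # Remove the display grouping (spaces, line breaks) and normalise case.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three 10-base blocks of the gene sequence, as printed in the record.
sample = "ATGCGCGCTG TCGTCGCCGT TCTGCTTTTC"
print(round(gc_fraction(sample), 2))
```

Passing the full gene sequence string in place of `sample` would give the value to compare against the GC content row.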