Gene Information |
Locus tag | RPD_3054 |
Symbol | |
ID | 4023557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3398496 |
End bp | 3400430 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637963253 |
Product | glycosyltransferase |
Protein accession | YP_570181 |
Protein GI | 91977522 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
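The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for RPD_3054.
length = gene_length(3398496, 3400430)
print(length, protein_length(length))  # prints "1935 644"
```

Both results match the Gene Length (1935 bp) and Protein Length (644 aa) fields listed in the record.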
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCGC CGCCGATCTG CAGAGGACGA CGCACCATCA AGCGCAAGCA GCCCAAGAAC CGTCGACCGA AGAAGAAGCA TCAGGGCGGC CGGAGCCAAG CGGGCTCAGT CGCCAGTTCG GCGGCAACGC CTGCTTCTTC CGACGCCGCA GCCCCAGCCG CGCCCGGCGA TACCGCCGTT GCGCCTGCGC CTGCTGCTGC TGCGCCGCCG CCGGCTGCTG CGAACAAGGC CGAGCGCGCG CCGGTCCGCC CTGCTCCCGT CGGTCACAAG CGCTCCGCTC CGCCGCCGAA GCCCGCGCAA TACAAATCAG CCCCCCGCCA ACCGGCCGCG CTGCCGACGC CGAACAGTCC AGCCGCCAAA CCGGTCGCCG CAGCGCCTCC CGGTCTGTTG CTCAGCGCCT GCGACACCGT GCTGCAGATC CTACGCAACG AGCCGCGGCG GCTGTTGCTG TGGATTCTCG GCATCTATGG CGGGCTGTGG TTCGCAGCCG CAATCAGCTT CCCGAGCCTG CCTGCCGTCA GCTACGAGAT GGCGCTGTTC GGCAAGGAAC TGGAGGCCGG ATACTGGAAA TATCCGCCGC TCGCGCCGTG GCTGACCGAG ATCGCCGCAC TGCTGACCGG CCGCTGGGAC GGCTCGCAAC TCGTCCTGTC GATCGGCTCG GCGCTGGCGG CGCTGGTTCT GGTGTGGCGG CTCGGCGCCG GCATCGTCGG TCATAGCGGC GCGACGCTGG CGGTGGCGCT GACTATCCTG ATCGGCTGTT TCGGCCCGCA GGTCACCGCC TTTGATCCGG CGATCGCCAG CCTGCCACTC GTGGTCGCGG CGGTGGCGCT GTATCGCAAA GCGGTGCTGG GCCAGGCGCG CTGGAGCTGG GTCGGGCTCG GAATCGTCTG CGCGCTCCTG GCGAACGCGA ACCACGCCGG TTTCGCTTTG ATGCTGGTCC TCATCGGACA TTTGTTGCTG ACGCGCGAGG GCCGCCGCCA TCTGCTCACC GCCGGCCCCG CGATCGCGGC GTTGGCCTGC TTCGTCGTGC TGCTGCCGCA TCTGATGTGG CTAGCGCAGA TGAACGCGGC AGGAACTCTC ACGCCCGTGG TTCATGCGTC CGATCTCTTG TCGCGGATCG CGACGGCCTT TGCCTTCGTA TTCGGCCAAG CGGGCTTGCA CCTCGGTTTG ATCCTGGTGG CGGTGCTGGC GATGCTGCCG CGGGTCCCGC TGCAAGGCGA ACCCGCCGTC CTCCAGCTCG AGGCGCCGAC CGGGTTCGAT CGCTCGCTGA TCCTCGCCGC CGCCTTTGTG CCATCGCTGC TGGTTGCGGC CGGCAGCGTG GTCGGATGGT TCACGATCGG CGCCTACACC GGCAGCGCGC TGGTGGCGTT GTCGGGTCTT GCGCTCGTCC TGCTGCTGCC ACCGCAAATC GTGCTGCGCG CGCCGCGCCT CGCGGTGGTG GTGTGGCTGC TGGTCCTGAT CGGCGTGCCG TTCGGCGCCA CCGCATCGAT CTATTCGAAA GCCTATGGCA GCGGCCCGCT GCCGACCGAG CTCTACCCCG CGCGAGCGCT GTCAAATGCG ATGCAGGCCG TCTGGAAGAG CCGCACCACC CGGCCGCTCG ACATCGTCAC CGGAAGCACC CGGCAGGCCG GATTCGTCGC GCTCACCGCG TCGCCGCGGC CGTCGGTGTT CATCGACGCG GATTTCGCCA AGAGCCCGTG GATCACGCCT GACCGGCTGA AACAATCCGG CACACTGGTG GTGTGGTCGA CCGACGAATT CGCACGGACC GATGAAATAC CGGCACCGTA TCGCAGCGCG 
CTCGGAGCGG CCGCGCCGCT GTTTGGCACC ATGGTGCTAC CGCTCGGCCG CGGCAAACTG AAGGCCTATG GCTGGGCGAT GATCGCGCCC GACGGCGTCC AGCTGCCGGC GGCAGCACCG CCTGCTCCCA AATAA
|
Protein sequence | MTAPPICRGR RTIKRKQPKN RRPKKKHQGG RSQAGSVASS AATPASSDAA APAAPGDTAV APAPAAAAPP PAAANKAERA PVRPAPVGHK RSAPPPKPAQ YKSAPRQPAA LPTPNSPAAK PVAAAPPGLL LSACDTVLQI LRNEPRRLLL WILGIYGGLW FAAAISFPSL PAVSYEMALF GKELEAGYWK YPPLAPWLTE IAALLTGRWD GSQLVLSIGS ALAALVLVWR LGAGIVGHSG ATLAVALTIL IGCFGPQVTA FDPAIASLPL VVAAVALYRK AVLGQARWSW VGLGIVCALL ANANHAGFAL MLVLIGHLLL TREGRRHLLT AGPAIAALAC FVVLLPHLMW LAQMNAAGTL TPVVHASDLL SRIATAFAFV FGQAGLHLGL ILVAVLAMLP RVPLQGEPAV LQLEAPTGFD RSLILAAAFV PSLLVAAGSV VGWFTIGAYT GSALVALSGL ALVLLLPPQI VLRAPRLAVV VWLLVLIGVP FGATASIYSK AYGSGPLPTE LYPARALSNA MQAVWKSRTT RPLDIVTGST RQAGFVALTA SPRPSVFIDA DFAKSPWITP DRLKQSGTLV VWSTDEFART DEIPAPYRSA LGAAAPLFGT MVLPLGRGKL KAYGWAMIAP DGVQLPAAAP PAPK
|
| |
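As an illustration of how the protein sequence relates to the gene sequence, the following is a minimal sketch of codon-by-codon translation. It assumes the sequence is given 5' to 3' on the coding strand, with whitespace separating the 10-base groups as displayed above. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the permitted start codons), so the standard assignment string is used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order; table 11 matches the standard code here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i] for i, codon in enumerate(product(BASES, repeat=3))
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGACTGCGC CGCCGATCTG CAGAGGACGA"))  # prints "MTAPPICRGR"
print(translate("CCTGCTCCCA AATAA"))                  # prints "PAPK"
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above. The sketch does not handle ambiguous bases or alternative start codons, which table 11 permits.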