Gene Information |
Locus tag | RPD_2553 |
Symbol | |
ID | 4023047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2858300 |
End bp | 2860042 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637962749 |
Product | glycosyl transferase family protein |
Protein accession | YP_569684 |
Protein GI | 91977025 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
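The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of the source database.

```python
# Values copied from the record above.
START_BP = 2858300
END_BP = 2860042


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1743
print(protein_length(length_bp))  # 580
```

Both results match the Gene Length (1743 bp) and Protein Length (580 aa) rows of the record.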
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.182713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA CCTCTCCAAC GCCCCGCTTC GGTGGCCCCC GCGAGCCCAG AAATGCCATC GACCCCGGCC GCGGGCTGGT GGCGGTGCTG GATTTTGCGT CCGTCAGCCA TCTGCGCGCA GTGGCGTTCT TGGTACTGGT CGGGCTGCTG TTTTTTCTGC CGGGTTTCTT CAACATCCCG CCGATCGACC GCGACGAGGC CCGCTTCGCC CAGGCCACCA AGCAGATGGT CGAGAGCGAT GATTTCATCG ATATCCGGTT CCAGGACGAG GTCCGCTACA AGAAGCCGGT CGGGATCTAC TGGTTGCAAG CCGCGGTCGT CGAAACCGCG TCGCGGCTCG GCCTGCCGCG CGCCGAAGTC CGGATCTGGC TCTATCGCGT GCCGTCGCTG GCCGGTGCGA TCGGCGCGGT GCTGATGACC TATTGGGCGG CGCTCGCCTT TGTCGGGCGG CGCGGGGCGG TGATCGCCGG TCTTCTGTTG TGCAGCTCGA TCTTGCTGGG TGTCGAGGCG CGACTGGCCA AGACCGACGC GTTCTTGCTG TTCACGGTGA CCGCCGCGAT GGGCGCGATG GCGCATGTCT ATCTCGCCTG GCAGCGCGGC GACGACCGCT ACCACTCCTG GATCACGCCG GCCATTTTCT GGACCGCGGT CGCCGCGGGC ATTCTTCTCA AGGGCCCGCT GATCCTGATG TTCATCGCGC TGACGGTGGC CGCGCTGGCG TTTGTCGATC GCTCGGCGGT GTGGCTGTGG CGTCTGAAGC CGCTGGCCGG CGTGCTGTGG ATGCTGGTGC TGGTGCTGCC GTGGTTCATC GCGATCTTCC TGCGCGCCGG CGACACCTTC TTCGCCGACT CGGTCGGCGG CGACATGTTG AGCAAGATCG CCAGTCCCAA GGAATCCCAC GGCGCGCCGC CGGGCCTGTA TTTCCTGCTG TTCTGGGTGA CGTTCTGGCC GGGCGCGCCG TTGGCCGCGA TGGCTGCGCC TGCGGTGTGG CGGGCACGGC GCGAGCCGGG CGCGCAATAT TTGCTGGCCT GGGTGATCCC GTCCTGGATC GTGTTCGAGC TGGTGATCAC CAAGCTGCCG CACTATGTGC TGCCGCTGTA TCCGGCGATC GCGATCATGA CCGCTGGCGC GATCGAGCAC AGCGTGCTGT CGCGCTCCTG GCTGACCCGC GGCGCGGCGT GGTGGTTCGC GATTCCAGTC GTCGTGCTGT CGCTCGCGAT CATCGGCGCC ATCATCCTGA CCCGGCAGCC GGCGTTCCTG GCGTGGCCGT TCGTCGCGGC CTCGCTGATT TTCGGGCTGT TCGCGTGGCG GCTGTTCGAC CAGAACCGCG CCGAAGCCTC GCTGCTCAAC GCCTCGCTGG CGTCGCTGTT TCTGATGGTC GCCGCGCTCG GCGTCGTGGT GCCGACGCTG CGGCCGGTGT TCCCGAGCGT CGAGATCGCG CAGGCGCTGC GCAAGGTGGT GTGCGTCGGG CCTAAGGCCG CGGCCGTGGG CTTCCACGAG CCGAGCCTGG TGTTCATGAC CGGCACCGAT ACGTTGCTGA CCGACGGCTC CGGTGCCGCC GACTTCCTGC TCGGCGGAAG CTGCCGCTTC GCGCTGGTGG AAGCTCGCAG CGAGCGCGCA TTCGCGGCGC GGGCCGAGGC GATCGGGTTG CACTACAACG TGGCGACCCG GATCGACGGC TACAATTTCT CGCAGGGCAA GCCGGTGTCG ATCGCGATCT TCCGTTCCGA AGGCACGCAG TAA
|
Protein sequence | MTETSPTPRF GGPREPRNAI DPGRGLVAVL DFASVSHLRA VAFLVLVGLL FFLPGFFNIP PIDRDEARFA QATKQMVESD DFIDIRFQDE VRYKKPVGIY WLQAAVVETA SRLGLPRAEV RIWLYRVPSL AGAIGAVLMT YWAALAFVGR RGAVIAGLLL CSSILLGVEA RLAKTDAFLL FTVTAAMGAM AHVYLAWQRG DDRYHSWITP AIFWTAVAAG ILLKGPLILM FIALTVAALA FVDRSAVWLW RLKPLAGVLW MLVLVLPWFI AIFLRAGDTF FADSVGGDML SKIASPKESH GAPPGLYFLL FWVTFWPGAP LAAMAAPAVW RARREPGAQY LLAWVIPSWI VFELVITKLP HYVLPLYPAI AIMTAGAIEH SVLSRSWLTR GAAWWFAIPV VVLSLAIIGA IILTRQPAFL AWPFVAASLI FGLFAWRLFD QNRAEASLLN ASLASLFLMV AALGVVVPTL RPVFPSVEIA QALRKVVCVG PKAAAVGFHE PSLVFMTGTD TLLTDGSGAA DFLLGGSCRF ALVEARSERA FAARAEAIGL HYNVATRIDG YNFSQGKPVS IAIFRSEGTQ
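The sequences above are stored in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below strips the whitespace, computes a GC fraction, and translates a reading frame. It is an illustration under stated assumptions: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the input is assumed to contain only A, C, G, and T, and the codon table uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
# Compact codon table: bases in TCAG order, amino acids in the matching order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGACCGAGA CCTCTCCAAC GCCCCGCTTC"
print(translate(first_blocks))  # MTETSPTPRF
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value that can be compared against the GC content row.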