Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2397 |
Symbol | |
ID | 3909531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2751683 |
End bp | 2753608 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637884296 |
Product | glycosyltransferase |
Protein accession | YP_486013 |
Protein GI | 86749517 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.628145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCGT CGCCGATCTG CAGAGGACGA CGCACCATCA AGCGCAAGCA GCCCAGGAAC ACGTCGAAAT ACCGTCGGCC GAACAAGAAG CGCCAGGGCG GCAAGAGCGG CGCGCGTCCG GCTGCGGCCG CGAATTCGGC CGGCTCCGAT CTGGCTGCCG AGACCGCCAC GACCCCGGTC ACTGCGCCCG CTCCGGTCGC TCCCCCGACG CCCGCCGTCG AGACGAAGGG CGCGCCGCCC CGCCCGGCTC CCCACGTTTC AAAGCGGCCC ACAGCGCCGC CGAAGCCATC CAGTTCGAAG ACATCGCCCG GTCAGCCGAC TGCACCACCG TCACCTGACG CCGTTGCGGT CACGATGCCC GGGGGTGCGG CCCCCGGTCT GCTGCTGCGC GCCTGCGACG TGGTGCTGCA GATCCTGCGC GGCGAGCCGC GGCGGCTGTT GCTGTGGATT CTCGGCATCT ATGGCGGGCT GTGGTTCGTC ACAGCCTTCA GCTTCCCGAG CCTGCCGGCG ATCAGCTACG AGATGGCGCT GTTCGGTAAG GAACTGCAGG CGGGCACTTG GAAGTATCCG CCGCTGGCGC CCTGGCTGAC CGAGATCGCC TCGCTGCTGA CCGGCGGGTG GAGCGGATCG CAGCTCCTGC TCTCGATCGG CTCAGCGCTG GCGACGCTGG TGCTGCTGTG GCGGCTCGGC GCTGGCATCG TCGGCGCAGC CGGTGCGACG CTGGCGGTCG CGCTGACGAT CCTGATCGGC TGCTTCGGCC CGCAGGTCAC CGGCTATGAT CCCGCCATCG CCGGCCTGCC GCTGACGGTC GCGGCGGTGC TGCTGTATCG GCAGGCGGTG CTCGGGCAAG CGCGGTCGAG CTGGATCGGC CTCGGCGTCG TCTGCGCGCT GCTGGGCAAT GCCAATCACG CCGGCTTCGC CCTGATCCTG GTGCTGCTCG GCCATCTGCT GCTGACGCGC GAGGGTCGCC GGCAGTTGGG GACGCTGGGC CCGCCGATCG CCGCGGTGGT GTGCTTTGTC GTGTTGCTGC CGCATCTGAT CTGGCTCGGT CAGGCGAATG CATCGGCGTC CGCCGCAGCG GGCGCGTCGG CCGATTTGCT GCCACGGATC GGCGCGGCCT TCGCCTTCGT GTTCGGCCAG GTCGGGCTGC ACGCTGGGCT GATCCTGATC GCGGTGCTGG CGGTGCTGCC GCGACTGCCG CTGCAGGGCG CCCCCGCAAC GATCGAGCTC GACACGCCGA GCAGCTTCGA TCGCTCGCTG ATCCTCGCCG CCGCCTTCGT GCCGTCGATG CTGGTTGCGG TCGGCAGCGT GCCGGACTGG TTCACGATCG GCGCCTACAC CGGCAGTGCG CTGGTGCCGC TATCGGGGCT GGCGCTGCTC CTGCTGCTGC CGCGGCGGCT GGTGCTACGC GCCCCGCGCC TCGCGGTGGT GGCGTGGCTG CTGGTGCTGG TGGGCGTTCC GATCGCCACC ACGGCATCGA TCTACGCCAG AGCCTATGGT GACGGCCCGC CGCCGACCGA GCTGTATCCG GCTCGCGCGC TGTCGCAGGC GATGCAGGCG GCGTGGAGAA GCCGGACCAC CCGGCCGCTC GATAGCGTCA CCGGGAGTGC CCGCCAAGCC GGCTTCGTCG CATTCGACGC CTCGCCGCGA CCATCGGTGT TCATCGATGC CGACTTCGCC AAGAGCCCGT GGATCACGCC GCAACGGCTG AAGCAATCCG GCACGCTGGT GGTGTGGTCG ACCGACGAAT TCGCCCGCAC CGACGAAATC CCGGCGCCCT ATCGCGGCAC GCTCGGCAGC AGCACGCCGG TGTTCGGCAC CATGGTGCTG CCGCTCGGCC GCGGCAAACT GAAAGCCTAT GGCTGGGCGA TGATCGCGCC GGAAGGCGAT CCACCGCAGG CGCCGGCACC GGCGCCGGCG AAGTAA
|
Protein sequence | MAASPICRGR RTIKRKQPRN TSKYRRPNKK RQGGKSGARP AAAANSAGSD LAAETATTPV TAPAPVAPPT PAVETKGAPP RPAPHVSKRP TAPPKPSSSK TSPGQPTAPP SPDAVAVTMP GGAAPGLLLR ACDVVLQILR GEPRRLLLWI LGIYGGLWFV TAFSFPSLPA ISYEMALFGK ELQAGTWKYP PLAPWLTEIA SLLTGGWSGS QLLLSIGSAL ATLVLLWRLG AGIVGAAGAT LAVALTILIG CFGPQVTGYD PAIAGLPLTV AAVLLYRQAV LGQARSSWIG LGVVCALLGN ANHAGFALIL VLLGHLLLTR EGRRQLGTLG PPIAAVVCFV VLLPHLIWLG QANASASAAA GASADLLPRI GAAFAFVFGQ VGLHAGLILI AVLAVLPRLP LQGAPATIEL DTPSSFDRSL ILAAAFVPSM LVAVGSVPDW FTIGAYTGSA LVPLSGLALL LLLPRRLVLR APRLAVVAWL LVLVGVPIAT TASIYARAYG DGPPPTELYP ARALSQAMQA AWRSRTTRPL DSVTGSARQA GFVAFDASPR PSVFIDADFA KSPWITPQRL KQSGTLVVWS TDEFARTDEI PAPYRGTLGS STPVFGTMVL PLGRGKLKAY GWAMIAPEGD PPQAPAPAPA K
|
| |