Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3221 |
Symbol | |
ID | 6410891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3467506 |
End bp | 3469671 |
Gene Length | 2166 bp |
Protein Length | 721 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642713097 |
Product | glucosyltransferase MdoH |
Protein accession | YP_001992198 |
Protein GI | 192291593 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.152061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCCG TGATCGCGCC GCGGCACGAG GACTGTCGCG ACGCCGAGCG CCGAGCCGAA CCGTTGCCTG CGAACGCACC GTTATCGATG CCGGTGCAGT CGCTGCTGCA GAGTCCCCAG GCTGCCGCAG TCCGATCGCC GGCCGCCTGG CGCCGCGCGC TGATCATGGT CGCGACCGCA GTGCTGTCCG CGGCGGGCAT CTACGAGATG TATCAGGTGC TGCAGGTCGG CGGCATCACG GTTCTCGAAG GCGTGGTGCT GGTGCTGTTC GCCGCGCTGT TCGCCTGGGT CGCGCTGTCG TTCGTCTCGG CGCTGGCCGG CTTCACCGTG CTGTGCTGCG GCTGGCGCGA CGATGTCGGG ATCATGCCGG ATGGCTCTAT GCCGGCGGTC TCCTCCAAGA TCGCGATGCT GCTGCCGACC TACAACGAAG ACGCCCCGGT GGTGTTCGCG AGGTTGCAGG CGACGCGGCA ATCGGTCGAC GAAACCGGCC GCGGCGCGCA ATTCGACTGG TTCGTGCTCA GCGACTCCAC CGATCCGTCA GTGTGGATCG ACGAAGAGCG CTGCTATGCC GAACTCGCCG CCACACACGA CCGTCTGTAC TATCGGCACC GGCCCTACAA TACGGCACGC AAGTCCGGCA ACATCGCCGA CTGGGTCGAG CGCTTCGGCG GCGCTTACGA CTTCATGGTC ATCCTCGATG CCGACAGCGT GATGACCGGC GACGTGCTGG TGCGTATCGC CGCCGCGATG GAGACGAACA GCGACGTCGG ATTGATCCAG ACTCTGCCGG TCGTGGTTCA GGCGCGCACG CTGTTCGCGC GGGTCCAGCA ATTCGCCGGC AGCATCTACG GGCCGATGAT CGCCGCCGGC ACCGCATGGT GGCACGGCTC CGAGAGCAAC TATTGGGGCC ACAATGCGAT CATCCGGGTA TCGGCGTTCG CGGGCAGCGC CGGGCTGCCG ACGTTGGCGG GCCGCAAACC ATTCGGCGGC GAAATCCTCA GCCACGATTT CGTCGAGGCG GCGCTTATGC GCCGCGGAGG CTGGCGCATT CACCTCGCCC CGACGCTGCG CGGCAGCTAC GAGGAGTGCC CGCCGTCACT GCTCGATTTC GCCGCGCGCG ACCGGCGCTG GTGCCAGGGC AATCTGCAGC ACGGCAAGCT GCTCACCGCC CGTGGACTGC ATTGGGTGTC GCGGCTGCAT TTCCTCACCG GCATCGGCGC TTATCTCACC GCGCCGATGT GGCTGGCGTT TCTCGTTGCC GGCATCCTGA TTTCGCTGCA GGCGCAGTTC GTCCGCCCCG AATATTTCCC CAAGGACTTC TCGCTGTTTC CGATCTGGCC GGCGCAGGAC CCGGTGCGCG CCGCCTGGGT GTTCGCCGGC ACGATGGGAC TGTTGATCCT GCCGAAGCTG CTGGCGCTCT TGCTGGTACT GATCCGCAGC CAGACCCGGA GGCGGTTCGG CGGCGGCCTG CGCACCTTCG GCGGCGTACT GCTGGAGACG ATGATCTCAG CTCTGACCGC ACCGGTGATG ATGGTGTTTC AATCAACGGC TGTGATCGAG ATCCTGCTCG GCCGCGACGC CGGCTGGCAG GTGCAGCATC GCGGCGATGG CGCGATCCCG CTGCGTGAAG TCGTCCGCCG CTACGCGCTG CCGACAGCGC TGGGCGCGAC CATGGCGGTC GGAGCGTGGC TGGTGTCGTG GCCGCTGCTG CTGTGGATGA CGCCGGTCAT CGTCGGCCTG CTGCTCGCGA TCCCGGTGGC GCTGCTGACG ACGCGTGTCT CCCGCTCGCG TCCGTTGCTG ATGACGACGC CGGAGCAGAT CGATCCGCCG GCGATCCTGG CGCAGGTGCA CGCACTCGCC GATCGTCTTC GCCCGGCGAA CCAGACAACC GATCCGCTGA GCGCGCTTTG TAGCGACCGA CGACTCCGAG AGCTCCATCT CGCCGCTCTG GCCTTCCATC CTCCGCGCCG GCGCGGCCGT ATCGATCCGC ATCTGGCGAC CGCGCGCGTG CTGATCGACG ACGCTGAAAG TTATAGCGAG GCGGCCGGCT GGCTCGGCCC ACGCGAAATC CGCGCTGTGC TCGGTGATCG TGAGACCCTG CAACGGCTGC TGAAATTGTC CGGCGAACAC GCGCAGCTCG CAGTCGGCAG CGAGCCCAGC GGTTAG
|
Protein sequence | MDAVIAPRHE DCRDAERRAE PLPANAPLSM PVQSLLQSPQ AAAVRSPAAW RRALIMVATA VLSAAGIYEM YQVLQVGGIT VLEGVVLVLF AALFAWVALS FVSALAGFTV LCCGWRDDVG IMPDGSMPAV SSKIAMLLPT YNEDAPVVFA RLQATRQSVD ETGRGAQFDW FVLSDSTDPS VWIDEERCYA ELAATHDRLY YRHRPYNTAR KSGNIADWVE RFGGAYDFMV ILDADSVMTG DVLVRIAAAM ETNSDVGLIQ TLPVVVQART LFARVQQFAG SIYGPMIAAG TAWWHGSESN YWGHNAIIRV SAFAGSAGLP TLAGRKPFGG EILSHDFVEA ALMRRGGWRI HLAPTLRGSY EECPPSLLDF AARDRRWCQG NLQHGKLLTA RGLHWVSRLH FLTGIGAYLT APMWLAFLVA GILISLQAQF VRPEYFPKDF SLFPIWPAQD PVRAAWVFAG TMGLLILPKL LALLLVLIRS QTRRRFGGGL RTFGGVLLET MISALTAPVM MVFQSTAVIE ILLGRDAGWQ VQHRGDGAIP LREVVRRYAL PTALGATMAV GAWLVSWPLL LWMTPVIVGL LLAIPVALLT TRVSRSRPLL MTTPEQIDPP AILAQVHALA DRLRPANQTT DPLSALCSDR RLRELHLAAL AFHPPRRRGR IDPHLATARV LIDDAESYSE AAGWLGPREI RAVLGDRETL QRLLKLSGEH AQLAVGSEPS G
|
| |