Gene Rpal_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3221 
Symbol 
ID6410891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3467506 
End bp3469671 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content67% 
IMG OID642713097 
Productglucosyltransferase MdoH 
Protein accessionYP_001992198 
Protein GI192291593 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.152061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCCG TGATCGCGCC GCGGCACGAG GACTGTCGCG ACGCCGAGCG CCGAGCCGAA 
CCGTTGCCTG CGAACGCACC GTTATCGATG CCGGTGCAGT CGCTGCTGCA GAGTCCCCAG
GCTGCCGCAG TCCGATCGCC GGCCGCCTGG CGCCGCGCGC TGATCATGGT CGCGACCGCA
GTGCTGTCCG CGGCGGGCAT CTACGAGATG TATCAGGTGC TGCAGGTCGG CGGCATCACG
GTTCTCGAAG GCGTGGTGCT GGTGCTGTTC GCCGCGCTGT TCGCCTGGGT CGCGCTGTCG
TTCGTCTCGG CGCTGGCCGG CTTCACCGTG CTGTGCTGCG GCTGGCGCGA CGATGTCGGG
ATCATGCCGG ATGGCTCTAT GCCGGCGGTC TCCTCCAAGA TCGCGATGCT GCTGCCGACC
TACAACGAAG ACGCCCCGGT GGTGTTCGCG AGGTTGCAGG CGACGCGGCA ATCGGTCGAC
GAAACCGGCC GCGGCGCGCA ATTCGACTGG TTCGTGCTCA GCGACTCCAC CGATCCGTCA
GTGTGGATCG ACGAAGAGCG CTGCTATGCC GAACTCGCCG CCACACACGA CCGTCTGTAC
TATCGGCACC GGCCCTACAA TACGGCACGC AAGTCCGGCA ACATCGCCGA CTGGGTCGAG
CGCTTCGGCG GCGCTTACGA CTTCATGGTC ATCCTCGATG CCGACAGCGT GATGACCGGC
GACGTGCTGG TGCGTATCGC CGCCGCGATG GAGACGAACA GCGACGTCGG ATTGATCCAG
ACTCTGCCGG TCGTGGTTCA GGCGCGCACG CTGTTCGCGC GGGTCCAGCA ATTCGCCGGC
AGCATCTACG GGCCGATGAT CGCCGCCGGC ACCGCATGGT GGCACGGCTC CGAGAGCAAC
TATTGGGGCC ACAATGCGAT CATCCGGGTA TCGGCGTTCG CGGGCAGCGC CGGGCTGCCG
ACGTTGGCGG GCCGCAAACC ATTCGGCGGC GAAATCCTCA GCCACGATTT CGTCGAGGCG
GCGCTTATGC GCCGCGGAGG CTGGCGCATT CACCTCGCCC CGACGCTGCG CGGCAGCTAC
GAGGAGTGCC CGCCGTCACT GCTCGATTTC GCCGCGCGCG ACCGGCGCTG GTGCCAGGGC
AATCTGCAGC ACGGCAAGCT GCTCACCGCC CGTGGACTGC ATTGGGTGTC GCGGCTGCAT
TTCCTCACCG GCATCGGCGC TTATCTCACC GCGCCGATGT GGCTGGCGTT TCTCGTTGCC
GGCATCCTGA TTTCGCTGCA GGCGCAGTTC GTCCGCCCCG AATATTTCCC CAAGGACTTC
TCGCTGTTTC CGATCTGGCC GGCGCAGGAC CCGGTGCGCG CCGCCTGGGT GTTCGCCGGC
ACGATGGGAC TGTTGATCCT GCCGAAGCTG CTGGCGCTCT TGCTGGTACT GATCCGCAGC
CAGACCCGGA GGCGGTTCGG CGGCGGCCTG CGCACCTTCG GCGGCGTACT GCTGGAGACG
ATGATCTCAG CTCTGACCGC ACCGGTGATG ATGGTGTTTC AATCAACGGC TGTGATCGAG
ATCCTGCTCG GCCGCGACGC CGGCTGGCAG GTGCAGCATC GCGGCGATGG CGCGATCCCG
CTGCGTGAAG TCGTCCGCCG CTACGCGCTG CCGACAGCGC TGGGCGCGAC CATGGCGGTC
GGAGCGTGGC TGGTGTCGTG GCCGCTGCTG CTGTGGATGA CGCCGGTCAT CGTCGGCCTG
CTGCTCGCGA TCCCGGTGGC GCTGCTGACG ACGCGTGTCT CCCGCTCGCG TCCGTTGCTG
ATGACGACGC CGGAGCAGAT CGATCCGCCG GCGATCCTGG CGCAGGTGCA CGCACTCGCC
GATCGTCTTC GCCCGGCGAA CCAGACAACC GATCCGCTGA GCGCGCTTTG TAGCGACCGA
CGACTCCGAG AGCTCCATCT CGCCGCTCTG GCCTTCCATC CTCCGCGCCG GCGCGGCCGT
ATCGATCCGC ATCTGGCGAC CGCGCGCGTG CTGATCGACG ACGCTGAAAG TTATAGCGAG
GCGGCCGGCT GGCTCGGCCC ACGCGAAATC CGCGCTGTGC TCGGTGATCG TGAGACCCTG
CAACGGCTGC TGAAATTGTC CGGCGAACAC GCGCAGCTCG CAGTCGGCAG CGAGCCCAGC
GGTTAG
 
Protein sequence
MDAVIAPRHE DCRDAERRAE PLPANAPLSM PVQSLLQSPQ AAAVRSPAAW RRALIMVATA 
VLSAAGIYEM YQVLQVGGIT VLEGVVLVLF AALFAWVALS FVSALAGFTV LCCGWRDDVG
IMPDGSMPAV SSKIAMLLPT YNEDAPVVFA RLQATRQSVD ETGRGAQFDW FVLSDSTDPS
VWIDEERCYA ELAATHDRLY YRHRPYNTAR KSGNIADWVE RFGGAYDFMV ILDADSVMTG
DVLVRIAAAM ETNSDVGLIQ TLPVVVQART LFARVQQFAG SIYGPMIAAG TAWWHGSESN
YWGHNAIIRV SAFAGSAGLP TLAGRKPFGG EILSHDFVEA ALMRRGGWRI HLAPTLRGSY
EECPPSLLDF AARDRRWCQG NLQHGKLLTA RGLHWVSRLH FLTGIGAYLT APMWLAFLVA
GILISLQAQF VRPEYFPKDF SLFPIWPAQD PVRAAWVFAG TMGLLILPKL LALLLVLIRS
QTRRRFGGGL RTFGGVLLET MISALTAPVM MVFQSTAVIE ILLGRDAGWQ VQHRGDGAIP
LREVVRRYAL PTALGATMAV GAWLVSWPLL LWMTPVIVGL LLAIPVALLT TRVSRSRPLL
MTTPEQIDPP AILAQVHALA DRLRPANQTT DPLSALCSDR RLRELHLAAL AFHPPRRRGR
IDPHLATARV LIDDAESYSE AAGWLGPREI RAVLGDRETL QRLLKLSGEH AQLAVGSEPS
G