Gene Rpal_5066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5066 
Symbol 
ID6412760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5451978 
End bp5453105 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content64% 
IMG OID642714951 
Productglycosyl transferase group 1 
Protein accessionYP_001994030 
Protein GI192293425 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCTG CCGTGCCGCC GAGATCAATG CGTCGCCGCG TGCTGGTGAT CCAGACCCAG 
GCCGAGAACG CCGGCGCGCA GGAAATCTCA CGCCTCCTCG GCGCGGGCCT CACAGCGCGC
GGTTACGATG TCCATAACCT GTTCTTCTTC CGGCAATCGC GTGGCTTCGA CGAGCCGGTG
AACACCAGCT ATTGCGCCCC GCGGCGCCCC GGCAATCCGC TGGCGTTCCT ACGTTTCCTG
GTGTCTCTCG GCCGGCAGAT CCGCGCCGCC CGTCCCGACG CGATCCTGAC GTTTCAGCAC
TATGGCAACG CCATCGGCGG CGCGATCGCG CGGCTGGTGA GCCCGGCGCC GGTGATTGCC
AATCAAGTGT CAGCGCGACT GACAATGCCC GGCTGGCTGC AGAAGCTCGA TCTGATGATG
GGGCGGCTCG GCGTCTTCAA CACCATCACG GTGAACTCCG AAGATATGCT GCATGACTAC
TCGCGCTATC CGGATTCGTA CCGCAGGCGC CTCACCCATG TTCCGCACGG CTTTGATCAG
AAGCACTCGG CGCTGACCGG GGCCGAAGCG CGCCGCCGCT TCGCATTGCC GGTGGACGGC
GTCATGCTCG GCACTGCGGC GCGGCTACAT CCGCTTAAAC AGCTCGATGC CACCATCCGG
GTATTGCCGG CGAAACCTTC CTGGCGGCTT GCGCTGGCTG GCCACGGGCC GGACGAAGTG
CGGTTGCGCG CACTGGCCGA ACAGCTTGGC GTCGCCGATC GGGTATTCTT TGTCGGCGAG
ATCACACCGG AACAAGTCGC CGATTTCCTT GCCAGCCTTG ACGTCTTCGT ATTCCCCTCG
CTGGCAGAGA CTTTCGGGCT TGCAGCAGTC GAGGCCGCCC ATGCGGGCGT GCCGGTGGTC
GCCAACGATC TGCCGGTGCT CCGTGAAGTA TTAGCCTATC AGGGAGAACC GGCGGCGTTG
CTGGTCGACG CCTCCGACAC GGCCGCTCTC GGCGCGGCGA TATCGGCGGT GCTCGACGAC
CCTGCGTTGA GGGTGCGACT TCAGCGCAGT GGCGAGGGCT TGAAGACCAG ATACTCTGTC
GACGCCATGG TCGATGAATA CGTCCATATT CTCGAACAGG CGATGTGA
 
Protein sequence
MVSAVPPRSM RRRVLVIQTQ AENAGAQEIS RLLGAGLTAR GYDVHNLFFF RQSRGFDEPV 
NTSYCAPRRP GNPLAFLRFL VSLGRQIRAA RPDAILTFQH YGNAIGGAIA RLVSPAPVIA
NQVSARLTMP GWLQKLDLMM GRLGVFNTIT VNSEDMLHDY SRYPDSYRRR LTHVPHGFDQ
KHSALTGAEA RRRFALPVDG VMLGTAARLH PLKQLDATIR VLPAKPSWRL ALAGHGPDEV
RLRALAEQLG VADRVFFVGE ITPEQVADFL ASLDVFVFPS LAETFGLAAV EAAHAGVPVV
ANDLPVLREV LAYQGEPAAL LVDASDTAAL GAAISAVLDD PALRVRLQRS GEGLKTRYSV
DAMVDEYVHI LEQAM