Gene RPD_1649 details

Gene Information

Locus tag: RPD_1649
Symbol:
ID: 4022129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1857299
End bp: 1858648
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 67%
IMG OID: 637961844
Product: glycosyl transferase, group 1
Protein accession: YP_568787
Protein GI: 91976128
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
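
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1857299
end_bp = 1858648

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# One codon per residue; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1350 bp) and Protein Length (449 aa).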


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0395854
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAAG TGACGGACAC CCCCTTGGCG CAGCCCCCCG ACACACAGGC CCCTCTGTCT 
CAACCCTGGC TGTGGATGGA CGTGTCGACC AGCTTTCGCT CGCGCTCCGG CCAGATGAAC
GGCACGCTCC GCGTCGAGCA AAGCATGGCG ACGGCGCTGT CCGAACTGAT GGCGCCGCAG
CTCCGGTTCT GCCGCTACGA TCCGCTGCGG CGGGACTACG TGCCGCTCGC CAATGCGCCG
GATCTCGGCG ACAAGCCGGT CGCCGCCGCA CCGAAGCAGA AGCGCACGAC GGCGCTGTCG
TCGATCAAGC CGCTCGGCAA GAAAATCGAA CGTGCGATCC GAACGTCGGT GCGCAGCGCC
GCGGCGCCGT TGTTGCAAAA GATATCAGGC AGCAATGGAC TGCCGCTGAT CGGCGGCGCC
GACGGCCGGG AAGTCCTGCT GCTCGCCGGC GAAAACTGGT CGCGGGTGAA CTACGCCGCG
GTGGCGCGGA TGCGCCGCGA ACGCGGCACC AGGGTCGCCG CGGTGTGCCA GGATTTCATC
CCGGTGATGG CGCCGCAATT CTTCGCCGAC GGCGATTTCG TCACGATGTT CGACGCCTAT
GCGCAATTCC TGATCCGCGA ATGCGACCTG ATCATCTCGA TCTCGCAATC GACCAGCGCT
GATGTGATGG CCTATGCGCA GCGCCATGGC GGCCTGCGCG GCGCGATCGA GCTGGTGCAT
CTGGGGGCTG ATCTGGCCGC GCCGGAAGCC GGGCGGCGGC CGCAGGCGCT CAGCGACGCG
CAGGCCAAGC GCTTCGTGAT CAGCGTCTCG AGCATTCAGT CGCGCAAGAA TTTCGACCTG
CTGTATCACC TCTGGCGGCG CCTGACCGAG CAGGGCACCC CGGACCTGCC GACGCTGGTG
CTGGTCGGCC AACCCGGCTT CGGCAGTTCG GACCTGTTGT GGCAGATCGC GCATGATCCG
GTCACCGCGT CCTCGATCGT GCATTTGCCG CGCGCCGGCG ACGCGGAGCT GGCGTGGCTT
TACCGGCACT GCGCGTTCAC GCTGTATCCC TCGTTCTACG AAGGCTGGGG ACTGCCGGTA
TCGGAGAGTC TGGCGTTCGG CAAATACTGT CTCGCATCGA ACACGTCATC TCTGCCGGAG
GCCGGAGCGG GGCTGGCCGG CCACCTCGAT CCGCTGGATT TCGCCGCCTG GCGCGAAGCG
GTGCTTGACC TGATCCGTGC GCCTGATCAA CTTGCACGGC ACGAGGCGGC GATCCGGCAG
AACTATCGCC CGGTGACCTG GGCGCAATCG GCGCATCGGC TGGCGGAGGT GTTGCGCAGC
CTGTCGGCCG CGTCCTACCC CAAGGTCTAG
 
Protein sequence
MTKVTDTPLA QPPDTQAPLS QPWLWMDVST SFRSRSGQMN GTLRVEQSMA TALSELMAPQ 
LRFCRYDPLR RDYVPLANAP DLGDKPVAAA PKQKRTTALS SIKPLGKKIE RAIRTSVRSA
AAPLLQKISG SNGLPLIGGA DGREVLLLAG ENWSRVNYAA VARMRRERGT RVAAVCQDFI
PVMAPQFFAD GDFVTMFDAY AQFLIRECDL IISISQSTSA DVMAYAQRHG GLRGAIELVH
LGADLAAPEA GRRPQALSDA QAKRFVISVS SIQSRKNFDL LYHLWRRLTE QGTPDLPTLV
LVGQPGFGSS DLLWQIAHDP VTASSIVHLP RAGDAELAWL YRHCAFTLYP SFYEGWGLPV
SESLAFGKYC LASNTSSLPE AGAGLAGHLD PLDFAAWREA VLDLIRAPDQ LARHEAAIRQ
NYRPVTWAQS AHRLAEVLRS LSAASYPKV
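
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is one way to do that, assuming the standard codon assignments (which translation table 11 shares with the standard code for ordinary codons; table 11 differs mainly in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative helpers, not part of the record. Only the first row of the gene sequence is used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order.
# Assumption: standard amino acid assignments, as used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above (60 nucleotides, 20 codons).
first_row = "ATGACCAAAG TGACGGACAC CCCCTTGGCG CAGCCCCCCG ACACACAGGC CCCTCTGTCT"
print(translate(first_row))  # MTKVTDTPLAQPPDTQAPLS
```

The result matches the first 20 residues of the protein sequence (MTKVTDTPLA QPPDTQAPLS). Passing the full gene sequence to the same function would extend the comparison to the whole protein.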