Gene RPD_0748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0748 
Symbol 
ID4021221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp839030 
End bp840040 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content55% 
IMG OID637960937 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_567887 
Protein GI91975228 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAAA ACAAGACACT CCTCATCACC GGCGGAACAG GGTCTTTTGG CAACGCCGTC 
CTTCACCGCT TCCTAAAGTC GGACTTCCAG GAAATCCGGA TCTTCAGCCG TGACGAGAAG
AAGCAGGAGG ACATGCGCAT CGCGCTGAAG GACGACCGCG TGAAATTCTA TATCGGGGAC
GTCCGCGATT ACGAAGCTGT CGATGACGCG ATCAACGGTG TCGACTACGT CTTCCACGCC
GCAGCCCTCA AGCAGGTTCC ATCCTGCGAG TTCTATCCGA TGGAAGCTAT CAGGACCAAT
GTGCTGGGCG CTGAGAACGT CATGCGGGCC GCCGTTAACC GCGGCGTCAG CAGGTGTGTT
GTGCTGAGCA CAGACAAGGC TGTCTATCCG ATCAACGCCA TGGGCATGTC AAAGGCGATG
ATGGAGAAGG TGATGGTAGC CAAATCCCGT CTCTGCCAGC CCGGACAGAC GATCCTCTGC
GCAACGCGTT ATGGCAATGT TATGGGGTCG CGCGGCTCGG TCATTCCTCT GTTCATTGAC
CAGCTGCAGC AGCGTAAGCC GCTGACGATC ACCGATCCCA GCATGACTCG CTTTCTCATG
TCACTGGAAG AGTCTGTCGA CCTGGTTCTT TACGCGTTCC AGAATGCTCG CGCCGGCGAC
ATATTCGTGC AGAAGGCCCC GGCCTCCACG GTCGGCGACC TCGCTTTCGC GCTACGTGAA
CTGCTCTCCC GAGACAATCC GATTAAGATC ATCGGCACCC GGCATGGCGA GAAGCTATAT
GAATCGCTGA TCTCGCGGGA AGAAATGCTT CGCGCCGAAG ATCTGGGTGA CTACTATCGC
ATTCCGGCCG ACAGTCGAGA CCTGAACTAC GACAAGTATT TCAGTGAAGG TGAGGTGCGC
ATTGAGACAA TTGACGACTA CACATCTCAC AATACGCATA GACTAGATAT TGAGGGCATC
AAGAAGACTC TGATGAAGCT CGACATCGTG AAGCGGGCTC TGAATGCTTA A
 
Protein sequence
MFENKTLLIT GGTGSFGNAV LHRFLKSDFQ EIRIFSRDEK KQEDMRIALK DDRVKFYIGD 
VRDYEAVDDA INGVDYVFHA AALKQVPSCE FYPMEAIRTN VLGAENVMRA AVNRGVSRCV
VLSTDKAVYP INAMGMSKAM MEKVMVAKSR LCQPGQTILC ATRYGNVMGS RGSVIPLFID
QLQQRKPLTI TDPSMTRFLM SLEESVDLVL YAFQNARAGD IFVQKAPAST VGDLAFALRE
LLSRDNPIKI IGTRHGEKLY ESLISREEML RAEDLGDYYR IPADSRDLNY DKYFSEGEVR
IETIDDYTSH NTHRLDIEGI KKTLMKLDIV KRALNA