Gene RPD_1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1631 
Symbol 
ID4022111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1828624 
End bp1829682 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content69% 
IMG OID637961826 
Productglycosyl transferase, group 1 
Protein accessionYP_568769 
Protein GI91976110 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.54694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCC TGATCGCCAC TGACGCCTGG CATCCGCAGG TCAACGGCGT GGTGCGGACG 
CTGACCATGA TGGCCGAGGC GGCGAAGTCG CTCGGCGCCG AAGTCACGTT CCTGACGCCT
GAAACCTTTT CGACCGTGCG ACTGCCGAGT TACCCGGATC TGCGGATCGC GATCCCGAAT
CCGGCCAAGG TCGCGCGGAT GATCATCGCG GCGCAGCCCG ACTGTATCCA CATCGCGACC
GAAGGGCCGA TTGGGCTGGC CGCGCGGCGC TACTGCCGCA AGCGCGGCCT GCGCTTCACC
ACCAGTTTTC ACACTCGCTT CCCGGAATAC GTCTCCGCAC GCATGCCGAT CCCGGAATCC
TGGGTGTGGG CCTTGCTTCG CCGGTTTCAC GGCGCCAGCC ACGCGGTGAT GGCGGCGACG
CCGGCGCTGG CCGATGAGCT GCGCGGACGG GGCTTCCGCA ATGTGGTGCT GTGGCCGCGC
GGGGTCGACG GCGAGCTGTT TCATCCCCGC GCGGGCGCCG ATCTCGGCCT GCCGCGGCCG
GTGTTCCTGT CGGTCGGACG CGTCGCGGTC GAGAAGAACC TCGAAGCGTT CCTCGGGCTC
GATCTGCCCG GCACCAAGGT CGTGGTCGGG GACGGGCCGG CGCGGGCGGC GCTGCAGCGC
GACTTCCCGC AGGCGGTGTT CCTCGGCGCC AAGCAGGGCG AGGCGCTGGC GCAGGTCTAT
GCTGCAGCGG ATGTGTTCGT GTTTCCGAGC CTGACCGACA CTTACGGGCT GGTGCTGCTC
GAAGCGCTGG CGAGCGGCGT CCCGGTCGCC GCGTTCCCGG TGACCGGCCC GCGCGACGTG
ATTGGCGATG CGCCGGTCGG CGTCCTCAGC GACGACCTGC GACAGGCCTG CCTCGGGGCG
CTCGGGATCT CGCGCGACGC CTGCCTCGGC TTTGCCGCGG ACCACACCTG GACCGCGTCG
GCGCGCGCTT TCATCGACAA TGTCACCCGG GTCTGGATGA TGGACCCCGG TCAAGTTCTC
GCCGCGGATT CCGCAAAACC GCGGCGTCTG GTCGCCTGA
 
Protein sequence
MRILIATDAW HPQVNGVVRT LTMMAEAAKS LGAEVTFLTP ETFSTVRLPS YPDLRIAIPN 
PAKVARMIIA AQPDCIHIAT EGPIGLAARR YCRKRGLRFT TSFHTRFPEY VSARMPIPES
WVWALLRRFH GASHAVMAAT PALADELRGR GFRNVVLWPR GVDGELFHPR AGADLGLPRP
VFLSVGRVAV EKNLEAFLGL DLPGTKVVVG DGPARAALQR DFPQAVFLGA KQGEALAQVY
AAADVFVFPS LTDTYGLVLL EALASGVPVA AFPVTGPRDV IGDAPVGVLS DDLRQACLGA
LGISRDACLG FAADHTWTAS ARAFIDNVTR VWMMDPGQVL AADSAKPRRL VA