Gene RPD_2553 details

Gene Information

Locus tag: RPD_2553
Symbol:
ID: 4023047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2858300
End bp: 2860042
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 67%
IMG OID: 637962749
Product: glycosyl transferase family protein
Protein accession: YP_569684
Protein GI: 91977025
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.182713
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGA CCTCTCCAAC GCCCCGCTTC GGTGGCCCCC GCGAGCCCAG AAATGCCATC 
GACCCCGGCC GCGGGCTGGT GGCGGTGCTG GATTTTGCGT CCGTCAGCCA TCTGCGCGCA
GTGGCGTTCT TGGTACTGGT CGGGCTGCTG TTTTTTCTGC CGGGTTTCTT CAACATCCCG
CCGATCGACC GCGACGAGGC CCGCTTCGCC CAGGCCACCA AGCAGATGGT CGAGAGCGAT
GATTTCATCG ATATCCGGTT CCAGGACGAG GTCCGCTACA AGAAGCCGGT CGGGATCTAC
TGGTTGCAAG CCGCGGTCGT CGAAACCGCG TCGCGGCTCG GCCTGCCGCG CGCCGAAGTC
CGGATCTGGC TCTATCGCGT GCCGTCGCTG GCCGGTGCGA TCGGCGCGGT GCTGATGACC
TATTGGGCGG CGCTCGCCTT TGTCGGGCGG CGCGGGGCGG TGATCGCCGG TCTTCTGTTG
TGCAGCTCGA TCTTGCTGGG TGTCGAGGCG CGACTGGCCA AGACCGACGC GTTCTTGCTG
TTCACGGTGA CCGCCGCGAT GGGCGCGATG GCGCATGTCT ATCTCGCCTG GCAGCGCGGC
GACGACCGCT ACCACTCCTG GATCACGCCG GCCATTTTCT GGACCGCGGT CGCCGCGGGC
ATTCTTCTCA AGGGCCCGCT GATCCTGATG TTCATCGCGC TGACGGTGGC CGCGCTGGCG
TTTGTCGATC GCTCGGCGGT GTGGCTGTGG CGTCTGAAGC CGCTGGCCGG CGTGCTGTGG
ATGCTGGTGC TGGTGCTGCC GTGGTTCATC GCGATCTTCC TGCGCGCCGG CGACACCTTC
TTCGCCGACT CGGTCGGCGG CGACATGTTG AGCAAGATCG CCAGTCCCAA GGAATCCCAC
GGCGCGCCGC CGGGCCTGTA TTTCCTGCTG TTCTGGGTGA CGTTCTGGCC GGGCGCGCCG
TTGGCCGCGA TGGCTGCGCC TGCGGTGTGG CGGGCACGGC GCGAGCCGGG CGCGCAATAT
TTGCTGGCCT GGGTGATCCC GTCCTGGATC GTGTTCGAGC TGGTGATCAC CAAGCTGCCG
CACTATGTGC TGCCGCTGTA TCCGGCGATC GCGATCATGA CCGCTGGCGC GATCGAGCAC
AGCGTGCTGT CGCGCTCCTG GCTGACCCGC GGCGCGGCGT GGTGGTTCGC GATTCCAGTC
GTCGTGCTGT CGCTCGCGAT CATCGGCGCC ATCATCCTGA CCCGGCAGCC GGCGTTCCTG
GCGTGGCCGT TCGTCGCGGC CTCGCTGATT TTCGGGCTGT TCGCGTGGCG GCTGTTCGAC
CAGAACCGCG CCGAAGCCTC GCTGCTCAAC GCCTCGCTGG CGTCGCTGTT TCTGATGGTC
GCCGCGCTCG GCGTCGTGGT GCCGACGCTG CGGCCGGTGT TCCCGAGCGT CGAGATCGCG
CAGGCGCTGC GCAAGGTGGT GTGCGTCGGG CCTAAGGCCG CGGCCGTGGG CTTCCACGAG
CCGAGCCTGG TGTTCATGAC CGGCACCGAT ACGTTGCTGA CCGACGGCTC CGGTGCCGCC
GACTTCCTGC TCGGCGGAAG CTGCCGCTTC GCGCTGGTGG AAGCTCGCAG CGAGCGCGCA
TTCGCGGCGC GGGCCGAGGC GATCGGGTTG CACTACAACG TGGCGACCCG GATCGACGGC
TACAATTTCT CGCAGGGCAA GCCGGTGTCG ATCGCGATCT TCCGTTCCGA AGGCACGCAG
TAA
 
Protein sequence
MTETSPTPRF GGPREPRNAI DPGRGLVAVL DFASVSHLRA VAFLVLVGLL FFLPGFFNIP 
PIDRDEARFA QATKQMVESD DFIDIRFQDE VRYKKPVGIY WLQAAVVETA SRLGLPRAEV
RIWLYRVPSL AGAIGAVLMT YWAALAFVGR RGAVIAGLLL CSSILLGVEA RLAKTDAFLL
FTVTAAMGAM AHVYLAWQRG DDRYHSWITP AIFWTAVAAG ILLKGPLILM FIALTVAALA
FVDRSAVWLW RLKPLAGVLW MLVLVLPWFI AIFLRAGDTF FADSVGGDML SKIASPKESH
GAPPGLYFLL FWVTFWPGAP LAAMAAPAVW RARREPGAQY LLAWVIPSWI VFELVITKLP
HYVLPLYPAI AIMTAGAIEH SVLSRSWLTR GAAWWFAIPV VVLSLAIIGA IILTRQPAFL
AWPFVAASLI FGLFAWRLFD QNRAEASLLN ASLASLFLMV AALGVVVPTL RPVFPSVEIA
QALRKVVCVG PKAAAVGFHE PSLVFMTGTD TLLTDGSGAA DFLLGGSCRF ALVEARSERA
FAARAEAIGL HYNVATRIDG YNFSQGKPVS IAIFRSEGTQ
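
Consistency check (illustrative)

The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is not part of the database; the function names are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates (which agrees with the values listed here) and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq_text: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(seq_text.split())


# Values copied from the Gene Information fields of this record.
length = gene_length(2858300, 2860042)   # 1743
residues = protein_length(length)        # 580
```

Both results match the Gene Length and Protein Length fields. Passing the text under "Gene sequence" to strip_blocks gives a contiguous string whose length can be compared with the same Gene Length value.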