Gene RPD_2544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2544 
Symbol 
ID4023037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2844148 
End bp2845263 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content65% 
IMG OID637962739 
Productglycosyl transferase, group 1 
Protein accessionYP_569675 
Protein GI91977016 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG CCTTCGCCAT CGTCACCTTG TTTTCCGCCG GCGGTCTGCA GCGAGACTGC 
ATGGCGATTG CCGCGCGGCT GGCCGCGCGC GGCCATGACG TGACGATTTT CACCGAACGT
CGGAAGGGCG AAATTCCGCC CGACCTCCAC GTCGAACTGC TGCCCAACCG GAAGCTCAGC
AATCACCGCC GCGACCTGAA ATTCGCTGAG GCCGTGCTGC AGCGATGCGA GGGCCAGTTC
GACCGCGTGG TCGGGTTCGG CAAGCTGCTT GGTCTGGACG TGCTGTATTG CGCCGATCCG
TGTCTCGCCG CCCGCCGGGT GGGCTGGCTG TCGAAATGGA GTTCGCGCCG TCGGATTCAA
TTGCTGCTCG AAGCCGACAG CTTCAAGCAG GGGCAGAACA CCATTTGCCT GCTGTTGAGC
GACAATCAGG TGCGCGAGTT TCGCAGCGCC TGGTCGACCG AACCCGACCG TATCGAAGTT
CTGCCGCCGA CCATCGATCT CGGCCGGCGG CATTCCGAGT TTCGAACCGA CGGCACGCGC
GAGCGGATCC GCGCCAGCCT GGGCGTCGCT CCGACGGATC AGCTCTGGCT TGCGATCGCG
AACCAGCCCA ACGTGAAGGG ACTCGATCGC ACGCTAACAG CGATGAAAGA GTTCGCAACG
GTCCGGCTGG CGATCGCCGG AATCAAACAG GGCAGCAAGC AGGCGACGCA GGTGCTCGGC
TGGGCGCGCA GCGTCGGCGT CGCGGATCGC GTTCAGTTTC TCGGTTTCCG CGCCGACGTT
CCCGAACTGA TGGCCGCTGC CGATCTGCTG GTTCATCCGG CGCGCTACGA CACCACCGGA
ACCGTCATCC TCGAAAGCCT GATCAACGGC CTGCCGGTGA TCACCACGGC CGAGTGCGGC
TACGCCCCCC ACGTCGCCAA GGCCGATGCG GGGTTTGTCG TTCCGAGCCC GTTCGCGCAG
GAGACGCTGA CCAGGGCATT GGCGGCAGCA TCGGACACCC AGCGCGACCA CTGGAGCCGG
AATGGCGTCG CCTACGGTGT TGCGGAAGAG CTTTACCACG GGCTCGACAG GGCCACGGAC
ATCATTGCTG ACCCGGAGTT TCTGTCCGTC CGCTGA
 
Protein sequence
MKIAFAIVTL FSAGGLQRDC MAIAARLAAR GHDVTIFTER RKGEIPPDLH VELLPNRKLS 
NHRRDLKFAE AVLQRCEGQF DRVVGFGKLL GLDVLYCADP CLAARRVGWL SKWSSRRRIQ
LLLEADSFKQ GQNTICLLLS DNQVREFRSA WSTEPDRIEV LPPTIDLGRR HSEFRTDGTR
ERIRASLGVA PTDQLWLAIA NQPNVKGLDR TLTAMKEFAT VRLAIAGIKQ GSKQATQVLG
WARSVGVADR VQFLGFRADV PELMAAADLL VHPARYDTTG TVILESLING LPVITTAECG
YAPHVAKADA GFVVPSPFAQ ETLTRALAAA SDTQRDHWSR NGVAYGVAEE LYHGLDRATD
IIADPEFLSV R