Gene RPD_0465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0465 
Symbol 
ID4020932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp534811 
End bp535989 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content64% 
IMG OID637960651 
Productalginate o-acetyltransferase AlgJ 
Protein accessionYP_567604 
Protein GI91974945 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0314343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.328045 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCGG CAACCACACG GGCGGGGCAA TGGCTGCGGC GGTTGGCGGC GGCGGCAGTC 
CTGCTGATGC CGGTGCTGAC GCTGTGGAAC ATCGCCGTCC CCTCGCGCGC CTTCAATATC
GGGCCGTCGC TGATCGGCGT CACCAAGCAG ACGCCGCTCG ACCTGTCGCT GCGCGCCTTT
CTCGACGGCA CCCTGCAGAA GACCGCGGCG ATCCGGATCG CCGAAGCGAT GCCGCTGCGG
CGGACGCTGA TCCGGCTCAA CAATCAGATC GCCTATTCGC TGTTCGGCGA GGTCAATGCG
CCGGGGATAC TGGCAGGCGT CGGCGGCCAG TTGGTGGAGC GCAGCTATCT CGAAGAGTAT
TGCACGCGCA GCGACGGCGA CGCCGACCGG CTGGCGGATA CGGTGGTCCC GCTGCTGCGC
AACCTTCAAG CCTACTACCG CGACCGCGGA GCGATTTTTC TCTATGTCGT GACGCCATCA
AAGGTCGCGC ATCTGCCACA ACACTTCGTG CATCTGATCT CCTGCCGCAG CACGGACGCT
GCCCGCGTCG AGCTTGTGCC GCGCTACGTT GCGCGGCTTC GCGCGGCTGG CATCGATGTG
GTCGATGCAG CGACCTCTAC CCATGCGCTG AAGGGCCAAT ATCCGGTCGA GTTGTTCCCA
AGGGGTGGGA TCCACTGGAA CGACCAGGCA ATGGCGCGGG CGTCGCAGCA AATCGTCGAA
GCGGTCAATC GGCAGGCCGG GCGTGAGCTG CTGCCGCAAT TCGACTTCAC CTCGACGGTG
AGCCTGCCAC CGGAAGGCCG CGATCGCGAT CTCGCCGAGC TCATTAACCT ACTCGTCTCG
CCGCTCGACT ACGCCACGCC GAAGTTGACC TTCTCGAATC CGCCGTGTGC GGGCCAACCT
GCGCGGACTA TTGACGCCGC GATTGTCGGC GGCAGTTTCA TGGACGCGGT CGGTGAGGTG
CTGACCGAGA GTGCGTGCAT GGCGCGGCTA AGTCAGTATT TCTACCTTAA ACTCGGTCGC
TATGGCGGCA CGCGCCGGCA ATTGATTCAG GAGAATCTGT CCGACTCCGA CCTGCAGCGG
CTGCGCGACG TCGACATCAT GCTGCTCGAA GAGAACGAAA GCGCGATCGG CCGCCAAGGA
TATCTGACGC TGCTGCACCA TATCGTGACC GACAAATAG
 
Protein sequence
MAAATTRAGQ WLRRLAAAAV LLMPVLTLWN IAVPSRAFNI GPSLIGVTKQ TPLDLSLRAF 
LDGTLQKTAA IRIAEAMPLR RTLIRLNNQI AYSLFGEVNA PGILAGVGGQ LVERSYLEEY
CTRSDGDADR LADTVVPLLR NLQAYYRDRG AIFLYVVTPS KVAHLPQHFV HLISCRSTDA
ARVELVPRYV ARLRAAGIDV VDAATSTHAL KGQYPVELFP RGGIHWNDQA MARASQQIVE
AVNRQAGREL LPQFDFTSTV SLPPEGRDRD LAELINLLVS PLDYATPKLT FSNPPCAGQP
ARTIDAAIVG GSFMDAVGEV LTESACMARL SQYFYLKLGR YGGTRRQLIQ ENLSDSDLQR
LRDVDIMLLE ENESAIGRQG YLTLLHHIVT DK