Gene RPD_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1031 
Symbol 
ID4021507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1180230 
End bp1182329 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content69% 
IMG OID637961223 
ProductCoA-binding 
Protein accessionYP_568170 
Protein GI91975511 
COG category[C] Energy production and conversion 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.143696 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGAGGGG TTGGCGTGTC GGCGATGACT GATTTCGATA TCCATGCCCT GATCGAGCCG 
AAATCGATCG CAATGGTCGG CGTGTCTCCC GGTGCGCCGA ATTCATGGGG GTTCCGCACC
ATGCGGGTGC TGGCCGAGGG CGGCTATACT GGCGCGCTTT ACGCCGTGCA TCCGACCAAA
ACCGTTCCCG GTTTCGAGAC CGTTCGCTCG CTTCGCGACA AGCCGGCGCC CGATCTGGTC
GCGGTCTGCG TGCGCGCCGA GCAGGCCGTC GATGTGGTCC GCGAGGCGCG GGAGATCGGC
GCCAAGGCCG CGATCGTTTT CGCGTCGAAT TTTGCCGAAA TCGGCGAGGA TGGCGTCCGC
CTGCAGCGTG AGCTGATCGA GGCCGCCGGC GATATGCCGT TTCTCGGGCC GAATTGTCTC
GGCTTTTCCA ATCGCACCGC GTCGGTGAAG ATGTCGGTGG CGCCGTTCCT CAATCGGCCG
TTGCTGCCGC CCGGTCCCGT TGCGCTGGTG GCGCAATCCG GCGCGCTCGG CCTCGTCCTT
GCGCAATGCG TGGAGGAGAG CGGCGTCGGC TACTCGCACT TCATCAGCGT CGGCAATGAA
TGCGTGGTGA CGGCGTCGAT GCTGGCGCGG CAGCTCGTCG AGCGCGACGA TGTCGGGATC
GTTTTCATCT ATCTCGAGAC GCTGCGCGAC CCGCAGGTGC TGGCGGAAGC CGCCGCGCGC
GCCCACGCGC TCGGCAAGCG GATCATCGTG TTGAAGGCCG GCGCCTCCGA CGCGGGCCGG
CGCGCGGCGC TGTCGCACAC CGCGGCGATT GCGGGCAACG ACACGCTGTT CGGGGCGCTT
GCGCGCGACC TGGGCATCGT CAGCATTCGC GACGACGAAG GCGTTCAGCC CGTGCTGGCC
GCGCTCCGGC GCGACTGGGT CATGCCGCCG AAGCCGCGGG TGGCGATCCT CAGCAATTCG
GGCGGCGCGG GCGCGCTGCT GGCCGACCGG CTGGTGGCGG AAGGTGCGCG CGTCGAGGCG
TTTTCCGAGC CGTTGCGGCA GGCGATCCGC CAGACCGGGC TGGTCGAGGC CGGCGATCAA
AATCCGCTCG ACATCGGCGG TGGTTGGGAA GCGTTGCTGG ATCGCGTCGA GCCGTGTCTC
GAAGTTCTCG ACCACGCCGA AGAGGTCGAC GCCGTCGTCG TCTACTATGC GTTCGGCGAC
ATCATCGGCG CGAAGGTCGC GCCGATCGCC GACTATTGCG CGGCGATGTC GAAGCCGGCG
GTGTTCGTCT GGCAGGCTGC GCCGTCGGAA TTCTATGCGA GCGTTACGGC GCGCGACGTT
CTCACCGCGA CGATCGGCGG CGGGGTGCGC GCGGTCGTCG CGCAGATGGC TCTCGCCTCC
GCCGGCGAGG TCGCGTGGCA GCGCCGCGAT GTCGCCGCGG TGGCGTTGCC GGTGGTCCAA
GCGGGCCAGT CCACCATCGC CGAACTCGAC GCGGGCGCCG TCCTGCGAAA GCTTGGGATC
GGCGTCGTCG ATGCGGTCGT GTCCGCGCGC GGACAGGCTG CCGCGGCAAT CGCCGAAGTC
GGCGCCAAGG GATGGACGCG CTGCGTCGTC AAGGGCAACG CCGCCGACGT CCTGCATCGC
AACCGTGTCG GCCTGGTCGA AGTCGGCGTG CCGGTCGAAC GGCTCGCCGA AGTCCTGGAG
CGTTTCGAGC GGCGGCTGGA CGAGGTGTCG TCCGATCCGC AGCGCAGTCT GCTGATTCAG
CCGATGATCG CCTTCGAGGA CGAAATCGGC GTCGGCGCTC TGCTCGATCC GAATTACGGC
CCCGCGATCC TGATCGGGCC GGGCGGCGTC GGCATCGAGG CGGCTTCGGG CGAGCGGCAC
GTCCTGCTGC TGAGCGCGTC CGATGAAGCG CGCGCGGCTT ACCAGAGCCG TGTCGAAGAC
GCTTACGGCC TCGCGCCCGG AACGCTCGAG CCGGTCGTCG CCGGACTCGA GCGGCTGCTC
GCGACGCCGA CTATTTCCGA GATCGATATC AATCCGATGG TGCGAACGCC CGACGGCGGT
CTCATCGCGC TGGATGCTCT CATCGTCGTC GAACCGCATC ACCCGACGGC CGCCGCCTGA
 
Protein sequence
MRGVGVSAMT DFDIHALIEP KSIAMVGVSP GAPNSWGFRT MRVLAEGGYT GALYAVHPTK 
TVPGFETVRS LRDKPAPDLV AVCVRAEQAV DVVREAREIG AKAAIVFASN FAEIGEDGVR
LQRELIEAAG DMPFLGPNCL GFSNRTASVK MSVAPFLNRP LLPPGPVALV AQSGALGLVL
AQCVEESGVG YSHFISVGNE CVVTASMLAR QLVERDDVGI VFIYLETLRD PQVLAEAAAR
AHALGKRIIV LKAGASDAGR RAALSHTAAI AGNDTLFGAL ARDLGIVSIR DDEGVQPVLA
ALRRDWVMPP KPRVAILSNS GGAGALLADR LVAEGARVEA FSEPLRQAIR QTGLVEAGDQ
NPLDIGGGWE ALLDRVEPCL EVLDHAEEVD AVVVYYAFGD IIGAKVAPIA DYCAAMSKPA
VFVWQAAPSE FYASVTARDV LTATIGGGVR AVVAQMALAS AGEVAWQRRD VAAVALPVVQ
AGQSTIAELD AGAVLRKLGI GVVDAVVSAR GQAAAAIAEV GAKGWTRCVV KGNAADVLHR
NRVGLVEVGV PVERLAEVLE RFERRLDEVS SDPQRSLLIQ PMIAFEDEIG VGALLDPNYG
PAILIGPGGV GIEAASGERH VLLLSASDEA RAAYQSRVED AYGLAPGTLE PVVAGLERLL
ATPTISEIDI NPMVRTPDGG LIALDALIVV EPHHPTAAA