Gene RPB_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2107 
Symbol 
ID3908521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2395757 
End bp2397865 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content68% 
IMG OID637884000 
ProductCoA-binding 
Protein accessionYP_485724 
Protein GI86749228 
COG category[C] Energy production and conversion 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0186682 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTC TCCGCGCCGC CCTCGATCCG AAATCCGTGG CCATCATCGG CGCCTCGGAC 
AATCCCAACA AGGTCGGCGG ACGGCCGGTG CATTTTCTCG GCAAGTTCGG CTTTGCGGGC
AAGATCTATC CGATCAATCC CAGCCGGCCC GAAATCCAGG GCCACAAGAG CTACGCGTCG
CTGCAGGATC TGCCGGAAGC GCCCGAGATG GTGATCGTCG CGGTCGCCGG TGACAACGCC
ATCGCCGCGG TCGAAGATTG CGCGGCGTTC GGTGTCAAGA TCGCGGTGGT GATGGCGTCG
GGCTTCGGTG AGGTCGATGC GGGCGAAGGC AAGGCGAAAG AGCGCCGCAT GGTCGCTGCC
GCGCACAAGG CCGGGATGCG CATCGTCGGT CCGAACTCGC AGGGCCTCGC CAATTTCGGC
ACCGGCGCGA TCGCGTCGTT CTCGACGATG TTCATGGACA TGGACCGCGC CGCGGAGGAC
GCCAAGGGCC ACGTCGCGAT GCTGAGCCAG TCCGGCGCGC TGTCGACGGT GCCGGTCGGC
TTCCTGCGCC AGAAGGGCAT CGGCGTCCGC CACACCCACG CCACCGGCAA CGATTCCGAC
ATCACCGTCG GCGAACTGGC CTGCGCGGTC GCCGAGGACT CCGAAGTCAA GCTGATGCTG
CTGTACCTCG AAAGCATCCC GGACAAGCGC TATCTCGAAG AGCTGGCGTC GATCGCACTC
GATCGCGACC TGCCGATCAT CGCGCTGAAG TCGGGCCGCA CCGACGCCGG CAAGGCGGCG
GCGCAATCGC ACACCGGCGC GCTCGCCAAC GAGGACCGCG TGGTCGACGC GTTCTTCGAA
CATCACGGCA TCTGGCGCGC GCCGGACATG CGCGGCCTCG TCGAGGCGAC CGAGCTGTAT
CTCAAGGACT GGAAGCCGCA GGGGCGGCGG CTGGTGGCGA TCAGCAATTC CGGCGCCGTC
TGCGTGCTCA CCGCGGATGC CGCAACCTCC GTCGGCATGC CGATGGCGAA GCTGGCGCCG
CAGACCGACG CGAAGCTGAA AGGCATCCTG CCGAGCTTCG CCACCACCAC CAATCCGATC
GATCTCACCG CGGCGCTGCT GTCGAACAGC GCGCTGTTCG GCGACATTCT GCCGGTGATC
GCCGAGGACC CCGCCGCCGA TGCGTTCCTG ATCGGCGTGC CGGTGGCGGG GCCGGGCTAC
GACGTCGAAG CGTTCTCGCG CGATGCGGCG GCGTTCGGCA AACAGACCGG CAAGCCGCTG
GTGGTGGCGG CGACGCAGCC GAGCGTGGCG CAGGCCTTCG CGGCGAACGG CACCTCGGTG
TTTCCGACCG AAGTCGAGGC GGTCACCGCG CTGCATCAGT TCCTCGCGCA TCGCGAACTG
ATGGCCAAGA CACGCGCGCG CCGCACGGCG CTGGCGCCGG GCGAGGCGCT GATCGCATCG
CCCGCCGCGG AGACCACGAT GCTGAACGAG GCGGACAGTC TCGGTCTGCT GGCCGCGCGC
GGCATTCCGG TGGTGCCGCA TCGGTTGTGC CTGTCGCGCA ACGAGGCGAT CGCGGCGTTC
AACATCATCG GCGGTCCGGT GGTGGTGAAG GGCTGTTCGG CCGATATCGC CCACAAATCC
GAACTCGGCC TGGTGCGGCT CGGCGTCAAT TCGGCCGATA CCACCGGCGA CATCTTCACC
GAGATGGAGC AGATCATCGC GAAAAACGGC TCGCGCTTCG ACGGCGTCAT CGTCGCTTCG
ATGGCCGGCG GCCGCCGCGA GATGATGATC GGCGCGCATC GCGATCCGGT GTTCGGGCCG
GTGGTGGTGG TCGGCGACGG TGGCAAATAT GTCGAGATCG TCAAGGACAC GAGATTGCTG
CTGCCGCCGT TCACGGCGCA GGATGTGCGC GACGCATTGC AGTCGCTGCG GATCGCGCCG
CTGTTCGCAG GCGTCCGCGG CGAGCCGCCG ATGGATCTCG ACGCGCTGGT CGACGCAGTG
GTGAAGGTCG GCGCATTGAT GCGCGATCCC GCCGCGCGCG TCGCCAGCCT CGACCTCAAT
CCGGTGATGC TCGGCAGCGA AGGTCAGGGC TGCGTCGTGG TCGACGCCGT GGTGTTCCAC
GGCGTCTGA
 
Protein sequence
MSRLRAALDP KSVAIIGASD NPNKVGGRPV HFLGKFGFAG KIYPINPSRP EIQGHKSYAS 
LQDLPEAPEM VIVAVAGDNA IAAVEDCAAF GVKIAVVMAS GFGEVDAGEG KAKERRMVAA
AHKAGMRIVG PNSQGLANFG TGAIASFSTM FMDMDRAAED AKGHVAMLSQ SGALSTVPVG
FLRQKGIGVR HTHATGNDSD ITVGELACAV AEDSEVKLML LYLESIPDKR YLEELASIAL
DRDLPIIALK SGRTDAGKAA AQSHTGALAN EDRVVDAFFE HHGIWRAPDM RGLVEATELY
LKDWKPQGRR LVAISNSGAV CVLTADAATS VGMPMAKLAP QTDAKLKGIL PSFATTTNPI
DLTAALLSNS ALFGDILPVI AEDPAADAFL IGVPVAGPGY DVEAFSRDAA AFGKQTGKPL
VVAATQPSVA QAFAANGTSV FPTEVEAVTA LHQFLAHREL MAKTRARRTA LAPGEALIAS
PAAETTMLNE ADSLGLLAAR GIPVVPHRLC LSRNEAIAAF NIIGGPVVVK GCSADIAHKS
ELGLVRLGVN SADTTGDIFT EMEQIIAKNG SRFDGVIVAS MAGGRREMMI GAHRDPVFGP
VVVVGDGGKY VEIVKDTRLL LPPFTAQDVR DALQSLRIAP LFAGVRGEPP MDLDALVDAV
VKVGALMRDP AARVASLDLN PVMLGSEGQG CVVVDAVVFH GV