Gene RPB_0850 details


Gene Information

Locus tag: RPB_0850
Symbol:
ID: 3909108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 969528
End bp: 972356
Gene Length: 2829 bp
Protein Length: 942 aa
Translation table: 11
GC content: 65%
IMG OID: 637882743
Product: putative bifunctional glutamate synthase subunit beta/2-polyprenylphenol hydroxylase
Protein accession: YP_484472
Protein GI: 86747976
COG category:
    [C] Energy production and conversion
    [E] Amino acid transport and metabolism
    [H] Coenzyme transport and metabolism
    [R] General function prediction only
COG ID:
    [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
    [COG0543] 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases
TIGRFAM ID:
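
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the database itself. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 969528
END_BP = 972356
GENE_LENGTH_BP = 2829
PROTEIN_LENGTH_AA = 942


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 2829
print(coding_length_aa(GENE_LENGTH_BP))  # 942
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate reading.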


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCC TCACCGTGCC GAACGACGTG CTGCGCTATC AGCAGGCGCG CAGCGCGCTG 
GAAAGCTCCA AGACCGAGCT GGAGGAGCTG CAAGCCAGCG AAGCGGTCGG CGTGTTCCAG
AAGCAGATCG CGCTGTTGCA GAAGCGGCTG CTCAACGACC CGGCGTCGCT GCGCAACATG
TTCATCGCCG ACGGCACCCA GGCGATCGTC TGGGAATTCC AGCAGGCGGA GCTCGGCGAA
GCGTTCACCA CGACGCTGTG GAATCTGCTC GCCCGCGGCG ATGATATGTC GATCATCCTG
CAGCGCTTCA TCTGGGCGCT GCCGCTGAAG TTCAAGCGCA AGTTCATCAA GGCGATCGAC
CTGCATCTGC GCGATCGCTA TCCGATGTTC GAAAACCTGT CGGAAGGCTG GCCGGGCGAG
GCGTTCATCC CGCCTTACAT CCGCCCGGCG GAGCAGCGCG CGGTCGACTT CGACCTCGTC
AACCAGGGCT ATCTCGGCTA CCAGTCGATC GGCTATTCGC TGCGGGAGTG CGAGCTGTTC
GTCTGGCTCG AAGTGATGCG CGACAAGCAG TGCGACGACA AACCGTGCGA GCTCGGCGTG
CTGATCCACG GCAAGAGCGA GCCGAAGGGC GGCTGCCCGG TGAAGATCCA CATCCCCGAG
ATGCTGGACC TGCTCGGCAA CGGCAAGCAT CGCGAAGCCT TGGAGCTGAT CGAGAGCTGC
AACCCGCTGC CCAACGTCAC CGGCCGCGTC TGCCCGCAGG AACTGCAGTG CCAGGGCGTC
TGCACCCATA CCAAGCGGCC GATCGAGATC GGCCAGCTCG AATGGTATCT GCCCGAACAC
GAGAAGCTGG TGAATCCGAA CGCCAATGAG CGCTTTGCCG GCCGCATCAG CCCGTGGGCC
GCGGCGGCAA AACCGCCGAT CGCGGTGGTC GGCTCCGGCC CGTCCGGCCT GATCAACGCG
TACTTGCTGG CGGTCGAAGG CTTCCCGGTC ACGATCTTCG AGGCGTTCCA CGACCTCGGC
GGCGTGCTGC GCTACGGCAT CCCGGAATTC AGATTGCCGA ACCAGCTGAT CGACGACGTC
GTCGAGAAGA TCACGCTGCT CGGCGGCCGC TTCGTGAAGA ACTTCGTGGT CGGCAAGACG
GCCACCTTGG AAGACCTCAA GTCCGAAGGC TTCTGGAAGA TTTTTGTCGG CACCGGCGCG
GGGCTTCCCA CGTTCATGAA CGTGCCCGGC GAGCATCTGC TCGGCGTGAT GTCGGCCAAC
GAGTTCCTGA CCCGCGTCAA CCTGATGCGC GGCCTCGACG ACCGCTACGA GACGCCGTTG
CCCGAGACCA AAGGCAAGAA CGTGTTCGTG ATCGGCGGCG GCAACACCGC GATGGACGCC
GCGCGCACCG CGAAACGTTT GGGCGGCAAC GTCACCATCG TGTATCGCCG CACCAAGAGC
GAGATGCCGG CGCGCGTCGA GGAGCTGCAT CACGCGCTCG AAGAAGGCAT CAATCTCGCG
GTGCTGCGCG CGCCGCGCGA ATTCATCGGC GACGACCACA CCCATTTCGT CACCCACGCG
CTGCTCGACG TCAACGAGCT CGGCGAACCG GACAAATCCG GCCGCCGCAG CCCGAAGCCA
ACTGGCGAGA TCGAACGCGT GCCGGTCGAT CTGGTGATCA TGGCGCTCGG CAACACCGCC
AACCCGATCA TGCGCGACGC CGAGCCCGGG CTTAAGACCA ACAAATGGGG CACGATCGAG
GTCGAAGCCG GCTCGCAGCG CACCTCGATC CAGGACGTGT ACTCCGGCGG CGACGCCGCA
CGCGGCGGCT CCACGGCGAT CCGCGCGGCC GGCGACGGCC AGGCGGCGGC CAAGGAAATC
GTCGGCGAGA TCCCGTTCAC CGCCGCCGAG ATCAAGGACC GTGTGGAGCG CGCGGCGCGC
TACACCGAGC TCGGCCAGAT CGAACAGACC ATCGTCGACA AGGTGACGCT GGCCGGCGGC
ATCGTCGAAT TCACCGTGCG CGCCCCGATG GTGGCGCGCT CAGCGCAGGC CGGGCAGTTC
GTCCGCGTGC TGCCGTGGGA GAAGGGCGAA CTGATCCCGC TGACGCTGGC CGATTGGGAC
GCCGAGAAAG GCACCATCGA CCTCGTGGTG CAGGGCATGG GCACCTCGTC GCTGGAGATC
AACCGGATGG CGATCGGCGA TGCGTTCAGT GGCGTCGCCG GCCCGCTCGG CCGCGCCTCG
GAGCTGCATC GCTACGACGG CAACCAGACC GTGGTGTTCT GCGCCGGCGG CGTCGGCCTG
CCGCCGGTGT ATCCGATCAT GCGCGAGCAC CTGCGGCTCG GCAATCACGT CACGCTGATC
TCCGGCTTCC GCGCCAAGGA GTTCCTGTTC TGGACCGGCG ACGACGAGCG CGTCGGCAAG
CTGAAGCAGG AATTCGGCAA TCAACTGTCG CTGATCTACA CCACCAACGA CGGCAGCTAC
GGCGTCAAGG GCTTCGTTAC CGGGCCGCTG GAAGAGATGA TCAAGGCCAA CCAGCAGGGC
TTCGGCCGCA GCATCGCCGA AGTGATCGCG ATCGGCCCGC CGCTGATGAT GCGGGCGGTG
AGCGATCTCA CCAAGCCCTA CGGCGTTAAG ACCGTGGCGA GCCTCAACTC GATCATGGTG
GATGCGACGG GAATGTGCGG CGCCTGCATG GTGCCGGTGA CCATCGACGG CAAGATGGTG
CGCAAGCACG CCTGCATCGA CGGCCCGGAA ATCGACGCCC ACATCATCGA CTGGGACAAG
TTCCTGCCGC GCTTCAACGC CTTCAAGGCG CAGGAATTGG AGAGCAAGGC CAAGCACGGG
TTCGCGTAG
 
Protein sequence
MSSLTVPNDV LRYQQARSAL ESSKTELEEL QASEAVGVFQ KQIALLQKRL LNDPASLRNM 
FIADGTQAIV WEFQQAELGE AFTTTLWNLL ARGDDMSIIL QRFIWALPLK FKRKFIKAID
LHLRDRYPMF ENLSEGWPGE AFIPPYIRPA EQRAVDFDLV NQGYLGYQSI GYSLRECELF
VWLEVMRDKQ CDDKPCELGV LIHGKSEPKG GCPVKIHIPE MLDLLGNGKH REALELIESC
NPLPNVTGRV CPQELQCQGV CTHTKRPIEI GQLEWYLPEH EKLVNPNANE RFAGRISPWA
AAAKPPIAVV GSGPSGLINA YLLAVEGFPV TIFEAFHDLG GVLRYGIPEF RLPNQLIDDV
VEKITLLGGR FVKNFVVGKT ATLEDLKSEG FWKIFVGTGA GLPTFMNVPG EHLLGVMSAN
EFLTRVNLMR GLDDRYETPL PETKGKNVFV IGGGNTAMDA ARTAKRLGGN VTIVYRRTKS
EMPARVEELH HALEEGINLA VLRAPREFIG DDHTHFVTHA LLDVNELGEP DKSGRRSPKP
TGEIERVPVD LVIMALGNTA NPIMRDAEPG LKTNKWGTIE VEAGSQRTSI QDVYSGGDAA
RGGSTAIRAA GDGQAAAKEI VGEIPFTAAE IKDRVERAAR YTELGQIEQT IVDKVTLAGG
IVEFTVRAPM VARSAQAGQF VRVLPWEKGE LIPLTLADWD AEKGTIDLVV QGMGTSSLEI
NRMAIGDAFS GVAGPLGRAS ELHRYDGNQT VVFCAGGVGL PPVYPIMREH LRLGNHVTLI
SGFRAKEFLF WTGDDERVGK LKQEFGNQLS LIYTTNDGSY GVKGFVTGPL EEMIKANQQG
FGRSIAEVIA IGPPLMMRAV SDLTKPYGVK TVASLNSIMV DATGMCGACM VPVTIDGKMV
RKHACIDGPE IDAHIIDWDK FLPRFNAFKA QELESKAKHG FA
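
The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of that relationship and of the GC content calculation. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons or ambiguous bases. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The sequences are entered as they appear in the record, in space-separated blocks of ten.

```python
# Codon table laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final line of the gene sequence in this record.
first_codons = "ATGAGCAGCC TCACCGTGCC GAACGACGTG"
last_codons = "TTCGCGTAG"

print(translate(first_codons))  # MSSLTVPNDV
print(translate(last_codons))   # FA
```

The two outputs match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.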