Gene Rpal_5203 details


Gene Information

Locus tag: Rpal_5203
Symbol:
ID: 6412903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5608539
End bp: 5611373
Gene Length: 2835 bp
Protein Length: 944 aa
Translation table: 11
GC content: 65%
IMG OID: 642715093
Product: putative bifunctional glutamate synthase subunit beta/2-polyprenylphenol hydroxylase
Protein accession: YP_001994166
Protein GI: 192293561
COG category:
  [C] Energy production and conversion
  [E] Amino acid transport and metabolism
  [H] Coenzyme transport and metabolism
  [R] General function prediction only
COG ID:
  [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
  [COG0543] 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases
TIGRFAM ID:
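
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither is stated in the record, though the listed lengths are consistent with both. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5608539
end_bp = 5611373

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2835 944
```

Both results agree with the Gene Length (2835 bp) and Protein Length (944 aa) listed in the record.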


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGACG TCACCGGAAC GGTCTCCAGC GACGTGCTGC GCTACCAGCA GGCGCGCAGC 
ACGCTGGAAA GCTCCAAGAG CGAACTGGAA GAGCTACAGA ACAGCGAAGC TGTCGGCGTC
TTCCAGAAGC AGCTCGCGCT GCTGCAGAAG CGCCTGGTCA ACGATCCGGC GTCGCTGCGG
AACATGTTCA TCGCGGACGG CACCCAGGCG ATCGTCTGGG AATTCCAACA GCCGGAGCTA
GGTGAAGCCT TCACCACCAC GCTGTGGAAT CTGCTCGCCC GCGGCGACGA CATGTCGATC
ATCCTGCAGC GCTTCATCTG GGCGCTGCCG CTGAAGTTCA AGCGCAAGTT CATCAAGGCG
ATCGACGTGC ATCTGCGCGG TCGCTACCCG ATGTTCGAGA ACCTGTCGGA AGGCTGGCCG
GGCGAAGCGT TCATCCCGCC CTACATCCGT CCGGCGGAAG AGCGCGCGAT CGACTTCGAA
CTCGTGAACC AGGGCTATCT CGGCTACCAG TCGATCGGCT ACTCGGTGCG CGAGTGTGAG
CTGTTCGTCT GGCTCGAGGT GATGCGCGAC AAGCAGTGCG ACGACAAGCC GTGCGAGCTC
GGCGTGCTGA TCCAGGGCAA GAAGGAAGCC AAGGGCGGCT GCCCGGTGAA GATCCACATC
CCCGAGATGC TGGACCTGCT CGGCAACGGC AAGCATCGCG AAGCCTTGGA GCTGATCGAG
AGCTGCAACC CGCTGCCGAA CGTCACCGGC CGCGTCTGCC CGCAGGAACT GCAGTGCCAG
GGCGTCTGCA CCCATACCAA GCGGCCGATC GAGATCGGCC AGCTCGAATG GTACCTGCCC
GAGCATCAGA AGCTCACCAA CCCGAACGCC AATGAGCGCT TCGCCGGCCG GATCTCGCCG
TGGGCTGCGG CCGTGAAGCC GCCGATCGCG GTGGTCGGCT CCGGCCCGTC GGGTCTGATC
AACGCGTACT TGCTGGCGGT CGAGGGCTTC CCGGTCACGA TCTTCGAGGC GTTCCACGAC
CTCGGCGGCG TGCTGCGCTA CGGCATTCCT GAATTCCGTC TGCCCAACAC CCTGATCGAC
GACGTCGTCG AGAAGATCAT CCTGCTCGGC GGCCGCTTCG TGAAGAACTT CGTGGTCGGC
AAGACCGCGA CGCTCGAAGA CCTCAAGGCC GAAGGCTTCT GGAAGATCTT CGTCGGCACC
GGCGCGGGTC TTCCCACCTT CATGAACGTG CCCGGCGAGC ATCTGCTCGG GGTGATGTCG
GCCAACGAGT TCCTGACCCG CGTCAACCTG ATGCGTGGTC TGGACGATCG CTACGAGACG
CCGCTGCCCG AGGTGAAGGA CAAGAACGTG TTCGTGATCG GCGGCGGCAA CACCGCGATG
GACGCGGCGC GCACCGCCAA GCGTCTCGGC GGCAACGTCA CCATCGTGTA TCGCCGCACC
AAGAGCGAGA TGCCGGCCCG CGTCGAGGAG CTGCATCACG CGCTGGAAGA GAACATCAAT
CTCGCGGTGC TGCGCGCGCC GCGCGAATTC ATCGGCGACG ACCACACCCA TTTCGTCACC
CACGCGCTGC TCGACGTCAA CGAGCTCGGC GAACCGGACA AGTCCGGCCG CCGCAGCCCG
AAGCCGACCG GGCAGATCGA GCGGGTGCCG GTCGACCTCG TGATCATGGC GCTGGGCAAC
ACCGCCAACC CGATCATGAA GGACGCCGAG CCCGGCCTGA AGACGAATAA ATGGGGCACG
ATCGAGGTCG AGGAAGGCTC GCAGCGCACC TCGATCAAGG ACGTCTACAC CGGCGGCGAC
GCCGCGCGCG GCGGCTCGAC CGCGATCCGT GCGGCCGGCG ACGGCCAGGC GGCGGCGCGC
GAGATCGTCG GCGAGATCCC GTTCACGCCG GCCGAGATCA AAGACCGCGT CGAACGCGCC
GCCAAGTACA CCGAGCTCGG CCAGATCGAA CAGACGATCG TGGGCAAGGT GCCACTCGCC
GGCGGCATCG TCGAGTTCAC CGTGCGGGCC CCGATGGTGG CGCGCTCGGC GCAGGCCGGC
CAGTTCGTGC GCGTGCTCCC CTGGGAGAAG GGCGAACTGA TCCCGCTGAC GCTGGCCGAT
TGGGACGCCG AGAAGGGCAC CATCGATCTG GTGGTGCAGG GCATGGGCAC CTCGTCGCTG
GAGATCAACC GGATGGCGAT CGGCGATGCG TTCTCTGGCA TCGCCGGCCC GCTCGGCCGC
GCCAGCGAGC TGCACCGCTA CGAGGGCAAC CAGACCGTGG TGTTCTGCGC CGGCGGCGTC
GGCCTGCCGC CGGTGTATCC GATCATGCGC GAGCACCTGC GGCTCGGAAA CCACGTCACG
CTGATCTCCG GTTTCCGCGC CAAGGAGTTC CTGTTCTGGA CCGGCGACGA CGAGCGCGTC
GGCAAGCTCA AGAAGGAGTT CGGCGACCAG CTGTCGCTGA TCTACACCAC CAATGACGGC
TCCTACGGCG TCAAAGGCTT CGTCACCGGC CCGCTCGAGG AGATGATGAA GGCCAACCAG
GAGGGTAAGG GCCGCAGCAT CGCCGAAGTG ATCGCGATCG GCCCGCCGCT GATGATGCGA
GCGGTCTCCG ACCTCACCAA GCCGTATGGC GTCAAGACCG TGGCGAGCCT CAACTCGATC
ATGGTGGATG CGACAGGGAT GTGCGGCGCC TGCATGGTGC CGGTGACGAT CGACGGCAAG
ATGGTGCGCA AGCACGCCTG CATCGACGGC CCGGAAATCG ACGCCCACAT CATCGACTGG
GACAAGTTCC TGCCCCGCTT CAACGCCTTC AAGGCGCAGG AACTGGAGAG CAAGAAGAAG
CACGGCTTCG CGTAA
 
Protein sequence
MSDVTGTVSS DVLRYQQARS TLESSKSELE ELQNSEAVGV FQKQLALLQK RLVNDPASLR 
NMFIADGTQA IVWEFQQPEL GEAFTTTLWN LLARGDDMSI ILQRFIWALP LKFKRKFIKA
IDVHLRGRYP MFENLSEGWP GEAFIPPYIR PAEERAIDFE LVNQGYLGYQ SIGYSVRECE
LFVWLEVMRD KQCDDKPCEL GVLIQGKKEA KGGCPVKIHI PEMLDLLGNG KHREALELIE
SCNPLPNVTG RVCPQELQCQ GVCTHTKRPI EIGQLEWYLP EHQKLTNPNA NERFAGRISP
WAAAVKPPIA VVGSGPSGLI NAYLLAVEGF PVTIFEAFHD LGGVLRYGIP EFRLPNTLID
DVVEKIILLG GRFVKNFVVG KTATLEDLKA EGFWKIFVGT GAGLPTFMNV PGEHLLGVMS
ANEFLTRVNL MRGLDDRYET PLPEVKDKNV FVIGGGNTAM DAARTAKRLG GNVTIVYRRT
KSEMPARVEE LHHALEENIN LAVLRAPREF IGDDHTHFVT HALLDVNELG EPDKSGRRSP
KPTGQIERVP VDLVIMALGN TANPIMKDAE PGLKTNKWGT IEVEEGSQRT SIKDVYTGGD
AARGGSTAIR AAGDGQAAAR EIVGEIPFTP AEIKDRVERA AKYTELGQIE QTIVGKVPLA
GGIVEFTVRA PMVARSAQAG QFVRVLPWEK GELIPLTLAD WDAEKGTIDL VVQGMGTSSL
EINRMAIGDA FSGIAGPLGR ASELHRYEGN QTVVFCAGGV GLPPVYPIMR EHLRLGNHVT
LISGFRAKEF LFWTGDDERV GKLKKEFGDQ LSLIYTTNDG SYGVKGFVTG PLEEMMKANQ
EGKGRSIAEV IAIGPPLMMR AVSDLTKPYG VKTVASLNSI MVDATGMCGA CMVPVTIDGK
MVRKHACIDG PEIDAHIIDW DKFLPRFNAF KAQELESKKK HGFA
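
The gene and protein sequences above are related through the translation table named in the record (table 11). As a minimal sketch, and not the database's own method, the following Python shows one way to strip the display spacing, translate a coding sequence and compute its GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon assignments are the standard ones, which table 11 shares for amino acids; table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table built from the NCBI-style ordering of bases (T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence as listed above.
first_row = "ATGAGCGACG TCACCGGAAC GGTCTCCAGC GACGTGCTGC GCTACCAGCA GGCGCGCAGC"
print(translate(first_row))  # MSDVTGTVSSDVLRYQQARS
```

The translated first row matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field in the record.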