Gene Rpal_4697 details

Gene Information

Locus tag: Rpal_4697
Symbol:
ID: 6412383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5055908
End bp: 5058082
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table: 11
GC content: 65%
IMG OID: 642714576
Product: malate synthase G
Protein accession: YP_001993663
Protein GI: 192293058
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01345] malate synthase G
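
The coordinate and length fields in this record are related by simple arithmetic, which the sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5055908, 5058082)
print(length)                  # 2175
print(protein_length(length))  # 724
```

Both values agree with the Gene Length and Protein Length fields listed in this record.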


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCGTA TTGACGCCCA CGGACTGAAA ATCGCGCCTG TGCTGTTCGA CTTCATCGCC 
AAGGAGGCCG CGCCGAAAAC CGGCATCGCT CCCGACGTAT TCTGGGCCGG GCTCGCTGCG
ATCGTTCGTG ATCTGGCGCC GAAGACCCGC GCGCTGCTGA AGACCCGCGA CGACCTGCAG
GCCAAGATCG ATGCGTGGCA TCTCGCCAAC AAGGGCAAGA AGCAGGACAT GGCGGCCTAC
ACCGCCTTCC TGAAGGAGAT CGGCTACCTG CTGCCCGAGC CGCCGACGGT GCCGGTCGAG
ACCGCCAATA TCGACGAAGA GATCGGCAAG CTGTGCGGCC CGCAGCTCGT GGTGCCGCTG
TCGAATGCGC GCTACGCACT GAATGCCGCC AACGCGCGCT GGGGTTCGCT GTACGACGCA
TTCTACGGCA CCGACGCGAT CCCGCAGGAA GCCACCCAGG CTAAGGGCTA CGACAAGGCG
CGCGGCGACA AGGTGATCGC CAAGGCCAAG GCATTCCTCG ACCAAGCCGC GCCGCTCGCG
ACCGGCAGCC ATAGCGACGT CACTGGTTAC AGCGTGATCG CCGGCCAGCT GTCGGCCAAG
CTGAAGAGCG GCAATGCCAC CGGCCTGAAG AAACCGGCAC AGTTCGCCGG CTTCCGCGGC
GATGCCGCCA ATCCGAGCGC GGTGCTGCTG GTCAACAACG GCCTGCACAT CGAGATCAAG
ATCGATCGCG CCAACACCAT CGGCAAGGAC GATCCGGCCG GCGTCGCCGA CCTGGTCATC
GAGTCGGCGG TCTCGACCAT TCTCGACATG GAAGACTCGG TCGCCGCCGT CGATGCCGAC
GACAAGGTGC TGATCTATCG CAACACCCTC GGCCTGATGG ACGGCACGCT GTCGGAAAGC
TTTGAGAAGG GCGGCAAGAC CGTTACCCGC GCGCTCAACG GCGACCGCAC CTACACCGCG
CCGGACGGCA AGGAGATCTC GCTGCACGGC CGCAGCCTGC TGCTGATGCG CAACGTCGGC
CATCACATGT GGACCGATGC GGTGCTCGAC AGCGACGGCC AGGAGATTCC GGAAGGCTTC
CTCGACGCTG CGGTGTCCGG CCTGATCGCG ATCCACGATC TCAAGCACCT CGGCAAGACC
CGCAACAGCC GCACCGGCTC GGTCTACATC GTCAAGCCGA AGATGCACGG CCCGGATGAA
GTCGCCCTCA CCGTCGAGCT GTTCGGCCGC GTCGAGACCA TGCTCGGCCT GACCGCGAAC
ACCCTGAAGG TCGGCATCAT GGACGAGGAA CGCCGCACCA CGGTGAACCT CAAGGCCTGC
ATCCAGAACG CGTCGAAGCG GATCGTCTTC ATCAACACCG GCTTCCTCGA TCGCACCGGC
GACGAGATCC ACACCTCGAT GGAAGCGGGT CCGATGATCC GCAAGAACGA GATGAAGGCG
CAGCCCTGGA TCAAGGCCTA CGAAGACTGG AACGTCGACA CCGGTTTGGT CGACGGCCTG
CCGGGTCACG CCCAGATCGG CAAGGGCATG TGGGCGGCCC CCGACAAGAT GGCCGACATG
CTGGCGCAGA AGATCGGTCA CCCGCAGGCC GGCGCGACCA CCGCCTGGGT GCCGTCGCCG
ACCGCCGCGA CGCTGCACGC GCTGCACTAT CACCAGGTCG ACGTGATCGC GCGCCAGCAG
GAGCTGGCGA AGGGCGGTCC GCGCGCCAAG CTCGAAGACA TCCTCACCAT CCCGGTGTCG
AACTCGAACT GGGCGCCGGA CGATGTCCGC CAGGAGATCG ACAACAACTG CCAGGGCATC
CTCGGCTACG TGGTGCGCTG GATCGACCAG GGCGTCGGCT GCTCCAAGGT GCCGGACATC
CACGACGTCG GCCTGATGGA AGACCGCGCG ACGCTGCGCA TCTCAAGCCA GCACCTCGCC
AACTGGCTGC ATCACGGCGT CGTCACCAAG GACCAGGTGC TCGACTCGCT GAAGCGGATG
GCGGTGATCG TCGACAAGCA GAACGAAGGC GATGCGCTGT ACCGGCCGAT TGCGCCGGAC
TTCGACGGCG TCGCGTTCGA AGCCGCGTGC GACCTGATCT TCAAGGGCCG CGCGCAGCCG
AACGGCTACA CCGAATACAT CCTGCATGAG CGCCGCCGCG AGGCCAAGGC GGCGCACCTG
GAGTCGGCAC GCTAA
 
Protein sequence
MNRIDAHGLK IAPVLFDFIA KEAAPKTGIA PDVFWAGLAA IVRDLAPKTR ALLKTRDDLQ 
AKIDAWHLAN KGKKQDMAAY TAFLKEIGYL LPEPPTVPVE TANIDEEIGK LCGPQLVVPL
SNARYALNAA NARWGSLYDA FYGTDAIPQE ATQAKGYDKA RGDKVIAKAK AFLDQAAPLA
TGSHSDVTGY SVIAGQLSAK LKSGNATGLK KPAQFAGFRG DAANPSAVLL VNNGLHIEIK
IDRANTIGKD DPAGVADLVI ESAVSTILDM EDSVAAVDAD DKVLIYRNTL GLMDGTLSES
FEKGGKTVTR ALNGDRTYTA PDGKEISLHG RSLLLMRNVG HHMWTDAVLD SDGQEIPEGF
LDAAVSGLIA IHDLKHLGKT RNSRTGSVYI VKPKMHGPDE VALTVELFGR VETMLGLTAN
TLKVGIMDEE RRTTVNLKAC IQNASKRIVF INTGFLDRTG DEIHTSMEAG PMIRKNEMKA
QPWIKAYEDW NVDTGLVDGL PGHAQIGKGM WAAPDKMADM LAQKIGHPQA GATTAWVPSP
TAATLHALHY HQVDVIARQQ ELAKGGPRAK LEDILTIPVS NSNWAPDDVR QEIDNNCQGI
LGYVVRWIDQ GVGCSKVPDI HDVGLMEDRA TLRISSQHLA NWLHHGVVTK DQVLDSLKRM
AVIVDKQNEG DALYRPIAPD FDGVAFEAAC DLIFKGRAQP NGYTEYILHE RRREAKAAHL
ESAR
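
The gene and protein sequences can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-base block spacing, computes a GC fraction, and translates codons using the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes the sequence starts in frame. Alternative start codons
    (e.g. GTG, TTG) are not special-cased; this gene starts with ATG.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGAATCGTA TTGACGCCCA CGGACTGAAA ATCGCGCCTG TGCTGTTCGA CTTCATCGCC"
last_line = "GAGTCGGCAC GCTAA"
print(translate(first_line))  # MNRIDAHGLKIAPVLFDFIA
print(translate(last_line))   # ESAR
```

The two printed fragments match the first 20 and last 4 residues of the protein sequence above. Passing the full gene sequence to `gc_content` would give the value to compare with the GC content field.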