Gene RPB_1395 details


Gene Information

Locus tag: RPB_1395
Symbol:
ID: 3908345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1584818
End bp: 1586992
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table: 11
GC content: 66%
IMG OID: 637883289
Product: malate synthase G
Protein accession: YP_485016
Protein GI: 86748520
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01345] malate synthase G
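
The coordinate and length fields above are related by simple arithmetic, and a quick check can catch copy errors. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 1584818
end_bp = 1586992
gene_length_bp = 2175
protein_length_aa = 724

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which does not contribute an amino acid.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)             # gene length check
print(computed_protein_length == protein_length_aa)  # protein length check
```

Under these assumptions both checks agree with the values listed in the record.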


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.528165
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGTA TCGATGCCCA CGGCTTGAAA ATTGCGCCCG TCCTGTTCGA CTTCATCGCC 
AAGGAGGCCG CGCCGAAGAC CGGCGTCGCA CCGGACGCCT TCTGGGCCGG GCTCGCCGCG
ATCGTGCGCG ATCTGACGCC GAAGACGGTC GCGCTGCTGC AGCATCGCGA CGGCCTGCAG
GCCAAGATCG ATGCCTGGCA TCTCGCCAAC AAGGGCAAGA AGCAGGACAT GGCGGCCTAC
ACCGCCTTCC TGAAAGAGAT CGGCTATCTG CTGCCGGAGC CGCCGACGGT CGCGGTCGAG
ACCGCCAATG TCGACGACGA GATCGGCAAG CTGTGCGGCC CGCAGCTCGT GGTGCCGCTG
TCGAATGCGC GCTACGCACT CAACGCGGCG AACGCGCGTT GGGGCTCGCT GTATGACGCG
TTCTATGGCA CCGACGCGAT CCCGCAGGAG GCCAGCCAGG CCAAGGGCTA CGACAAGGCG
CGCGGCGACA AGGTGATCGC CAAGGCCAAG GCGTTCCTCG ACCAGGCCGC GCCGCTGGTG
GCCGGCAGCC ACAGCGACGT CACCGCCTAC AGCGTGATCG CCGGCCAGCT TTCGGCAAAA
CTCAAGAGCG GCAACGCCAC CGGCCTGAAG ACCCCCCGGC AGTTCGCCGG CTATCTGGGC
GACGCCGCGT CGCCGAGCGC GGTGCTGCTG GTCAATAACG GCCTGCATAT CGAGATCAAG
ATCGACCGCG CCAATACCAT CGGCAAGGAC GATGCGGCCG GCGTCGCCGA CCTCGTGATC
GAATCCGCGG TGTCGACCAT CCTCGACATG GAGGACTCGG TCGCAGCCGT CGACGCCGAA
GACAAGGTGC TGATCTACCG CAACACGCTC GGCCTGATGG ACGGCACGCT GTCGGCCGAT
TTCGACAAGG GTGGCAAGAC CGTCACCCGC GCGCTGAACG GCGACCGCAC TTATACCGGC
CCGGACGGCA AGGACGTCAC CCTGCACGGG CGCAGCCTGC TGCTGATGCG CAATGTCGGC
CATCACATGT GGACCGATGC GGTGCTCGAT TCGAACGGCG ACGAGATCCC CGAGGGCTTC
CTCGACGCCG CGGTGTCCGG CCTACTGGCG ATCCACGACC TCAAGGCACT CGGCAAGACC
CGCAACAGCC GCACCGGCTC GGTCTACATC GTCAAGCCGA AGATGCACGG CCCGGACGAA
GTGGCGCTGA CTTGCGAGCT GTTCGGCCGC GTCGAGCAGA TGCTCGGCCT GAAGGAGAAC
ACGCTGAAGG TCGGCATCAT GGACGAGGAG CGCCGCACCA CGGTGAACCT CAAGGCCTGC
ATCCAGAACG CGTCCAAGCG CATCGTGTTC ATCAACACGG GTTTCCTGGA CCGCACCGGC
GACGAGATCC ACACCTCGAT GGAAGCGGGT CCGATGATCC GCAAGAACGA GATGAAGGCG
CAGCCCTGGA TCAAGGCGTA CGAGGACTGG AACGTCGACA CCGGCCTGAT CGACGGCCTG
CCCGGCCATG CCCAGATTGG TAAGGGCATG TGGGCGGCCC CCGACAAGAT GGCCGACATG
CTGACGCAGA AGATCGGCCA CCCGCAGGCC GGCGCCACCA CCGCCTGGGT GCCGTCGCCG
ACCGCCGCGA CGCTGCACGC GCTGCACTAT CACCAGGTCG ACGTGCTGGC GCGTCAGCAG
GAGCTGAAGA CGGGCGGCCC GCGCGCCAAG CTCGAGGACA TCCTCACCAT CCCGGTGTCG
CAATCGAATT GGGCGCCGGA CGACGTCCGC CAGGAGATCG ACAACAACTG CCAGGGTATC
CTCGGCTACG TCGTGCGCTG GATCGATCAG GGCGTCGGCT GCTCCAAGGT GCCGGACATC
CACGACGTCG GCCTGATGGA AGACCGCGCC ACGCTGCGCA TCTCCAGCCA GCATCTGGCG
AACTGGCTGC ATCACGGCGT CGTCACCAAG GAGCAGGTGC TGGAATCGCT GAAGCGGATG
GCGGCGGTGG TCGACAAGCA GAACGCTGGT GATCCGCTGT ACCGGCCGAT GGCGCCGGAC
TTCGACGGCG TCGCCTTCGA GGCGGCGTGC GACCTGATCT TCAAGGGCCG CGAGCAGCCG
AACGGCTACA CCGAATTCAT CCTGCATATC CGCCGCCGCG AAGCCAAGGC GGCGCATCTG
CAGGACCTGA CCTGA
 
Protein sequence
MNRIDAHGLK IAPVLFDFIA KEAAPKTGVA PDAFWAGLAA IVRDLTPKTV ALLQHRDGLQ 
AKIDAWHLAN KGKKQDMAAY TAFLKEIGYL LPEPPTVAVE TANVDDEIGK LCGPQLVVPL
SNARYALNAA NARWGSLYDA FYGTDAIPQE ASQAKGYDKA RGDKVIAKAK AFLDQAAPLV
AGSHSDVTAY SVIAGQLSAK LKSGNATGLK TPRQFAGYLG DAASPSAVLL VNNGLHIEIK
IDRANTIGKD DAAGVADLVI ESAVSTILDM EDSVAAVDAE DKVLIYRNTL GLMDGTLSAD
FDKGGKTVTR ALNGDRTYTG PDGKDVTLHG RSLLLMRNVG HHMWTDAVLD SNGDEIPEGF
LDAAVSGLLA IHDLKALGKT RNSRTGSVYI VKPKMHGPDE VALTCELFGR VEQMLGLKEN
TLKVGIMDEE RRTTVNLKAC IQNASKRIVF INTGFLDRTG DEIHTSMEAG PMIRKNEMKA
QPWIKAYEDW NVDTGLIDGL PGHAQIGKGM WAAPDKMADM LTQKIGHPQA GATTAWVPSP
TAATLHALHY HQVDVLARQQ ELKTGGPRAK LEDILTIPVS QSNWAPDDVR QEIDNNCQGI
LGYVVRWIDQ GVGCSKVPDI HDVGLMEDRA TLRISSQHLA NWLHHGVVTK EQVLESLKRM
AAVVDKQNAG DPLYRPMAPD FDGVAFEAAC DLIFKGREQP NGYTEFILHI RRREAKAAHL
QDLT
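
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The following is a minimal sketch of that cleanup, with a codon-by-codon translation and a GC percentage helper. The function names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is built from the standard compact layout, on the assumption that translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons are permitted. That difference does not matter here because the gene starts with ATG.

```python
# Compact codon table: bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last two blocks of the gene sequence, as printed.
first_blocks = "ATGAATCGTA TCGATGCCCA CGGCTTGAAA"
last_blocks = "CAGGACCTGA CCTGA"

print(translate(first_blocks))  # MNRIDAHGLK
print(translate(last_blocks))   # QDLT
```

The two translated fragments match the first block and the final residues of the protein sequence listed above. Applying `translate` and `gc_percent` to the full gene sequence would allow the whole record to be checked in the same way.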