Gene RPC_4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4022 
Symbol 
ID3969212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4471330 
End bp4473492 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content65% 
IMG OID637927126 
Productmalate synthase G 
Protein accessionYP_533867 
Protein GI90425497 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01345] malate synthase G 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.17876 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCGCA TCAAAGCCCA CGGCCTGCAG ATTGCGCCAG TTCTGTTCGA CTTCATCGCC 
AAGGAGGCCG CCCCCAAGAC CGGCATCGAT CCGGATGCGT TCTGGGCCGG CCTGGCCGGC
ATCGTGCGCG ACCTGGCACC GAAGACCCGC GAGCTGCTGG CGCTGCGCGA CGCGCTGCAG
CTCAAGATCG ACGATTGGCA CAAAGCCAAC AAGGGCAAGC CCTGGGATAT CGAGGCCTAC
ACCGCGTTCC TGAAAGAGAT CGGCTATCTG TTGCCGGAGC CGGCCACCGT CGAAGTCGAG
ACGACCGATG TCGACGAGGA GATCGGCAAG ATCTGCGGCC CGCAGCTGGT GGTGCCGCTC
AGCAACGCGC GCTACTCGCT GAACGCCGCC AACGCGCGCT GGGGCTCGCT GTACGACGCG
CTGTACGGCA CCGATGCGAT CCCGCATGAC GCTTCGGAGA CCGGCCGCGG TTACAACAAG
GCGCGCGGCG CCAAGGTAAT CGCCAAGGCG AAAGCCTTCC TCGACCAAGC GGTGCCGCTG
GCGACCGCCA GCCACACCGA TGTGACGTCC TACAGCGTGA TCGCCGGTCA TCTGTCGGTG
AAGCTGAAGA GCGGCAACGC CACCGCGCTG AAGAACGCCG CGCAGTTCGC CGGCTTCCTC
GGCGATGCAG CGGCACCGAC CGCGATCCTG CTGGTCAACC ACGGCATGCA TATCGAAATC
AAGGTCGATC GCTCCAGCGT GATCGGCAAG GACGATGCTG CCGGCGTCGC CGACGTGATC
CTGGAATCGG CCGTCTCCAC CATTCTCGAC ATGGAAGACT CGGTCGCCGC TGTCGACGCC
GAAGACAAGG TGCTGATCTA TCGCAACGTG CTCGGCCTGA TGGATGGCAC GCTCACCGCC
GACTTCGAGA AGGGCGGCAA GACGCTGACC CGCGCGCTGA ACGGCGACCG CGTGTACCAG
ACCGTCGACG GCAAGGGCCT GACGCTGCAC GGTCGCAGCC TGTTGCTGAT GCGCAATGTC
GGCCATCACA TGTTCACCGA CGCGGTGCTC GACGAATCCG GCGCAGAGAT CCCCGAAGGC
TTCCTCGACG CCGCGGTCGC CGGCCTGCTG GCGATTCACG ACCTCAAGGG TGGCTCCAAG
ACCCGCAACA GCCGCACCGG CTCGGTCTAT ATCGTCAAGC CGAAGATGCA CGGCCCCGAC
GAGGTGGCGC TGACCGTCGA ATTGTTCGGC CGTGTCGAGC AGATGCTCGG CCTTGCGGAG
AACACCATGA AGGTCGGCAT CATGGACGAG GAGCGCCGCA CCACGGTCAA CCTCAAGGCC
TGCATCCAGA ATGCGTCGAA GCGCATTGTG TTCATCAACA CCGGCTTCCT CGATCGCACC
GGCGACGAGA TCCACACCTC GATGGAAGCG GGCCCGATGA TCCGCAAGAA CGAGATGAAG
GCGCAGCCCT GGATCAAGGC CTATGAAGAC TGGAACGTCG ACATGGGACT GATCGATGGC
CTGCCCGGCC ACGCCCAGAT CGGCAAGGGC ATGTGGGCGG CGCCCGACAA GATGGCGGAC
ATGCTGGCGC AGAAGGTCGG CCATCCGCAG GCCGGCGCCA CCACCGCCTG GGTGCCCTCG
CCCACCGCCG CGACGCTGCA CGCGCTGCAT TATCACCAGG TCGACGTGCT GAAGCGGCAG
CAGGAGTTGA AAACCGGCGG TCCGCGCGCC AAACTCAGCG ACATCCTCAC CATCCCGGTG
TCGCAGTCGA ACTGGGCGCC GGACGACGTC CGCCAAGAGA TCGACAACAA CTGCCAGGGC
ATTCTCGGCT ATGTGGTGCG CTGGATCGAC CAGGGCGTCG GCTGCTCCAA GGTGCCGGAC
ATCCACGACG TCGGACTGAT GGAAGACCGC GCCACGCTGC GGATCTCCAG CCAGCATCTG
GCGAATTGGC TGCATCAGGG TGTCATCACC AAGGAGCAGG TGATGGAGTC CTTGAAGCGG
ATGGCTGTGG TGGTCGACAA GCAGAACGAA GGCGACGCGC TGTACAAGCC GATGGCGCCC
GGCTTCGACG GCGTGGCGTT TAAGGCCGCC TGCGACCTGA TCTTCAAGGG CCGCGAGCAG
CCGAACGGCT ATACCGAGTT CATCCTCACC GCGCGGCGCC GCGAAGCCAA GGCCGCGGTG
TGA
 
Protein sequence
MNRIKAHGLQ IAPVLFDFIA KEAAPKTGID PDAFWAGLAG IVRDLAPKTR ELLALRDALQ 
LKIDDWHKAN KGKPWDIEAY TAFLKEIGYL LPEPATVEVE TTDVDEEIGK ICGPQLVVPL
SNARYSLNAA NARWGSLYDA LYGTDAIPHD ASETGRGYNK ARGAKVIAKA KAFLDQAVPL
ATASHTDVTS YSVIAGHLSV KLKSGNATAL KNAAQFAGFL GDAAAPTAIL LVNHGMHIEI
KVDRSSVIGK DDAAGVADVI LESAVSTILD MEDSVAAVDA EDKVLIYRNV LGLMDGTLTA
DFEKGGKTLT RALNGDRVYQ TVDGKGLTLH GRSLLLMRNV GHHMFTDAVL DESGAEIPEG
FLDAAVAGLL AIHDLKGGSK TRNSRTGSVY IVKPKMHGPD EVALTVELFG RVEQMLGLAE
NTMKVGIMDE ERRTTVNLKA CIQNASKRIV FINTGFLDRT GDEIHTSMEA GPMIRKNEMK
AQPWIKAYED WNVDMGLIDG LPGHAQIGKG MWAAPDKMAD MLAQKVGHPQ AGATTAWVPS
PTAATLHALH YHQVDVLKRQ QELKTGGPRA KLSDILTIPV SQSNWAPDDV RQEIDNNCQG
ILGYVVRWID QGVGCSKVPD IHDVGLMEDR ATLRISSQHL ANWLHQGVIT KEQVMESLKR
MAVVVDKQNE GDALYKPMAP GFDGVAFKAA CDLIFKGREQ PNGYTEFILT ARRREAKAAV