Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4022 |
Symbol | |
ID | 3969212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4471330 |
End bp | 4473492 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637927126 |
Product | malate synthase G |
Protein accession | YP_533867 |
Protein GI | 90425497 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.17876 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGCA TCAAAGCCCA CGGCCTGCAG ATTGCGCCAG TTCTGTTCGA CTTCATCGCC AAGGAGGCCG CCCCCAAGAC CGGCATCGAT CCGGATGCGT TCTGGGCCGG CCTGGCCGGC ATCGTGCGCG ACCTGGCACC GAAGACCCGC GAGCTGCTGG CGCTGCGCGA CGCGCTGCAG CTCAAGATCG ACGATTGGCA CAAAGCCAAC AAGGGCAAGC CCTGGGATAT CGAGGCCTAC ACCGCGTTCC TGAAAGAGAT CGGCTATCTG TTGCCGGAGC CGGCCACCGT CGAAGTCGAG ACGACCGATG TCGACGAGGA GATCGGCAAG ATCTGCGGCC CGCAGCTGGT GGTGCCGCTC AGCAACGCGC GCTACTCGCT GAACGCCGCC AACGCGCGCT GGGGCTCGCT GTACGACGCG CTGTACGGCA CCGATGCGAT CCCGCATGAC GCTTCGGAGA CCGGCCGCGG TTACAACAAG GCGCGCGGCG CCAAGGTAAT CGCCAAGGCG AAAGCCTTCC TCGACCAAGC GGTGCCGCTG GCGACCGCCA GCCACACCGA TGTGACGTCC TACAGCGTGA TCGCCGGTCA TCTGTCGGTG AAGCTGAAGA GCGGCAACGC CACCGCGCTG AAGAACGCCG CGCAGTTCGC CGGCTTCCTC GGCGATGCAG CGGCACCGAC CGCGATCCTG CTGGTCAACC ACGGCATGCA TATCGAAATC AAGGTCGATC GCTCCAGCGT GATCGGCAAG GACGATGCTG CCGGCGTCGC CGACGTGATC CTGGAATCGG CCGTCTCCAC CATTCTCGAC ATGGAAGACT CGGTCGCCGC TGTCGACGCC GAAGACAAGG TGCTGATCTA TCGCAACGTG CTCGGCCTGA TGGATGGCAC GCTCACCGCC GACTTCGAGA AGGGCGGCAA GACGCTGACC CGCGCGCTGA ACGGCGACCG CGTGTACCAG ACCGTCGACG GCAAGGGCCT GACGCTGCAC GGTCGCAGCC TGTTGCTGAT GCGCAATGTC GGCCATCACA TGTTCACCGA CGCGGTGCTC GACGAATCCG GCGCAGAGAT CCCCGAAGGC TTCCTCGACG CCGCGGTCGC CGGCCTGCTG GCGATTCACG ACCTCAAGGG TGGCTCCAAG ACCCGCAACA GCCGCACCGG CTCGGTCTAT ATCGTCAAGC CGAAGATGCA CGGCCCCGAC GAGGTGGCGC TGACCGTCGA ATTGTTCGGC CGTGTCGAGC AGATGCTCGG CCTTGCGGAG AACACCATGA AGGTCGGCAT CATGGACGAG GAGCGCCGCA CCACGGTCAA CCTCAAGGCC TGCATCCAGA ATGCGTCGAA GCGCATTGTG TTCATCAACA CCGGCTTCCT CGATCGCACC GGCGACGAGA TCCACACCTC GATGGAAGCG GGCCCGATGA TCCGCAAGAA CGAGATGAAG GCGCAGCCCT GGATCAAGGC CTATGAAGAC TGGAACGTCG ACATGGGACT GATCGATGGC CTGCCCGGCC ACGCCCAGAT CGGCAAGGGC ATGTGGGCGG CGCCCGACAA GATGGCGGAC ATGCTGGCGC AGAAGGTCGG CCATCCGCAG GCCGGCGCCA CCACCGCCTG GGTGCCCTCG CCCACCGCCG CGACGCTGCA CGCGCTGCAT TATCACCAGG TCGACGTGCT GAAGCGGCAG CAGGAGTTGA AAACCGGCGG TCCGCGCGCC AAACTCAGCG ACATCCTCAC CATCCCGGTG TCGCAGTCGA ACTGGGCGCC GGACGACGTC CGCCAAGAGA TCGACAACAA CTGCCAGGGC ATTCTCGGCT ATGTGGTGCG CTGGATCGAC CAGGGCGTCG GCTGCTCCAA GGTGCCGGAC ATCCACGACG TCGGACTGAT GGAAGACCGC GCCACGCTGC GGATCTCCAG CCAGCATCTG GCGAATTGGC TGCATCAGGG TGTCATCACC AAGGAGCAGG TGATGGAGTC CTTGAAGCGG ATGGCTGTGG TGGTCGACAA GCAGAACGAA GGCGACGCGC TGTACAAGCC GATGGCGCCC GGCTTCGACG GCGTGGCGTT TAAGGCCGCC TGCGACCTGA TCTTCAAGGG CCGCGAGCAG CCGAACGGCT ATACCGAGTT CATCCTCACC GCGCGGCGCC GCGAAGCCAA GGCCGCGGTG TGA
|
Protein sequence | MNRIKAHGLQ IAPVLFDFIA KEAAPKTGID PDAFWAGLAG IVRDLAPKTR ELLALRDALQ LKIDDWHKAN KGKPWDIEAY TAFLKEIGYL LPEPATVEVE TTDVDEEIGK ICGPQLVVPL SNARYSLNAA NARWGSLYDA LYGTDAIPHD ASETGRGYNK ARGAKVIAKA KAFLDQAVPL ATASHTDVTS YSVIAGHLSV KLKSGNATAL KNAAQFAGFL GDAAAPTAIL LVNHGMHIEI KVDRSSVIGK DDAAGVADVI LESAVSTILD MEDSVAAVDA EDKVLIYRNV LGLMDGTLTA DFEKGGKTLT RALNGDRVYQ TVDGKGLTLH GRSLLLMRNV GHHMFTDAVL DESGAEIPEG FLDAAVAGLL AIHDLKGGSK TRNSRTGSVY IVKPKMHGPD EVALTVELFG RVEQMLGLAE NTMKVGIMDE ERRTTVNLKA CIQNASKRIV FINTGFLDRT GDEIHTSMEA GPMIRKNEMK AQPWIKAYED WNVDMGLIDG LPGHAQIGKG MWAAPDKMAD MLAQKVGHPQ AGATTAWVPS PTAATLHALH YHQVDVLKRQ QELKTGGPRA KLSDILTIPV SQSNWAPDDV RQEIDNNCQG ILGYVVRWID QGVGCSKVPD IHDVGLMEDR ATLRISSQHL ANWLHQGVIT KEQVMESLKR MAVVVDKQNE GDALYKPMAP GFDGVAFKAA CDLIFKGREQ PNGYTEFILT ARRREAKAAV
|
| |