Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1375 |
Symbol | |
ID | 4021852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1543492 |
End bp | 1545666 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637961568 |
Product | malate synthase G |
Protein accession | YP_568514 |
Protein GI | 91975855 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.209074 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGTA TCGACGCCCA CGGCTTGAAA ATTGCGCCTG TCCTGTTCGA CTTCATCGCC AAGGAGGCCG CGCCGAAGAC CGGCGTTGCG CCCGACGCAT TCTGGGCCGG GCTCGCGGCC ATCGTCCGCG ACCTGACGCC CAAGCTCCGC AAGGCGCTGA CCACGCGCGA CGATCTCCAG GCCAAAATCG ACGCCTGGCA CCTCGCCAAC AAGGGCAAGA AGCAGGATCT CGCGGTCTAC ACCGCCTTCC TCAAGGAGAT CGGCTATCTG CAGCCGGAGC CGGCGACGGT TGCGGTCGAG ACCGCCAATG TCGACGAGGA GATCGGCAAG CTGTGCGGCC CGCAGCTCGT GGTGCCGCTG TCGAACGCGC GCTACGCGCT GAACGCGGCG AATGCGCGCT GGGGCTCGCT GTATGACGCG TTCTACGGCA CCGATGCGAT CCCGCAGGAA GCCACTCAGG CCAAGGGCTA CGACAAGGCG CGCGGCGACA AGGTGATCGC CAAGGCCAAG GCGTTCCTCG ACCAGGCCGC GCCGCTGGTG GCCGGCAGCC ACAGCGACGT CACCGCCTAC AGCGTGATCG CCGGCCAGTT CTCGGCCAAG CTGAAGAGCG GCAACGCCAC CGGCCTGAAG AAGCCTGAGC AGTTCGCCGG CTATCTGGGC GACGCCGCGT CGCCGAGCGC CGTGCTGGTG GTCAATAACG GCCTGCACAT CGAGATCAAG ATCGACCGCG CCAACACCAT CGGCAAGGAC GATCCGGCCG GCGTCGCCGA CCTGGTGATC GAGTCCGCGG TCTCGACCAT CCTCGACATG GAAGACTCGG TCGCGGCGGT CGACGCCGAA GACAAGGTGC TGATCTATCG TAACGTGCTC GGCCTGATGG ACGGCACGCT GTCGGAGAGT TTCGACAAGG GCGGCAAGAC CGTCACTCGC GCGCTGAACG GCGACCGCAC CTATACCGGC CCCGACGGCA AGGACATCAC GCTGCACGGC CGCAGCCTGT TGCTGATGCG CAATGTCGGC CATCACATGT GGACCGACGC GGTGCTCGAC GCCGACGGCG CGGAGATTCC GGAAGGCTTC CTCGACGCCG CCGTCTCCGG CCTGATCGCG ATCCACGACC TCAAGGCGCT CGGCAAGACC CGCAACAGCC GCAGCGGCTC GGTCTACATC GTCAAGCCGA AGATGCACGG CCCGGACGAA GTGGCGCTGA CCTGCGAGCT GTTCGGCCGC GTCGAGAAGA TGCTCGGCCT GAACGACAAC ACGCTGAAGG TCGGCATCAT GGACGAGGAG CGCCGCACCA CGGTGAACCT CAAGGCCTGC ATTCAGAATG CGTCGAAGCG GATCGTGTTC ATCAACACCG GCTTCCTCGA CCGCACCGGC GACGAGATCC ACACCTCGAT GGAAGCGGGT CCGATGATCC GCAAGAACGA GATGAAGGCG CAGCCCTGGA TCAAGGCCTA TGAAGACTGG AATGTCGATA CCGGCCTGAT CGACGGCCTG CCCGGCCACG CCCAGATCGG CAAGGGCATG TGGGCGGCGC CGGACAAGAT GGCCGACATG CTGGCGCAGA AGATCGGCCA TCCGCAGGCC GGCGCCACCA CCGCCTGGGT GCCGTCGCCG ACCGCCGCGA CGCTGCACGC GCTGCATTAT CACCAGGTCG ACGTCATCGC CCGTCAGCAG GAACTGCAGA AGGGCGGGCC GCGCGCCAAG CTCGACGACA TCCTCACCAT TCCGGTGTCG CAATCGAACT GGGCGCCGGA CGACGTCCGC CAGGAGATCG ACAACAACTG CCAGGGCATC CTCGGCTACG TCGTGCGCTG GATCGACCAG GGCGTCGGCT GCTCCAAGGT GCCGGACATC CACGATGTCG GCTTGATGGA AGACCGCGCG ACGCTGCGCA TCTCCAGCCA GCATCTGGCG AACTGGCTGC ATCATGGCGT CGTCACCAAG GAGCAGGTGC TGGAATCGCT GAAGCGGATG GCCGCCGTCG TCGACAAGCA GAACGCCAGC GATCCGCTGT ACCGGCCGAT GGCGCCGGAT TTCGACGGCG TCGCCTTCGA GGCCGCCTGC GACCTGATCT TCAAGGGCCG CGAGCAGCCG AACGGCTACA CCGAATTCAT CCTGCATATC CGTCGCCGCG AAGCCAAGGC CGCGCATCTG CAGGATCTAC GCTGA
|
Protein sequence | MNRIDAHGLK IAPVLFDFIA KEAAPKTGVA PDAFWAGLAA IVRDLTPKLR KALTTRDDLQ AKIDAWHLAN KGKKQDLAVY TAFLKEIGYL QPEPATVAVE TANVDEEIGK LCGPQLVVPL SNARYALNAA NARWGSLYDA FYGTDAIPQE ATQAKGYDKA RGDKVIAKAK AFLDQAAPLV AGSHSDVTAY SVIAGQFSAK LKSGNATGLK KPEQFAGYLG DAASPSAVLV VNNGLHIEIK IDRANTIGKD DPAGVADLVI ESAVSTILDM EDSVAAVDAE DKVLIYRNVL GLMDGTLSES FDKGGKTVTR ALNGDRTYTG PDGKDITLHG RSLLLMRNVG HHMWTDAVLD ADGAEIPEGF LDAAVSGLIA IHDLKALGKT RNSRSGSVYI VKPKMHGPDE VALTCELFGR VEKMLGLNDN TLKVGIMDEE RRTTVNLKAC IQNASKRIVF INTGFLDRTG DEIHTSMEAG PMIRKNEMKA QPWIKAYEDW NVDTGLIDGL PGHAQIGKGM WAAPDKMADM LAQKIGHPQA GATTAWVPSP TAATLHALHY HQVDVIARQQ ELQKGGPRAK LDDILTIPVS QSNWAPDDVR QEIDNNCQGI LGYVVRWIDQ GVGCSKVPDI HDVGLMEDRA TLRISSQHLA NWLHHGVVTK EQVLESLKRM AAVVDKQNAS DPLYRPMAPD FDGVAFEAAC DLIFKGREQP NGYTEFILHI RRREAKAAHL QDLR
|
| |