Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4697 |
Symbol | |
ID | 6412383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5055908 |
End bp | 5058082 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714576 |
Product | malate synthase G |
Protein accession | YP_001993663 |
Protein GI | 192293058 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTA TTGACGCCCA CGGACTGAAA ATCGCGCCTG TGCTGTTCGA CTTCATCGCC AAGGAGGCCG CGCCGAAAAC CGGCATCGCT CCCGACGTAT TCTGGGCCGG GCTCGCTGCG ATCGTTCGTG ATCTGGCGCC GAAGACCCGC GCGCTGCTGA AGACCCGCGA CGACCTGCAG GCCAAGATCG ATGCGTGGCA TCTCGCCAAC AAGGGCAAGA AGCAGGACAT GGCGGCCTAC ACCGCCTTCC TGAAGGAGAT CGGCTACCTG CTGCCCGAGC CGCCGACGGT GCCGGTCGAG ACCGCCAATA TCGACGAAGA GATCGGCAAG CTGTGCGGCC CGCAGCTCGT GGTGCCGCTG TCGAATGCGC GCTACGCACT GAATGCCGCC AACGCGCGCT GGGGTTCGCT GTACGACGCA TTCTACGGCA CCGACGCGAT CCCGCAGGAA GCCACCCAGG CTAAGGGCTA CGACAAGGCG CGCGGCGACA AGGTGATCGC CAAGGCCAAG GCATTCCTCG ACCAAGCCGC GCCGCTCGCG ACCGGCAGCC ATAGCGACGT CACTGGTTAC AGCGTGATCG CCGGCCAGCT GTCGGCCAAG CTGAAGAGCG GCAATGCCAC CGGCCTGAAG AAACCGGCAC AGTTCGCCGG CTTCCGCGGC GATGCCGCCA ATCCGAGCGC GGTGCTGCTG GTCAACAACG GCCTGCACAT CGAGATCAAG ATCGATCGCG CCAACACCAT CGGCAAGGAC GATCCGGCCG GCGTCGCCGA CCTGGTCATC GAGTCGGCGG TCTCGACCAT TCTCGACATG GAAGACTCGG TCGCCGCCGT CGATGCCGAC GACAAGGTGC TGATCTATCG CAACACCCTC GGCCTGATGG ACGGCACGCT GTCGGAAAGC TTTGAGAAGG GCGGCAAGAC CGTTACCCGC GCGCTCAACG GCGACCGCAC CTACACCGCG CCGGACGGCA AGGAGATCTC GCTGCACGGC CGCAGCCTGC TGCTGATGCG CAACGTCGGC CATCACATGT GGACCGATGC GGTGCTCGAC AGCGACGGCC AGGAGATTCC GGAAGGCTTC CTCGACGCTG CGGTGTCCGG CCTGATCGCG ATCCACGATC TCAAGCACCT CGGCAAGACC CGCAACAGCC GCACCGGCTC GGTCTACATC GTCAAGCCGA AGATGCACGG CCCGGATGAA GTCGCCCTCA CCGTCGAGCT GTTCGGCCGC GTCGAGACCA TGCTCGGCCT GACCGCGAAC ACCCTGAAGG TCGGCATCAT GGACGAGGAA CGCCGCACCA CGGTGAACCT CAAGGCCTGC ATCCAGAACG CGTCGAAGCG GATCGTCTTC ATCAACACCG GCTTCCTCGA TCGCACCGGC GACGAGATCC ACACCTCGAT GGAAGCGGGT CCGATGATCC GCAAGAACGA GATGAAGGCG CAGCCCTGGA TCAAGGCCTA CGAAGACTGG AACGTCGACA CCGGTTTGGT CGACGGCCTG CCGGGTCACG CCCAGATCGG CAAGGGCATG TGGGCGGCCC CCGACAAGAT GGCCGACATG CTGGCGCAGA AGATCGGTCA CCCGCAGGCC GGCGCGACCA CCGCCTGGGT GCCGTCGCCG ACCGCCGCGA CGCTGCACGC GCTGCACTAT CACCAGGTCG ACGTGATCGC GCGCCAGCAG GAGCTGGCGA AGGGCGGTCC GCGCGCCAAG CTCGAAGACA TCCTCACCAT CCCGGTGTCG AACTCGAACT GGGCGCCGGA CGATGTCCGC CAGGAGATCG ACAACAACTG CCAGGGCATC CTCGGCTACG TGGTGCGCTG GATCGACCAG GGCGTCGGCT GCTCCAAGGT GCCGGACATC CACGACGTCG GCCTGATGGA AGACCGCGCG ACGCTGCGCA TCTCAAGCCA GCACCTCGCC AACTGGCTGC ATCACGGCGT CGTCACCAAG GACCAGGTGC TCGACTCGCT GAAGCGGATG GCGGTGATCG TCGACAAGCA GAACGAAGGC GATGCGCTGT ACCGGCCGAT TGCGCCGGAC TTCGACGGCG TCGCGTTCGA AGCCGCGTGC GACCTGATCT TCAAGGGCCG CGCGCAGCCG AACGGCTACA CCGAATACAT CCTGCATGAG CGCCGCCGCG AGGCCAAGGC GGCGCACCTG GAGTCGGCAC GCTAA
|
Protein sequence | MNRIDAHGLK IAPVLFDFIA KEAAPKTGIA PDVFWAGLAA IVRDLAPKTR ALLKTRDDLQ AKIDAWHLAN KGKKQDMAAY TAFLKEIGYL LPEPPTVPVE TANIDEEIGK LCGPQLVVPL SNARYALNAA NARWGSLYDA FYGTDAIPQE ATQAKGYDKA RGDKVIAKAK AFLDQAAPLA TGSHSDVTGY SVIAGQLSAK LKSGNATGLK KPAQFAGFRG DAANPSAVLL VNNGLHIEIK IDRANTIGKD DPAGVADLVI ESAVSTILDM EDSVAAVDAD DKVLIYRNTL GLMDGTLSES FEKGGKTVTR ALNGDRTYTA PDGKEISLHG RSLLLMRNVG HHMWTDAVLD SDGQEIPEGF LDAAVSGLIA IHDLKHLGKT RNSRTGSVYI VKPKMHGPDE VALTVELFGR VETMLGLTAN TLKVGIMDEE RRTTVNLKAC IQNASKRIVF INTGFLDRTG DEIHTSMEAG PMIRKNEMKA QPWIKAYEDW NVDTGLVDGL PGHAQIGKGM WAAPDKMADM LAQKIGHPQA GATTAWVPSP TAATLHALHY HQVDVIARQQ ELAKGGPRAK LEDILTIPVS NSNWAPDDVR QEIDNNCQGI LGYVVRWIDQ GVGCSKVPDI HDVGLMEDRA TLRISSQHLA NWLHHGVVTK DQVLDSLKRM AVIVDKQNEG DALYRPIAPD FDGVAFEAAC DLIFKGRAQP NGYTEYILHE RRREAKAAHL ESAR
|
| |