Gene Information |
Locus tag | RPB_1395 |
Symbol | |
ID | 3908345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1584818 |
End bp | 1586992 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883289 |
Product | malate synthase G |
Protein accession | YP_485016 |
Protein GI | 86748520 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
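The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon: the terminal stop codon encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for RPB_1395.
length = gene_length_bp(1584818, 1586992)
print(length, protein_length_aa(length))  # prints "2175 724"
```

Both results agree with the Gene Length (2175 bp) and Protein Length (724 aa) fields of this record.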
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.528165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGTA TCGATGCCCA CGGCTTGAAA ATTGCGCCCG TCCTGTTCGA CTTCATCGCC AAGGAGGCCG CGCCGAAGAC CGGCGTCGCA CCGGACGCCT TCTGGGCCGG GCTCGCCGCG ATCGTGCGCG ATCTGACGCC GAAGACGGTC GCGCTGCTGC AGCATCGCGA CGGCCTGCAG GCCAAGATCG ATGCCTGGCA TCTCGCCAAC AAGGGCAAGA AGCAGGACAT GGCGGCCTAC ACCGCCTTCC TGAAAGAGAT CGGCTATCTG CTGCCGGAGC CGCCGACGGT CGCGGTCGAG ACCGCCAATG TCGACGACGA GATCGGCAAG CTGTGCGGCC CGCAGCTCGT GGTGCCGCTG TCGAATGCGC GCTACGCACT CAACGCGGCG AACGCGCGTT GGGGCTCGCT GTATGACGCG TTCTATGGCA CCGACGCGAT CCCGCAGGAG GCCAGCCAGG CCAAGGGCTA CGACAAGGCG CGCGGCGACA AGGTGATCGC CAAGGCCAAG GCGTTCCTCG ACCAGGCCGC GCCGCTGGTG GCCGGCAGCC ACAGCGACGT CACCGCCTAC AGCGTGATCG CCGGCCAGCT TTCGGCAAAA CTCAAGAGCG GCAACGCCAC CGGCCTGAAG ACCCCCCGGC AGTTCGCCGG CTATCTGGGC GACGCCGCGT CGCCGAGCGC GGTGCTGCTG GTCAATAACG GCCTGCATAT CGAGATCAAG ATCGACCGCG CCAATACCAT CGGCAAGGAC GATGCGGCCG GCGTCGCCGA CCTCGTGATC GAATCCGCGG TGTCGACCAT CCTCGACATG GAGGACTCGG TCGCAGCCGT CGACGCCGAA GACAAGGTGC TGATCTACCG CAACACGCTC GGCCTGATGG ACGGCACGCT GTCGGCCGAT TTCGACAAGG GTGGCAAGAC CGTCACCCGC GCGCTGAACG GCGACCGCAC TTATACCGGC CCGGACGGCA AGGACGTCAC CCTGCACGGG CGCAGCCTGC TGCTGATGCG CAATGTCGGC CATCACATGT GGACCGATGC GGTGCTCGAT TCGAACGGCG ACGAGATCCC CGAGGGCTTC CTCGACGCCG CGGTGTCCGG CCTACTGGCG ATCCACGACC TCAAGGCACT CGGCAAGACC CGCAACAGCC GCACCGGCTC GGTCTACATC GTCAAGCCGA AGATGCACGG CCCGGACGAA GTGGCGCTGA CTTGCGAGCT GTTCGGCCGC GTCGAGCAGA TGCTCGGCCT GAAGGAGAAC ACGCTGAAGG TCGGCATCAT GGACGAGGAG CGCCGCACCA CGGTGAACCT CAAGGCCTGC ATCCAGAACG CGTCCAAGCG CATCGTGTTC ATCAACACGG GTTTCCTGGA CCGCACCGGC GACGAGATCC ACACCTCGAT GGAAGCGGGT CCGATGATCC GCAAGAACGA GATGAAGGCG CAGCCCTGGA TCAAGGCGTA CGAGGACTGG AACGTCGACA CCGGCCTGAT CGACGGCCTG CCCGGCCATG CCCAGATTGG TAAGGGCATG TGGGCGGCCC CCGACAAGAT GGCCGACATG CTGACGCAGA AGATCGGCCA CCCGCAGGCC GGCGCCACCA CCGCCTGGGT GCCGTCGCCG ACCGCCGCGA CGCTGCACGC GCTGCACTAT CACCAGGTCG ACGTGCTGGC GCGTCAGCAG GAGCTGAAGA CGGGCGGCCC GCGCGCCAAG CTCGAGGACA TCCTCACCAT CCCGGTGTCG CAATCGAATT GGGCGCCGGA CGACGTCCGC CAGGAGATCG ACAACAACTG CCAGGGTATC 
CTCGGCTACG TCGTGCGCTG GATCGATCAG GGCGTCGGCT GCTCCAAGGT GCCGGACATC CACGACGTCG GCCTGATGGA AGACCGCGCC ACGCTGCGCA TCTCCAGCCA GCATCTGGCG AACTGGCTGC ATCACGGCGT CGTCACCAAG GAGCAGGTGC TGGAATCGCT GAAGCGGATG GCGGCGGTGG TCGACAAGCA GAACGCTGGT GATCCGCTGT ACCGGCCGAT GGCGCCGGAC TTCGACGGCG TCGCCTTCGA GGCGGCGTGC GACCTGATCT TCAAGGGCCG CGAGCAGCCG AACGGCTACA CCGAATTCAT CCTGCATATC CGCCGCCGCG AAGCCAAGGC GGCGCATCTG CAGGACCTGA CCTGA
|
Protein sequence | MNRIDAHGLK IAPVLFDFIA KEAAPKTGVA PDAFWAGLAA IVRDLTPKTV ALLQHRDGLQ AKIDAWHLAN KGKKQDMAAY TAFLKEIGYL LPEPPTVAVE TANVDDEIGK LCGPQLVVPL SNARYALNAA NARWGSLYDA FYGTDAIPQE ASQAKGYDKA RGDKVIAKAK AFLDQAAPLV AGSHSDVTAY SVIAGQLSAK LKSGNATGLK TPRQFAGYLG DAASPSAVLL VNNGLHIEIK IDRANTIGKD DAAGVADLVI ESAVSTILDM EDSVAAVDAE DKVLIYRNTL GLMDGTLSAD FDKGGKTVTR ALNGDRTYTG PDGKDVTLHG RSLLLMRNVG HHMWTDAVLD SNGDEIPEGF LDAAVSGLLA IHDLKALGKT RNSRTGSVYI VKPKMHGPDE VALTCELFGR VEQMLGLKEN TLKVGIMDEE RRTTVNLKAC IQNASKRIVF INTGFLDRTG DEIHTSMEAG PMIRKNEMKA QPWIKAYEDW NVDTGLIDGL PGHAQIGKGM WAAPDKMADM LTQKIGHPQA GATTAWVPSP TAATLHALHY HQVDVLARQQ ELKTGGPRAK LEDILTIPVS QSNWAPDDVR QEIDNNCQGI LGYVVRWIDQ GVGCSKVPDI HDVGLMEDRA TLRISSQHLA NWLHHGVVTK EQVLESLKRM AAVVDKQNAG DPLYRPMAPD FDGVAFEAAC DLIFKGREQP NGYTEFILHI RRREAKAAHL QDLT
|