Gene Information |
Locus tag | Hhal_2247 |
Symbol | |
ID | 4709498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2466855 |
End bp | 2469038 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639856723 |
Product | malate synthase G |
Protein accession | YP_001003813 |
Protein GI | 121999026 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
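
The length fields above can be cross-checked against the coordinates with a little arithmetic. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the reported span includes a single terminal stop codon that is not translated. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2466855
end_bp = 2469038

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the span ends with one stop codon, which adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2184
print(protein_length)  # 727
```

Both results agree with the Gene Length (2184 bp) and Protein Length (727 aa) fields.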
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.109783 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAC GGGTTCAAGT CGGTGGCCTG CAGGTGGCGC GTGAACTCCA CGACCTGGTG GCCAACGAGA TCGTGCCGGG GACCGGCATC GACGCCGACA CCGTGTGGGA CGAGCTCGGC GGCATCGTCC GCGATCTGGC TCCGCGCAAC CGCGAGTTGC TGGAACAGCG CGAGAACTTG CAGCGGCAGA TCGACGACTG GCATCGGAAC CATCGTGGCC AGTTCTCGGT ATCCGATCAC AAGGCGTTCC TTGAGCAGAT CGGCTACCTG GAACCCGAAG TGGACGCGTT CGAGATCACC ACCACCGGGG TCGACCCGGA GATCGCCACC GTGGCCGGGC CGCAGCTGGT GGTGCCGGTC GATAACGCGC GATTCGCGCT CAACGCGGCC AACGCGCGCT GGGGCAGCCT GTTCGATGCC CTCTACGGCA CCGACATGAT CCCCGAGAGC GACGGGCTAG CCAAGGGCAA GACGTACAAT CCCGCTCGCG GTAAAAAGGT CATGGAGTTG GCCGCCGAGA CCCTGGACGA GGTGGCCCCG CTGGCCCATG GGCTCCACGC CGAGGTTACC GCCTACCGGC TGAGCGACGG CAACCCGCGC CAGCTGGTCA TCACCCTGGC CGACGGCAGT GAGACCGAAC TGGCCGATCC GACCCGCTTC GTCGGCTTCA CCGGCGAGCC GGATCGCCCC GCCACCCTCC TGCTGCGCCA CAACGGCCTC CACGCCGAGA TCGTCATCGA CCCGAACGAC CCGATCGGTC AGGATCACCC GGCCGGCGTC AAGGACGTGG TCATGGAGTC GGCGCTGACC GCCATCCAGG ACTGCGAGGA CTCCGTGGCC GCGGTGGACG CCGAGGACAA GGTGCGCGTC TACCGCAACT GGTTGGGCCT GATGAAAGGC GACCTAGAGA CGTCGGTGAG CAAGGGCGGC GAGACCTTTA CGCGGCGGCT CAACCCGGAT CGCACCTACA CGGCCCCGGA CGGCGGCTCG CTGACCCTGC CGGGCCGCTC GCTCATGCTG GTGCGCAATG TCGGTCACCT GATGACCACA CCGGCGGTCC TCGACGGCGA CGGCAACGAG ATCCCCGAGG GCATGCTCGA CGCCATGATG ACCGTCCTCT GCGCGGTCCA CGACCTCAAG GGGCTCGGAC AGGTATGCAA CTCGAAGACC GGCAGCGTCT ACATCGTCAA GCCGAAGATG CACGGTCCCG AGGAGGTGGC GCTGACCGTG AACCTGTTCG AGCGCGTCGA GGACGCCCTG GGTCTGGCGC GTGCCACCCT CAAGGTGGGC ATCATGGATG AGGAGCGGCG CACTACGGTC AACCTGCGTG CCTGCATCCA GCAGGCCCGG GATCGGGTGA TCTTCATCAA CACCGGCTTC CTCGACCGCA CTGGCGACGA GATCCACACC GCCATGGAGG CCGGCGCGGT GATCCGCAAG GCGGACATGA AGGGGGCCGC CTTCATGACC ACCTACGAGG ACTGGAACGT CGATGTCGGC CTGGCTTCCG GCTTCAAGGG CAAGGCCCAG ATCGGCAAGG GCATGTGGCC GAAGCCGGAC AAGATGCGCG AGATGTTCGA CACCAAGGCT GGCCACCCCA AGGCAGGCGC GAACTGTGCC TGGGTGCCGT CGCCGACGGC GGCGACCCTG CACGCTGTGC ACTACCACCA GGTGGACGTG GCTGGCGTCC AGGCCGAGAT CGCCCGGGAG GGGTGGCGTT CCGACCTGAG CCGGATCCTC ACCGTGCCGC TGGCGCCGAG CACCGACTGG AGCGCCGAGG AGATCCAGCA GGAGGTGGAC 
AACAACTGCC AGGGCATCCT CGGCTATGTG GTGCGCTGGA TCGACCAGGG CATTGGCTGC TCCAAGGTCC CGGACGTCAA TAACGTGGGG CTGATGGAGG ATCGCGCCAC GCTGCGCATC TCCAGCCAGC ACGTGGCCAA CTGGCTCTAC CACGGCGTGG TGACCGAGGA GCAGGTCATG GACAGCCTCA AGCGCATGGC CCAGGTGGTC GACGAGCAGA ACGCCGGCGA CCCGAACTAC CGCCCCATGG CCGAAGACTT CGACGGCAGC GTCGCCTTCC AGGCGGCCTG TGATCTGGTC TTCAAGGGGC GGGAGCAGCC CTCCGGTTAC ACCGAGCCGG TGCTCCATCG CCGTCGGCAG GAGGCGAAGG CGAAGTACGC CTGA
|
Protein sequence | MSERVQVGGL QVARELHDLV ANEIVPGTGI DADTVWDELG GIVRDLAPRN RELLEQRENL QRQIDDWHRN HRGQFSVSDH KAFLEQIGYL EPEVDAFEIT TTGVDPEIAT VAGPQLVVPV DNARFALNAA NARWGSLFDA LYGTDMIPES DGLAKGKTYN PARGKKVMEL AAETLDEVAP LAHGLHAEVT AYRLSDGNPR QLVITLADGS ETELADPTRF VGFTGEPDRP ATLLLRHNGL HAEIVIDPND PIGQDHPAGV KDVVMESALT AIQDCEDSVA AVDAEDKVRV YRNWLGLMKG DLETSVSKGG ETFTRRLNPD RTYTAPDGGS LTLPGRSLML VRNVGHLMTT PAVLDGDGNE IPEGMLDAMM TVLCAVHDLK GLGQVCNSKT GSVYIVKPKM HGPEEVALTV NLFERVEDAL GLARATLKVG IMDEERRTTV NLRACIQQAR DRVIFINTGF LDRTGDEIHT AMEAGAVIRK ADMKGAAFMT TYEDWNVDVG LASGFKGKAQ IGKGMWPKPD KMREMFDTKA GHPKAGANCA WVPSPTAATL HAVHYHQVDV AGVQAEIARE GWRSDLSRIL TVPLAPSTDW SAEEIQQEVD NNCQGILGYV VRWIDQGIGC SKVPDVNNVG LMEDRATLRI SSQHVANWLY HGVVTEEQVM DSLKRMAQVV DEQNAGDPNY RPMAEDFDGS VAFQAACDLV FKGREQPSGY TEPVLHRRRQ EAKAKYA
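
The two sequences above can be related to each other and to the GC content field with a short script. This is a hedged sketch, not the database's own method: it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares with table 1 apart from its set of permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: tables 1 and 11 differ only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = "ATGAGCGAAC GGGTTCAAGT CGGTGGCCTG"
print(translate(prefix))  # MSERVQVGGL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.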