Gene Hhal_2247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2247 
Symbol 
ID4709498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2466855 
End bp2469038 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content67% 
IMG OID639856723 
Productmalate synthase G 
Protein accessionYP_001003813 
Protein GI121999026 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01345] malate synthase G 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.109783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC GGGTTCAAGT CGGTGGCCTG CAGGTGGCGC GTGAACTCCA CGACCTGGTG 
GCCAACGAGA TCGTGCCGGG GACCGGCATC GACGCCGACA CCGTGTGGGA CGAGCTCGGC
GGCATCGTCC GCGATCTGGC TCCGCGCAAC CGCGAGTTGC TGGAACAGCG CGAGAACTTG
CAGCGGCAGA TCGACGACTG GCATCGGAAC CATCGTGGCC AGTTCTCGGT ATCCGATCAC
AAGGCGTTCC TTGAGCAGAT CGGCTACCTG GAACCCGAAG TGGACGCGTT CGAGATCACC
ACCACCGGGG TCGACCCGGA GATCGCCACC GTGGCCGGGC CGCAGCTGGT GGTGCCGGTC
GATAACGCGC GATTCGCGCT CAACGCGGCC AACGCGCGCT GGGGCAGCCT GTTCGATGCC
CTCTACGGCA CCGACATGAT CCCCGAGAGC GACGGGCTAG CCAAGGGCAA GACGTACAAT
CCCGCTCGCG GTAAAAAGGT CATGGAGTTG GCCGCCGAGA CCCTGGACGA GGTGGCCCCG
CTGGCCCATG GGCTCCACGC CGAGGTTACC GCCTACCGGC TGAGCGACGG CAACCCGCGC
CAGCTGGTCA TCACCCTGGC CGACGGCAGT GAGACCGAAC TGGCCGATCC GACCCGCTTC
GTCGGCTTCA CCGGCGAGCC GGATCGCCCC GCCACCCTCC TGCTGCGCCA CAACGGCCTC
CACGCCGAGA TCGTCATCGA CCCGAACGAC CCGATCGGTC AGGATCACCC GGCCGGCGTC
AAGGACGTGG TCATGGAGTC GGCGCTGACC GCCATCCAGG ACTGCGAGGA CTCCGTGGCC
GCGGTGGACG CCGAGGACAA GGTGCGCGTC TACCGCAACT GGTTGGGCCT GATGAAAGGC
GACCTAGAGA CGTCGGTGAG CAAGGGCGGC GAGACCTTTA CGCGGCGGCT CAACCCGGAT
CGCACCTACA CGGCCCCGGA CGGCGGCTCG CTGACCCTGC CGGGCCGCTC GCTCATGCTG
GTGCGCAATG TCGGTCACCT GATGACCACA CCGGCGGTCC TCGACGGCGA CGGCAACGAG
ATCCCCGAGG GCATGCTCGA CGCCATGATG ACCGTCCTCT GCGCGGTCCA CGACCTCAAG
GGGCTCGGAC AGGTATGCAA CTCGAAGACC GGCAGCGTCT ACATCGTCAA GCCGAAGATG
CACGGTCCCG AGGAGGTGGC GCTGACCGTG AACCTGTTCG AGCGCGTCGA GGACGCCCTG
GGTCTGGCGC GTGCCACCCT CAAGGTGGGC ATCATGGATG AGGAGCGGCG CACTACGGTC
AACCTGCGTG CCTGCATCCA GCAGGCCCGG GATCGGGTGA TCTTCATCAA CACCGGCTTC
CTCGACCGCA CTGGCGACGA GATCCACACC GCCATGGAGG CCGGCGCGGT GATCCGCAAG
GCGGACATGA AGGGGGCCGC CTTCATGACC ACCTACGAGG ACTGGAACGT CGATGTCGGC
CTGGCTTCCG GCTTCAAGGG CAAGGCCCAG ATCGGCAAGG GCATGTGGCC GAAGCCGGAC
AAGATGCGCG AGATGTTCGA CACCAAGGCT GGCCACCCCA AGGCAGGCGC GAACTGTGCC
TGGGTGCCGT CGCCGACGGC GGCGACCCTG CACGCTGTGC ACTACCACCA GGTGGACGTG
GCTGGCGTCC AGGCCGAGAT CGCCCGGGAG GGGTGGCGTT CCGACCTGAG CCGGATCCTC
ACCGTGCCGC TGGCGCCGAG CACCGACTGG AGCGCCGAGG AGATCCAGCA GGAGGTGGAC
AACAACTGCC AGGGCATCCT CGGCTATGTG GTGCGCTGGA TCGACCAGGG CATTGGCTGC
TCCAAGGTCC CGGACGTCAA TAACGTGGGG CTGATGGAGG ATCGCGCCAC GCTGCGCATC
TCCAGCCAGC ACGTGGCCAA CTGGCTCTAC CACGGCGTGG TGACCGAGGA GCAGGTCATG
GACAGCCTCA AGCGCATGGC CCAGGTGGTC GACGAGCAGA ACGCCGGCGA CCCGAACTAC
CGCCCCATGG CCGAAGACTT CGACGGCAGC GTCGCCTTCC AGGCGGCCTG TGATCTGGTC
TTCAAGGGGC GGGAGCAGCC CTCCGGTTAC ACCGAGCCGG TGCTCCATCG CCGTCGGCAG
GAGGCGAAGG CGAAGTACGC CTGA
 
Protein sequence
MSERVQVGGL QVARELHDLV ANEIVPGTGI DADTVWDELG GIVRDLAPRN RELLEQRENL 
QRQIDDWHRN HRGQFSVSDH KAFLEQIGYL EPEVDAFEIT TTGVDPEIAT VAGPQLVVPV
DNARFALNAA NARWGSLFDA LYGTDMIPES DGLAKGKTYN PARGKKVMEL AAETLDEVAP
LAHGLHAEVT AYRLSDGNPR QLVITLADGS ETELADPTRF VGFTGEPDRP ATLLLRHNGL
HAEIVIDPND PIGQDHPAGV KDVVMESALT AIQDCEDSVA AVDAEDKVRV YRNWLGLMKG
DLETSVSKGG ETFTRRLNPD RTYTAPDGGS LTLPGRSLML VRNVGHLMTT PAVLDGDGNE
IPEGMLDAMM TVLCAVHDLK GLGQVCNSKT GSVYIVKPKM HGPEEVALTV NLFERVEDAL
GLARATLKVG IMDEERRTTV NLRACIQQAR DRVIFINTGF LDRTGDEIHT AMEAGAVIRK
ADMKGAAFMT TYEDWNVDVG LASGFKGKAQ IGKGMWPKPD KMREMFDTKA GHPKAGANCA
WVPSPTAATL HAVHYHQVDV AGVQAEIARE GWRSDLSRIL TVPLAPSTDW SAEEIQQEVD
NNCQGILGYV VRWIDQGIGC SKVPDVNNVG LMEDRATLRI SSQHVANWLY HGVVTEEQVM
DSLKRMAQVV DEQNAGDPNY RPMAEDFDGS VAFQAACDLV FKGREQPSGY TEPVLHRRRQ
EAKAKYA