Gene Hhal_1029 details


Gene Information

Locus tag: Hhal_1029
Symbol:
ID: 4709643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1104071
End bp: 1106365
Gene Length: 2295 bp
Protein Length: 764 aa
Translation table: 11
GC content: 67%
IMG OID: 639855500
Product: malic enzyme
Protein accession: YP_001002607
Protein GI: 121997820
COG category: [C] Energy production and conversion
COG ID: [COG0280] Phosphotransacetylase
        [COG0281] Malic enzyme
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
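
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1104071
end_bp = 1106365


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2295 764
```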


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGACG ACTTCAAAGA AGCCGCGCTC GAATACCATC GGCAGCCGAC CCCCGGGAAG 
ATCGAGGTCA TCCCCAGCAA GCCGCTGGCC AACCAGCGTG ACCTCGCCCT GGCCTACTCC
CCCGGGGTTG CCTCGGCCTG CGAAGCCATC GTCAACGACC CGCGCGAGGC GGCCGCCATG
ACCGCCCGGG GCAACCTGGT GGCGGTCATC ACCAACGGTT CCGCCGTGCT CGGCCTGGGT
AACATCGGTC CACTGGCGTC GAAACCGGTG ATGGAGGGCA AGGGTGTCCT GTTCAAGAAG
TTCGCCGGAA TCGATGTCTT CGACATCGAG GTCGACGAGC CGGACGCCGA GCGCTTCGTC
GACATCGTCT CCGCTCTGGA GCCGACCTTC GGCGGCATCA ATCTGGAGGA CATCAAGGCC
CCGGAGTGCT TCGAGATCGA GGAGGCCCTC AAGCGCCGGA TGAACATCCC GGTGTTCCAC
GATGATCAGC ACGGCACCGC CATCATCACT GCTGCGGCTA TCCGCAATGG CCTGCGCGTG
GTGGGCAAGC GGCTCGAGGA CGTCACCCTG GTCTGCTCCG GCGCCGGTGC CGCGGCCATC
GCCTGCCTCG ACCTGCTGGT GGCCATGGGG CTGCCCAAGG AGCAGATCAC CGTCACCGAT
CGCAAGGGCG TGGTCTACAA GGGGCGCAAG GAGTACATGG ACCCGCGTAA GGAAGGCTAT
GCCCGGGAGA CGGCCTCCCG CAGTCTGCGC GAGGTGATCG AGGGCGCGGA CATCTTCCTC
GGGCTGTCGG CACCGGGGGT GCTGAATGCC GAGATGGTCC ACCTGATGGC CGATCAGCCG
ATCATCATGG CGTTGGCCAA TCCGGTTCCC GAGATCCTCC CCGAGGAGGT GCGCGAGGCC
CGGCCCGATG CGGTGATCGC CACCGGCCGC TCGGATTACC CCAACCAGGT GAACAACGTC
CTCTGCTTCC CGTTCATCTT CCGCGGGGCG CTGGATGTGG GCGCCAGCGC CATCAACGAG
GCGATGAAGA TCGCCGCGGT GGAGGCCATC GCCGACCTGG CTACCGAGGA GTCGTCCGAG
GAGGTGGTGG CAGCCTACGG CGGCCGGCCC TGGAGCTTCG GCCCCGAGTA CCTGATTCCC
AAGCCCTTCG ACCCCCGACT GATCAGCCGC GTGGCGCCGG CGGTGGCCCA AGCCGCCATG
GAGAGCGGGG TGGCTTGCCG GCCGATCGCG GACTTCGAGG CCTACCGCCT GCAGCTGCAG
CAGTACGTGT TCCAGTCCGG GCTGGTCATG AAGCCGATTT TCGAGCGCGC CCGGGAGCAG
CCCAAGCGGG TGGTCTACAC CGACGGCGAA GAGGAGCGGA TCCTCCGTGC CGTGCAGCTG
GTGGTCGACG AACAGCTTGC GCGGCCCATC GTCGTCGGGC GCCGGCGCGT GGTGGAGAAG
CGGCTGCGCC AGCTCGGTTT GCGGGTACAG ATCGACGAGG ATTTCGAGCT GGTCGACCCC
GAGGGCGACC CCCGCTACCG CGACTATTGG CAGGCGTACT ACCAGCTGAT GGCCCGGCGC
GGCGTGACGC CGGCCCGGGC GCGGACGGTG GTGCGTACCC GCAACACGGT CATCGGCGCA
CTGATGGTCC ATCTAGGCGA CGCCGACGCC CTGGTCGGCG GTATCGAGGG GCGCTATCAG
CGGCAGATGC AGCACGTTCA GGACGTTATC GGCCGTCGCC GCGGGGTGCG CAACCTGGCG
GCGATGAACG TGCTGATCAT GCCCAAGGGG ACCTTCTTCC TGGCGGACAC CTACGTCAAC
CAGGATCCGA ACCCCCACGA GATCGCCGAG ATGACCTTGT TGGCTGCGGA CGAGGTGCGC
CGTTTCGGAA CGGTGCCGAA GGTGGCGCTG CTCTCGCACT CGAACTTCGG TACCTCGTCG
CAGCCCTCGG CGGAGAAGAT GCGCCACGCC CTGGAGCTGA TCCAGGACCG GGACCCGGCC
CTCGAGGTGG AGGGCGAGAT GCACGGCGAC GCAGCGATCT CCGAGGAGGT GCGACGGCGC
ATCTTCCCCG ATGCGCAGCT CGAGGGGGAG GCGAACCTGC TGATCATGCC GGGGCTCGAC
GCCGCCAACA TCTCGTTCAA CCTGCTCAAG GCGACCACCG ACAGCGTCTC TGTCGGGCCG
ATCCTGCTCG GGACCGCGAA ACCGGCCCAC CTGCTCACCC CGTCGTCCAC CGTTCGGGCC
ATCGTCAACC TCACCGCCCT GTCCGTGGTC GAGGCGCAGA TGACCGATGC GGTGGAGCCG
GAGAGGCATC CGTAG
 
Protein sequence
MSDDFKEAAL EYHRQPTPGK IEVIPSKPLA NQRDLALAYS PGVASACEAI VNDPREAAAM 
TARGNLVAVI TNGSAVLGLG NIGPLASKPV MEGKGVLFKK FAGIDVFDIE VDEPDAERFV
DIVSALEPTF GGINLEDIKA PECFEIEEAL KRRMNIPVFH DDQHGTAIIT AAAIRNGLRV
VGKRLEDVTL VCSGAGAAAI ACLDLLVAMG LPKEQITVTD RKGVVYKGRK EYMDPRKEGY
ARETASRSLR EVIEGADIFL GLSAPGVLNA EMVHLMADQP IIMALANPVP EILPEEVREA
RPDAVIATGR SDYPNQVNNV LCFPFIFRGA LDVGASAINE AMKIAAVEAI ADLATEESSE
EVVAAYGGRP WSFGPEYLIP KPFDPRLISR VAPAVAQAAM ESGVACRPIA DFEAYRLQLQ
QYVFQSGLVM KPIFERAREQ PKRVVYTDGE EERILRAVQL VVDEQLARPI VVGRRRVVEK
RLRQLGLRVQ IDEDFELVDP EGDPRYRDYW QAYYQLMARR GVTPARARTV VRTRNTVIGA
LMVHLGDADA LVGGIEGRYQ RQMQHVQDVI GRRRGVRNLA AMNVLIMPKG TFFLADTYVN
QDPNPHEIAE MTLLAADEVR RFGTVPKVAL LSHSNFGTSS QPSAEKMRHA LELIQDRDPA
LEVEGEMHGD AAISEEVRRR IFPDAQLEGE ANLLIMPGLD AANISFNLLK ATTDSVSVGP
ILLGTAKPAH LLTPSSTVRA IVNLTALSVV EAQMTDAVEP ERHP
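
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, translated and checked for GC content. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. The codon table is built from the standard NCBI ordering, whose amino acid assignments are the same for translation table 11; the alternative start codons that table 11 allows are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; stop codons become '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the last line of the gene sequence in this record.
print(translate("ATGTCCGACG ACTTCAAAGA AGCCGCGCTC"))  # MSDDFKEAAL
print(translate("GAGAGGCATC CGTAG"))                  # ERHP*
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.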