Gene Hhal_1594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1594 
Symbol 
ID4709628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1732693 
End bp1734579 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content66% 
IMG OID639856059 
Productacetolactate synthase, large subunit, biosynthetic type 
Protein accessionYP_001003160 
Protein GI121998373 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.662359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACCG CAATCGTTAA CCCCCAGGCA GAGACTGAAG CAACCAGCCA CCCGCTTGCC 
GGCAAGACCA TGAGTGGCGG GGAGATTATT GTCGAGGTCC TGGCCCAGGA GGGGGTCGAT
ACGCTCTTTG GCTATAGCGG CGGTGCCATC CTTCCTACCT ACGACGCCAT CTTCAAGTAC
AACGAGCGTC ACCGGAAAGA GCAGGCCGTC GAGGAAGAGG ATCCGATCCG GCTGATCGTG
CCGGCCAACG AGCAGGGCGC CGGATTCATG GCAGCCGGCT ACGCCCGGGC CACCGGCAAG
GTCGGCTGCT TCCTGGTGAC CTCCGGGCCG GGGGCGACCA ACACGGTCAC CCCCATCCGC
GACTGCATGG CCGATTCGGT GCCGGTGGTG GCCATCACGG GGCAGGTGCC GACGCACGCC
ATGGGCACGG ATGCGTTCCA AGAGGCGCCC ATCGTCAACA TCATGGGGAG CTGCGCCAAG
CACGTCTTCC TGGTGACCGA TCCGGAGCAG CTTGAGGCGA CCGTGCGGAC GGCCTTCGAG
GTGGCTCGTT CCGGGCGTCC GGGCCCCGTG GTGGTCGACG TGCCCAAGAA CATGCAGAAC
TGGATCGGGG AGTTCCGCGG CGAGGGGGTG CTGGAGGTGC GCGGCTATCG CCAGCGCATG
GAATCCCTCC GCTCCGCCAA GCTCTCCGAT CGCAAATGCC GGGCGTTCAT GGAGATGCTC
GAGCGCTCCC GCCGGCCTCT GCTCTACGTC GGCGGCGGCG TGGTCAGTGG CGAGGCCCAC
CAGGAGTTGC GCGACTTCGC CCACACCTTC GGGATCCCCG TGGTCAGTAC CCTGATGGGC
CTGGGGGCGG TGGATACGAC CGATGACCTT TACCTGGGCA TGCTCGGTAT GCACGGCACG
GCCTACGCCA ACTACGCCGT GGAGGACTGT GATTTCCTCA TCGCGGTCGG TGCCCGTTTC
GATGACCGGG TCGCCGGTAA GGTGGAGGAG TTCGCGCCGC TGGCCGAGCA GATCGCCCAC
ATCGACATCG ATGCTGCTGA GATCGGCAAG GTCAAGGGCG TCGACTGGGC GCATGTCGGC
GAGGCCGGGC GCAGCCTGCG CCAGCTGTTG CGCTACGGCG AATCCATGGG CTTCGAGCCC
CGCTTCGATG CCTGGCTTGA GCACGTGCGC GGGCTGCGTG AGCGCCACCC CATGGATTAC
GACCGCGACA GCGCGCTGAT CCAGCCCCAC TACGTGCTCG AAAAACTCAA CGAACTGACT
GCCGGCGAGG CGATCATTGC CACCGGCGTC GGGCAGCACC AGATGTGGGC GGCACAGTAC
TGCGACTTCC GCGGTCCGCG GCAGTGGCTC ACCTCCGGCG GACTCGGCAC CATGGGCTTC
GGGCTGCCAG CGGCCATCGG CGCCTACCTG GGGCGTCCGG ACCGGGTGGT CATCGACGTC
GATGGCGACG GTTCGCTGCG AATGAACCTC GGTGAGCTAG AGACAGCCAC AACCTACAAC
CTGCCGGTCA AGATCCTGTC GCTGAACAAC GTCGGCGACG GCATGGTCCG GCAGTGGCAG
AAGCTCTACT TCGGGGATCG CTTCTCCGGG TCGGACAAGT CACTGCACCG CAAGGACTTC
ATCAAGGCGG CCGAGGCCGA TGGCTACGAA TTTGCCCGCC GCGTCGCGGA CAAGGAGGAA
CTCGAGGAGG CCCTGCGCGC GTTTGTCGAG TTCCCTGGGC CCGCCTTCCT GGAGGTTATG
ATCGATCCGG ATGCGGGGGT CTTCCCGATG GTCGGTCCGG GCAGCAGCTA CAAGGAGATG
GTCACCGGCG ATCACATCCC GAGCCGGGAT ATGGCCCTGC GCCCGCGTAC CAAGCTCGAG
GACGGCGAGT CGCCGGATCT GTTCTGA
 
Protein sequence
MDTAIVNPQA ETEATSHPLA GKTMSGGEII VEVLAQEGVD TLFGYSGGAI LPTYDAIFKY 
NERHRKEQAV EEEDPIRLIV PANEQGAGFM AAGYARATGK VGCFLVTSGP GATNTVTPIR
DCMADSVPVV AITGQVPTHA MGTDAFQEAP IVNIMGSCAK HVFLVTDPEQ LEATVRTAFE
VARSGRPGPV VVDVPKNMQN WIGEFRGEGV LEVRGYRQRM ESLRSAKLSD RKCRAFMEML
ERSRRPLLYV GGGVVSGEAH QELRDFAHTF GIPVVSTLMG LGAVDTTDDL YLGMLGMHGT
AYANYAVEDC DFLIAVGARF DDRVAGKVEE FAPLAEQIAH IDIDAAEIGK VKGVDWAHVG
EAGRSLRQLL RYGESMGFEP RFDAWLEHVR GLRERHPMDY DRDSALIQPH YVLEKLNELT
AGEAIIATGV GQHQMWAAQY CDFRGPRQWL TSGGLGTMGF GLPAAIGAYL GRPDRVVIDV
DGDGSLRMNL GELETATTYN LPVKILSLNN VGDGMVRQWQ KLYFGDRFSG SDKSLHRKDF
IKAAEADGYE FARRVADKEE LEEALRAFVE FPGPAFLEVM IDPDAGVFPM VGPGSSYKEM
VTGDHIPSRD MALRPRTKLE DGESPDLF