Gene Hhal_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1810 
Symbol 
ID4711007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1983109 
End bp1984179 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content70% 
IMG OID639856280 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_001003376 
Protein GI121998589 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.704264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCAGA AGATCCTGCT GCTCCCCGGG GACGGTATCG GCCCGGAGAT CACCGCGGAG 
GCCCGGCGTG TGCTCGAGGC GCTGAATAAG CGCTACGGGG TCGGCTGCGA GATGGAGACG
GCGCCCATTG GCGGCGCCGG TTACGACGCC GCCGGCCAGC CCCTGCCCGA CGAGACCCTG
CGCCTGGCGC GGGAGGCGGA TGCGGTGCTG CTGGGGGCCG TCGGCGGGCC GCAGTACGAT
GCGCTCCCAC GGGACGTCCG ACCGGAGCGG GGGCTTCTGG CGATCCGCTC GGAGCTGGGT
CTGTTCGGCA ATCTGCGCCC GGCCATCCTG TATCCGCAGC TGGCCGGCGC CTCGGCGCTG
CGGGAAGATG TTGTCGCGGG GCTGGATATC CTCATCGTTC GCGAACTCAC CGGCGGGATC
TACTTTGGCC AGCCCCGCGG GATCCGCACC CTGGACAGTG GCGAGCGTCA GGGCTTCAAC
ACCGAGGTCT ATAGCGAGTC GGAGATCGAG CGCATTGCCC GTCTCGCCTT CGCCGCCGCC
GAGCAGCGCC AGGGACGGGT CTGCTCGGTG GACAAGGCCA ATGTCCTGGA AAGCTCGGAG
CTATGGCGCG AAGTGGTCGA GCGGGTGGCG GCGGACTACC CGGGTGTCGA GCTCAGCCAC
ATGTACGTGG ACAACGCCGC CATGCAGCTG GTGCGTGCGC CGAAGCAGTT CGATGTGGTG
GTCACCGGGA ACCTGTTCGG GGACATCCTC TCGGATTGCG CCGCGCAGCT GACCGGCTCC
ATCGGCATGC TCCCGTCCGC CTCCCTCGAT GAACACGGCA AGGGGCTCTA CGAGCCGGTC
CACGGCTCGG CGCCGGATAT TGCCGGGCAG GACAAGGCGA ACCCGCTAGC CACCATCCTC
TCGGTGGCCA TGATGCTGCG CTACAGCCTG GGCGCGGGTG AGGCCGCGGA CCGGGTCGAG
GCCGCCGTGG GGGCGGTGCT CGAGGAGGGG TTGCGCACTC CGGACCTGCA GGGCGGCAAC
CGGCCGGTGG GCACTCGTGA GATGGGTGAG GCAGTGGCGG GGCGGCTGTG A
 
Protein sequence
MAQKILLLPG DGIGPEITAE ARRVLEALNK RYGVGCEMET APIGGAGYDA AGQPLPDETL 
RLAREADAVL LGAVGGPQYD ALPRDVRPER GLLAIRSELG LFGNLRPAIL YPQLAGASAL
REDVVAGLDI LIVRELTGGI YFGQPRGIRT LDSGERQGFN TEVYSESEIE RIARLAFAAA
EQRQGRVCSV DKANVLESSE LWREVVERVA ADYPGVELSH MYVDNAAMQL VRAPKQFDVV
VTGNLFGDIL SDCAAQLTGS IGMLPSASLD EHGKGLYEPV HGSAPDIAGQ DKANPLATIL
SVAMMLRYSL GAGEAADRVE AAVGAVLEEG LRTPDLQGGN RPVGTREMGE AVAGRL