Gene Information |
Locus tag | Hhal_1810 |
Symbol | |
ID | 4711007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1983109 |
End bp | 1984179 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856280 |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_001003376 |
Protein GI | 121998589 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0473] Isocitrate/isopropylmalate dehydrogenase |
TIGRFAM ID | [TIGR00169] 3-isopropylmalate dehydrogenase |
|
|
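The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1983109, 1984179)
print(length, expected_protein_length(length))  # 1071 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields of this record.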
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.704264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCAGA AGATCCTGCT GCTCCCCGGG GACGGTATCG GCCCGGAGAT CACCGCGGAG GCCCGGCGTG TGCTCGAGGC GCTGAATAAG CGCTACGGGG TCGGCTGCGA GATGGAGACG GCGCCCATTG GCGGCGCCGG TTACGACGCC GCCGGCCAGC CCCTGCCCGA CGAGACCCTG CGCCTGGCGC GGGAGGCGGA TGCGGTGCTG CTGGGGGCCG TCGGCGGGCC GCAGTACGAT GCGCTCCCAC GGGACGTCCG ACCGGAGCGG GGGCTTCTGG CGATCCGCTC GGAGCTGGGT CTGTTCGGCA ATCTGCGCCC GGCCATCCTG TATCCGCAGC TGGCCGGCGC CTCGGCGCTG CGGGAAGATG TTGTCGCGGG GCTGGATATC CTCATCGTTC GCGAACTCAC CGGCGGGATC TACTTTGGCC AGCCCCGCGG GATCCGCACC CTGGACAGTG GCGAGCGTCA GGGCTTCAAC ACCGAGGTCT ATAGCGAGTC GGAGATCGAG CGCATTGCCC GTCTCGCCTT CGCCGCCGCC GAGCAGCGCC AGGGACGGGT CTGCTCGGTG GACAAGGCCA ATGTCCTGGA AAGCTCGGAG CTATGGCGCG AAGTGGTCGA GCGGGTGGCG GCGGACTACC CGGGTGTCGA GCTCAGCCAC ATGTACGTGG ACAACGCCGC CATGCAGCTG GTGCGTGCGC CGAAGCAGTT CGATGTGGTG GTCACCGGGA ACCTGTTCGG GGACATCCTC TCGGATTGCG CCGCGCAGCT GACCGGCTCC ATCGGCATGC TCCCGTCCGC CTCCCTCGAT GAACACGGCA AGGGGCTCTA CGAGCCGGTC CACGGCTCGG CGCCGGATAT TGCCGGGCAG GACAAGGCGA ACCCGCTAGC CACCATCCTC TCGGTGGCCA TGATGCTGCG CTACAGCCTG GGCGCGGGTG AGGCCGCGGA CCGGGTCGAG GCCGCCGTGG GGGCGGTGCT CGAGGAGGGG TTGCGCACTC CGGACCTGCA GGGCGGCAAC CGGCCGGTGG GCACTCGTGA GATGGGTGAG GCAGTGGCGG GGCGGCTGTG A
|
Protein sequence | MAQKILLLPG DGIGPEITAE ARRVLEALNK RYGVGCEMET APIGGAGYDA AGQPLPDETL RLAREADAVL LGAVGGPQYD ALPRDVRPER GLLAIRSELG LFGNLRPAIL YPQLAGASAL REDVVAGLDI LIVRELTGGI YFGQPRGIRT LDSGERQGFN TEVYSESEIE RIARLAFAAA EQRQGRVCSV DKANVLESSE LWREVVERVA ADYPGVELSH MYVDNAAMQL VRAPKQFDVV VTGNLFGDIL SDCAAQLTGS IGMLPSASLD EHGKGLYEPV HGSAPDIAGQ DKANPLATIL SVAMMLRYSL GAGEAADRVE AAVGAVLEEG LRTPDLQGGN RPVGTREMGE AVAGRL
|
| |
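The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in its permitted start codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first protein block.
print(translate("ATGGCCCAGA AGATCCTGCT GCTCCCCGGG"))  # MAQKILLLPG
```

Passing the full Gene sequence field to `translate` and `gc_percent` in the same way would allow the Protein sequence and GC content fields to be compared against computed values.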