Gene Information |
Locus tag | Hhal_1812 |
Symbol | |
ID | 4711017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1984912 |
End bp | 1986312 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639856282 |
Product | isopropylmalate isomerase large subunit |
Protein accession | YP_001003378 |
Protein GI | 121998591 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit |
|
|
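The length fields in the table above are consistent with the coordinates: the span from Start bp to End bp is counted inclusively, and the protein length is the codon count minus one stop codon. A minimal sketch of that arithmetic follows. It assumes 1-based inclusive coordinates, which the reported 1401 bp implies, and an unspliced CDS. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1984912, 1986312)
print(length, protein_length(length))  # 1401 466
```

Both values match the Gene Length and Protein Length fields of this record.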
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.364535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGGA AGACCCTCTA CGACAAACTC TGGGATAGCC ACGTTGTCAC CGAGTACGAC GATGGATCGG CGCTGCTTTA CATCGATCGT CAGCTCCTGC ATGAGGTGAC CTCGCCGCAG GCGTTCGAGG GTCTGCGCCT GGCCGGTCGC CAGCCGTGGC GCGTGGCGTC CAATCTTGCC GTGACCGATC ACAACGTCCC CACCACCGAC CGCAGTCAGC CGGTGGAGGA TCCGGTCTCG CGGGTGCAGA TCGAGACGCT GGATCGCAAC TGCAAGGACT TCCAGGTGAT CGAGTTTGGT ATCCGCGACC CGCGCCAAGG GATCGTCCAC GTCGTCGGGC CTGAGCAGGG CACGACACTG CCGGGGATGA CCCTGGTCTG TGGCGACTCG CACACCTCGA CCCACGGCGC CCTGGGCGCA CTGGCCTTCG GGGTGGGCAC CAGTGAGGTG GAGCATGTCC TGGCCACGCA GACCCTGGTG CAGAAGAAGG CCCGCACCAT GCTCATCCGC ATCGATGGTC AGCTCGGGCG GGGGGTCACG GCCAAGGACA TCATCCTGGC GATCATCGGT CGCATCGGTA CTGCGGGCGG CACGGGGTAC GCACTGGAGT ACGGCGGCGA GGCCATACGC AGCCTCTCCA TGGAAGGGCG GATGACCATC TGCAACATGT CCATCGAGGC CGGGGCGCGT ACCGGTATGG TGGCTGTGGA TGACACCACC ATCGAGTATG TCCGGGGCCG TCCGAATGCC CCCGAGGGCG CGCTGTGGGA CCAGGCCGTG GCCAGTTGGC GCCACCTGGT CTCGGATGAG GATGCGGCCT TCGACCGGGT GGTGGAACTC CACGCCGACG AGATCGAGCC CCAGGTGACC TGGGGGACGT CGCCGGAGAT GGTCGCCTCG GTGAATCGCC GCGTCCCCGA CCCGGCGGAG GAGAGCGATG CGGTGCGCGC CCGGGCGATG GGCCGGGCCC TGGAGTACAT GGGCCTGGAG CCGGGGACGC CACTGACCGA TATCCCCATG GACAAGATCT TCATCGGGTC TTGCACCAAT GCCCGCATCG AGGACCTGCG CGAGGCGGCC GCCGTCGTCC ATGGCCGTCG GGTGGCTGAG AATATCCGCC AGGCGCTGGT GGTCCCCGGC TCCGGGGTGG TCAAGCAGCA GGCCGAAGGC GAGGGGCTCG ACCGGGTCTT TCTCGATGCC GGCTTCGAGT GGCGCGAACC GGGGTGCTCC ATGTGCCTGG GCATGAACCC CGACCGCCTG GAGCCCGGCG AGCGCTGTGC CTCGACCTCC AACCGGAACT TCGAGGGGCG CCAGGGCCAG GGCGGCCGCA CCCATCTGGC CAGCCCGGCG ATGGTGGCCG CTGCCGCCAT CCATGGTCAC TTCGTGGATA TCCGGGAGTA G
|
Protein sequence | MTGKTLYDKL WDSHVVTEYD DGSALLYIDR QLLHEVTSPQ AFEGLRLAGR QPWRVASNLA VTDHNVPTTD RSQPVEDPVS RVQIETLDRN CKDFQVIEFG IRDPRQGIVH VVGPEQGTTL PGMTLVCGDS HTSTHGALGA LAFGVGTSEV EHVLATQTLV QKKARTMLIR IDGQLGRGVT AKDIILAIIG RIGTAGGTGY ALEYGGEAIR SLSMEGRMTI CNMSIEAGAR TGMVAVDDTT IEYVRGRPNA PEGALWDQAV ASWRHLVSDE DAAFDRVVEL HADEIEPQVT WGTSPEMVAS VNRRVPDPAE ESDAVRARAM GRALEYMGLE PGTPLTDIPM DKIFIGSCTN ARIEDLREAA AVVHGRRVAE NIRQALVVPG SGVVKQQAEG EGLDRVFLDA GFEWREPGCS MCLGMNPDRL EPGERCASTS NRNFEGRQGQ GGRTHLASPA MVAAAAIHGH FVDIRE
|
| |
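The sequences above are displayed in 10-character blocks separated by spaces, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the sample input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join display blocks by removing all whitespace, and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence above, used only as sample input.
sample = clean_sequence("ATGACCGGGA AGACCCTCTA CGACAAACTC")
print(len(sample), sample[:3])  # 30 ATG
```

Applying `gc_percent` to the full cleaned gene sequence would let a reader check the value against the 68% reported in the Gene Information table.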