Gene Hhal_1812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1812 
Symbol 
ID4711017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1984912 
End bp1986312 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content68% 
IMG OID639856282 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001003378 
Protein GI121998591 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.364535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGGA AGACCCTCTA CGACAAACTC TGGGATAGCC ACGTTGTCAC CGAGTACGAC 
GATGGATCGG CGCTGCTTTA CATCGATCGT CAGCTCCTGC ATGAGGTGAC CTCGCCGCAG
GCGTTCGAGG GTCTGCGCCT GGCCGGTCGC CAGCCGTGGC GCGTGGCGTC CAATCTTGCC
GTGACCGATC ACAACGTCCC CACCACCGAC CGCAGTCAGC CGGTGGAGGA TCCGGTCTCG
CGGGTGCAGA TCGAGACGCT GGATCGCAAC TGCAAGGACT TCCAGGTGAT CGAGTTTGGT
ATCCGCGACC CGCGCCAAGG GATCGTCCAC GTCGTCGGGC CTGAGCAGGG CACGACACTG
CCGGGGATGA CCCTGGTCTG TGGCGACTCG CACACCTCGA CCCACGGCGC CCTGGGCGCA
CTGGCCTTCG GGGTGGGCAC CAGTGAGGTG GAGCATGTCC TGGCCACGCA GACCCTGGTG
CAGAAGAAGG CCCGCACCAT GCTCATCCGC ATCGATGGTC AGCTCGGGCG GGGGGTCACG
GCCAAGGACA TCATCCTGGC GATCATCGGT CGCATCGGTA CTGCGGGCGG CACGGGGTAC
GCACTGGAGT ACGGCGGCGA GGCCATACGC AGCCTCTCCA TGGAAGGGCG GATGACCATC
TGCAACATGT CCATCGAGGC CGGGGCGCGT ACCGGTATGG TGGCTGTGGA TGACACCACC
ATCGAGTATG TCCGGGGCCG TCCGAATGCC CCCGAGGGCG CGCTGTGGGA CCAGGCCGTG
GCCAGTTGGC GCCACCTGGT CTCGGATGAG GATGCGGCCT TCGACCGGGT GGTGGAACTC
CACGCCGACG AGATCGAGCC CCAGGTGACC TGGGGGACGT CGCCGGAGAT GGTCGCCTCG
GTGAATCGCC GCGTCCCCGA CCCGGCGGAG GAGAGCGATG CGGTGCGCGC CCGGGCGATG
GGCCGGGCCC TGGAGTACAT GGGCCTGGAG CCGGGGACGC CACTGACCGA TATCCCCATG
GACAAGATCT TCATCGGGTC TTGCACCAAT GCCCGCATCG AGGACCTGCG CGAGGCGGCC
GCCGTCGTCC ATGGCCGTCG GGTGGCTGAG AATATCCGCC AGGCGCTGGT GGTCCCCGGC
TCCGGGGTGG TCAAGCAGCA GGCCGAAGGC GAGGGGCTCG ACCGGGTCTT TCTCGATGCC
GGCTTCGAGT GGCGCGAACC GGGGTGCTCC ATGTGCCTGG GCATGAACCC CGACCGCCTG
GAGCCCGGCG AGCGCTGTGC CTCGACCTCC AACCGGAACT TCGAGGGGCG CCAGGGCCAG
GGCGGCCGCA CCCATCTGGC CAGCCCGGCG ATGGTGGCCG CTGCCGCCAT CCATGGTCAC
TTCGTGGATA TCCGGGAGTA G
 
Protein sequence
MTGKTLYDKL WDSHVVTEYD DGSALLYIDR QLLHEVTSPQ AFEGLRLAGR QPWRVASNLA 
VTDHNVPTTD RSQPVEDPVS RVQIETLDRN CKDFQVIEFG IRDPRQGIVH VVGPEQGTTL
PGMTLVCGDS HTSTHGALGA LAFGVGTSEV EHVLATQTLV QKKARTMLIR IDGQLGRGVT
AKDIILAIIG RIGTAGGTGY ALEYGGEAIR SLSMEGRMTI CNMSIEAGAR TGMVAVDDTT
IEYVRGRPNA PEGALWDQAV ASWRHLVSDE DAAFDRVVEL HADEIEPQVT WGTSPEMVAS
VNRRVPDPAE ESDAVRARAM GRALEYMGLE PGTPLTDIPM DKIFIGSCTN ARIEDLREAA
AVVHGRRVAE NIRQALVVPG SGVVKQQAEG EGLDRVFLDA GFEWREPGCS MCLGMNPDRL
EPGERCASTS NRNFEGRQGQ GGRTHLASPA MVAAAAIHGH FVDIRE