Gene Information |
Locus tag | Haur_3302 |
Symbol | |
ID | 5735172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4168134 |
End bp | 4169399 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641280449 |
Product | 3-isopropylmalate dehydratase large subunit |
Protein accession | YP_001546066 |
Protein GI | 159899819 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR01343] homoaconitate hydratase family protein; [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
|
|
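The Start bp, End bp, Gene Length, and Protein Length fields in this record are mutually consistent, and a short sketch can check that. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(4168134, 4169399)
print(length, protein_length(length))  # prints: 1266 421
```

The printed values match the Gene Length (1266 bp) and Protein Length (421 aa) rows.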
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.879653 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCAGA CATTTGCCGA GCAAATTTTG GGGCATGCTT CCGGTCGCAG CGACGTGCAA GCAGGCGATA TGGTGGTAGT CAACGTCGAT TTGGTGATGA TGCACGATAG CCTCTCGCCT AGCATTATCG AAACCTTGCA CAACGAATTA GGCGCTGAAC GCGTGTGGGA TCGCGACAAA GTTGCAGTGG TAATCGACCA CGTTGCCCCA GCCGCCACCG TGCGCCAAGC CGAGCAGCAA CAACAAGTGC GGCGTTGGGT CGCCCAACAA GGCATCAGCC ATTTGTTTGA TGTTGGTCGC GGCATCTCGC ACCCCGTGTT GATCGAAGAA GGGTTAGTGC AACCGGGCAT GCTGGTGGTT GGCAGCGATT CGCATAGCAC TGGCTATGGT GCAGCAGCGG CCTTTGGCTC AGGTATGGGC ACGACCGATA TCGCCTTAGC ACTGGCGACT GGCCAAACTT GGTTTCGCGT CCCAGAAACT GTGCGGGTCA ATGCGGTTGG CAATTTTCAA CCAGGCGTGA GCGTCAAGGA TTTTGGTTTG TGGGCGGCTC GCACGCTCCG CGCTGATGGA GCAACCTATC AAAGCGTCGA GTGGCACGGG GTTGATTTTC TTTCGTGGCG CGAACGTATG ACCTTGGCAA CTTTATCAAT TGAAGTTGGA GCCAAGGCCG GAATCGTTGC CCCAACTGGC TTGGGCGCTG AACATCCCGT GCCAGAATGG TTGAGGGTTG AGGCCGATGC CAGCTACAGC CGCGTTGTCG AATGCGATTT GAGCACGCTC GAACCGCAAG TTAGCGTACC GCATTATGTC GATAACGTAG TCGATTTGGC TGATGTCGGG CGGGTCGCGG TTGATGTCGT CTATCTTGGC ACCTGCACCA ATGGCCACTA CGAAGATATG GCCGCCGCCG CCAGCATTCT CAAGGGACGG CGTTTGGCTC CAAATGTGCG AATGATTGTC GTACCTGCTT CGAGCGAAAG TCTGCATCGC GCCGCCAGCG ATGGCACATT AGCTACCTTG TTGGCGGCTG GGGCAACCAT CGGTACACCT GGTTGTGGCG CATGCATTGG TCGCCATATG GGCGTGTTAG CGCCCGATGA AGTCTGCGTT TTCACTGGCA ATCGCAACTT CCGCGGACGA ATGGGCAGCC CAGGAGCCAA TATCTACTTG GCCTCGCCTG AAGTTGCCGC CGCCACCGCC GTCACGGGCT ACATCACCCA CCCGCGCAAT GTGCTCGATA GCACTGAGCA AGCAGTTTTC GCCTAA
|
Protein sequence | MGQTFAEQIL GHASGRSDVQ AGDMVVVNVD LVMMHDSLSP SIIETLHNEL GAERVWDRDK VAVVIDHVAP AATVRQAEQQ QQVRRWVAQQ GISHLFDVGR GISHPVLIEE GLVQPGMLVV GSDSHSTGYG AAAAFGSGMG TTDIALALAT GQTWFRVPET VRVNAVGNFQ PGVSVKDFGL WAARTLRADG ATYQSVEWHG VDFLSWRERM TLATLSIEVG AKAGIVAPTG LGAEHPVPEW LRVEADASYS RVVECDLSTL EPQVSVPHYV DNVVDLADVG RVAVDVVYLG TCTNGHYEDM AAAASILKGR RLAPNVRMIV VPASSESLHR AASDGTLATL LAAGATIGTP GCGACIGRHM GVLAPDEVCV FTGNRNFRGR MGSPGANIYL ASPEVAAATA VTGYITHPRN VLDSTEQAVF A
|
| |
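The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch of that step together with a GC percentage calculation. The helper names are hypothetical, and the example input is only the first two blocks of the Gene sequence field.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the Gene sequence field.
first_blocks = "ATGGGTCAGA CATTTGCCGA"
print(len(normalize(first_blocks)))
print(gc_percent(first_blocks))
```

Applied to the full Gene sequence field, the result can be compared with the GC content row of the record.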