Gene Haur_3302 details

Gene Information

Locus tag: Haur_3302
Symbol:
ID: 5735172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4168134
End bp: 4169399
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 57%
IMG OID: 641280449
Product: 3-isopropylmalate dehydratase large subunit
Protein accession: YP_001546066
Protein GI: 159899819
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
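
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length.

```python
# Illustrative consistency check for the record's coordinates and lengths.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the final codon is an untranslated stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4168134, 4169399)
print(length_bp)                  # 1266
print(protein_length(length_bp))  # 421
```

Both values agree with the Gene Length and Protein Length fields of the record.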


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.879653
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTCAGA CATTTGCCGA GCAAATTTTG GGGCATGCTT CCGGTCGCAG CGACGTGCAA 
GCAGGCGATA TGGTGGTAGT CAACGTCGAT TTGGTGATGA TGCACGATAG CCTCTCGCCT
AGCATTATCG AAACCTTGCA CAACGAATTA GGCGCTGAAC GCGTGTGGGA TCGCGACAAA
GTTGCAGTGG TAATCGACCA CGTTGCCCCA GCCGCCACCG TGCGCCAAGC CGAGCAGCAA
CAACAAGTGC GGCGTTGGGT CGCCCAACAA GGCATCAGCC ATTTGTTTGA TGTTGGTCGC
GGCATCTCGC ACCCCGTGTT GATCGAAGAA GGGTTAGTGC AACCGGGCAT GCTGGTGGTT
GGCAGCGATT CGCATAGCAC TGGCTATGGT GCAGCAGCGG CCTTTGGCTC AGGTATGGGC
ACGACCGATA TCGCCTTAGC ACTGGCGACT GGCCAAACTT GGTTTCGCGT CCCAGAAACT
GTGCGGGTCA ATGCGGTTGG CAATTTTCAA CCAGGCGTGA GCGTCAAGGA TTTTGGTTTG
TGGGCGGCTC GCACGCTCCG CGCTGATGGA GCAACCTATC AAAGCGTCGA GTGGCACGGG
GTTGATTTTC TTTCGTGGCG CGAACGTATG ACCTTGGCAA CTTTATCAAT TGAAGTTGGA
GCCAAGGCCG GAATCGTTGC CCCAACTGGC TTGGGCGCTG AACATCCCGT GCCAGAATGG
TTGAGGGTTG AGGCCGATGC CAGCTACAGC CGCGTTGTCG AATGCGATTT GAGCACGCTC
GAACCGCAAG TTAGCGTACC GCATTATGTC GATAACGTAG TCGATTTGGC TGATGTCGGG
CGGGTCGCGG TTGATGTCGT CTATCTTGGC ACCTGCACCA ATGGCCACTA CGAAGATATG
GCCGCCGCCG CCAGCATTCT CAAGGGACGG CGTTTGGCTC CAAATGTGCG AATGATTGTC
GTACCTGCTT CGAGCGAAAG TCTGCATCGC GCCGCCAGCG ATGGCACATT AGCTACCTTG
TTGGCGGCTG GGGCAACCAT CGGTACACCT GGTTGTGGCG CATGCATTGG TCGCCATATG
GGCGTGTTAG CGCCCGATGA AGTCTGCGTT TTCACTGGCA ATCGCAACTT CCGCGGACGA
ATGGGCAGCC CAGGAGCCAA TATCTACTTG GCCTCGCCTG AAGTTGCCGC CGCCACCGCC
GTCACGGGCT ACATCACCCA CCCGCGCAAT GTGCTCGATA GCACTGAGCA AGCAGTTTTC
GCCTAA
 
Protein sequence
MGQTFAEQIL GHASGRSDVQ AGDMVVVNVD LVMMHDSLSP SIIETLHNEL GAERVWDRDK 
VAVVIDHVAP AATVRQAEQQ QQVRRWVAQQ GISHLFDVGR GISHPVLIEE GLVQPGMLVV
GSDSHSTGYG AAAAFGSGMG TTDIALALAT GQTWFRVPET VRVNAVGNFQ PGVSVKDFGL
WAARTLRADG ATYQSVEWHG VDFLSWRERM TLATLSIEVG AKAGIVAPTG LGAEHPVPEW
LRVEADASYS RVVECDLSTL EPQVSVPHYV DNVVDLADVG RVAVDVVYLG TCTNGHYEDM
AAAASILKGR RLAPNVRMIV VPASSESLHR AASDGTLATL LAAGATIGTP GCGACIGRHM
GVLAPDEVCV FTGNRNFRGR MGSPGANIYL ASPEVAAATA VTGYITHPRN VLDSTEQAVF
A
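
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch does not model).

```python
# Minimal sketch: clean a block-formatted sequence, compute GC content,
# and translate a CDS. All function names here are illustrative.

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence as printed in this record.
first_line = (
    "ATGGGTCAGA CATTTGCCGA GCAAATTTTG "
    "GGGCATGCTT CCGGTCGCAG CGACGTGCAA"
)
print(translate(clean_sequence(first_line)))  # MGQTFAEQILGHASGRSDVQ
```

The translated prefix matches the first twenty residues of the protein sequence listed above. Applying `gc_content` to the full cleaned gene sequence would give a value to compare against the record's GC content field.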