Gene Haur_3304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3304 
Symbol 
ID5735174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4169928 
End bp4170923 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content56% 
IMG OID641280451 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_001546068 
Protein GI159899821 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR02088] isopropylmalate/isohomocitrate dehydrogenases 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.652858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGGT TGTGTTTGAT TGCTGGCGAT GGGATTGGCC GCGAAGTGGT GCAAGCAGCC 
CGCCAAGTGC TCGAAGCCTT AGCAGTTCCT GCCGAGTTTG TTGAGGCCGA GGCTGGTTGG
GAAACCTTTC AGCGCACTGG CAACGCCTTG CCCGAACAAA CTTTAGCGGC TGTCCAAGCG
GCCAATTCAA CCTTGTTTGG CGCAGTTAGT TCGCCATCGC AGCGGGTCGC TGGCTATCGC
AGTCCAATTG TTGGCATGCG CAAAGCCATT GATTTATATG CCTGTGTGCG GCCAGTCCAA
ACGCCACCGC TGGCCAACGC CCGCGCTGGA GTCAATTTGG TGGTGGTACG CGAAAATACC
GAGGGTTTGT ATAGCGGCCA AGAAACCCGC GAGGGCGATG AACGGGCTAC GGCGCAACGA
ATTATCACCC GCCAAGCCAG CGAACGGATT GTGCAATGGG CAGTCCAATA TGCCCAACGC
ACTGGCCGCC GCAAAATCAC GGTGGTACAC AAAGCCAATG TGCTACGCGA AACCTGTGGT
TTGTTCCGCG AAACTGCACT GCGCGTGCTA AGCGATGCGC CCGATTTACA AGTTGAAGAA
ATGTTAGTCG ATAACGCTGC TTATCAATTG GCGCGTGCTC CCGAGCGCTT TGAAGTGTTG
GTCACCACCA ATTTGTTTGG CGATATTCTC TCAGATGTAG CTAGCGTTTG GGGTGGCGGG
CTTGGTTTGG CAGCATCGGC CAATTATGGC ACGCGCACGG CGGTATTCGA GCCTGTGCAT
GGTAGCGCAC CGGATATTGC CGGCCAAGGC ATCGCCAATC CCTTGGCAAC CTTGAGCGCT
AGCGTGTTGA TGCTCGAATT TGTGGGCTTG AACAGCTACG CCGAGCGTTT GCAAACTGCG
ATTCAAGCGG TATTAGCCAA TGGGCCATAT ACGCCCGATC TAGCTGGCGC GGCGACGACT
GCCGAAGTAG TGCAAGCCGT GATTGACCAA TTTTGA
 
Protein sequence
MTRLCLIAGD GIGREVVQAA RQVLEALAVP AEFVEAEAGW ETFQRTGNAL PEQTLAAVQA 
ANSTLFGAVS SPSQRVAGYR SPIVGMRKAI DLYACVRPVQ TPPLANARAG VNLVVVRENT
EGLYSGQETR EGDERATAQR IITRQASERI VQWAVQYAQR TGRRKITVVH KANVLRETCG
LFRETALRVL SDAPDLQVEE MLVDNAAYQL ARAPERFEVL VTTNLFGDIL SDVASVWGGG
LGLAASANYG TRTAVFEPVH GSAPDIAGQG IANPLATLSA SVLMLEFVGL NSYAERLQTA
IQAVLANGPY TPDLAGAATT AEVVQAVIDQ F