Gene Haur_0133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0133 
Symbol 
ID5732028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp164250 
End bp165329 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content52% 
IMG OID641277257 
Productisocitrate/isopropylmalate dehydrogenase 
Protein accessionYP_001542913 
Protein GI159896666 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGC CAACGATTGT TGTCCTTGAG GGCGATCAAA CGGGGCAAGA ACTGCTTGAA 
GAAAGTCTGC GCGTCCTTGA TCCTGCGGTG ACGGGCGTTG ATATAGAGCT AAAGCGCTAC
GATTTGAGCC TCGAATCGCG CCGAGCGACC AATAATCAAA TTGTGTTGGA AGCAGCTCAA
GCCATGAAGG AAGCTGGTTT TGGCTTGAAA GCCGCTACAA TCACTCCTGA AAAAGCTGGC
GATGTTGGTA GCCCTAACGC TATTCTGCGC GAACAAATCA ATGGTACGGT GATTGTACGA
ACGGGCCGCC GGATTCCAGG CGTGCGCCCA GTTGGTGGTG CGTATGCGCC AATCTCGGTC
ATTCGCATGG CGGTTGACGA TGCCTATGGT GCCAAAGAAT GGCGCGAAGG CGAAGGCGAT
AATGAAGTTG CTTATCGCAC CGAGAAAATC ACCCGTGGCA CGTGCCGCGC CGTTTCAGAA
TATGCCTTTA TGCATGCTCG TCGCATGAAA GCCAAAGTTT TCGGTGGCCC CAAATATACG
GTTAGCCCAA TTTATGAAGG CATGCTTAAG GAAGAAATGG ATGCAGCCGC CAAGCGCTAT
GCCGATGTAC GCTACGAACC ACAGTTGATC GATGCGACCT ATGCTTTGCT CTTGACCAAC
TCGGGCGATC CAATGGTGAT TCCTGCGCTC AACCGCGACG GCGACTGCTT GAGCGACTTG
GTATTGCAAA TGTTCGGCAC GATTGCTGGC GCAGAATCAT TGCTCTTGGC CTTCGACAAA
GATTTCAAAG TTAATGTTGT GATGGCTGAA GCACCCCACG GCACGGCTCC CAGCTTGGAA
GGCAAGAATG TTGCTAATCC AATGGCGATG ATTTTGGCTT CGGCAGCCTT GCTTGATTAT
ATTGATACAC CGCAAGCCAA CATGGCAGCC CGCGCGATCA GCGAAGCTAC CTTGGAAGCT
GTCTACGACG GCGTGCGTAC TGCCGATTTG GGTGGCCACA CCACCACCAG CGATTTCACC
GACGAAGTGA TTCGCCGCGT AAAAACCAAA ATGGAAGTTT GGCCATCGCT CGGTAACTAA
 
Protein sequence
MSKPTIVVLE GDQTGQELLE ESLRVLDPAV TGVDIELKRY DLSLESRRAT NNQIVLEAAQ 
AMKEAGFGLK AATITPEKAG DVGSPNAILR EQINGTVIVR TGRRIPGVRP VGGAYAPISV
IRMAVDDAYG AKEWREGEGD NEVAYRTEKI TRGTCRAVSE YAFMHARRMK AKVFGGPKYT
VSPIYEGMLK EEMDAAAKRY ADVRYEPQLI DATYALLLTN SGDPMVIPAL NRDGDCLSDL
VLQMFGTIAG AESLLLAFDK DFKVNVVMAE APHGTAPSLE GKNVANPMAM ILASAALLDY
IDTPQANMAA RAISEATLEA VYDGVRTADL GGHTTTSDFT DEVIRRVKTK MEVWPSLGN