Gene Haur_4446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4446 
Symbol 
ID5736297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5687284 
End bp5688951 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content52% 
IMG OID641281609 
Product2-isopropylmalate synthase 
Protein accessionYP_001547206 
Protein GI159900959 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGCG ATTATGTACG CATTTTCGAT ACAACCCTGC GCGACGGTGA GCAATCACCA 
GGCTGTACCA TGCATCTGCA CGAAAAATTG GAAGTTGCCA AGCAATTGGC GCGAATGGGC
GTGGATATTA TCGAAGCGGG CTTTCCGGCG GCCTCACCTG GCGATTGGGA AGCAGTTAAC
CAAATTGCCA AGAGCGTTGG CACTGCCGAT GGTCCAGTGA TCTGTGGCTT AGCACGAGCA
GTCAAAAGCG ATATCGAAGC CTGTGCCACA GCGATTGCAC CTGCCGCCAA AAAGCGTATT
CACACTTTCC TTTCAAGCTC AGATATTCAT ATCGAGTATC AATTGCGCTC AACCCGCGAG
GAAGTGCTGA CGAAAACCCG TGAGATGGTG CGCTTGGCTA GCTCGCTTTG CGACGATGTT
GAATTCTCGC CGATGGATGC GACCCGTTCC GACATCGAAT TTATGTATCA GATGCTGGCG
ATAGCGATTG AAGAAGGCGC AACAACCTTG AATATTCCTG ATACCGTGGG CTATTCCACG
CCTGAGGAAT ATGCAGCGTT GCTTCGGGGG ATTATCGAAA ACGTACCTGG TGCTGCTGAT
GTGATTATCT CAACCCATTG CCACGACGAT TTGGGCTTGG CGGTGGCGAA TTCGTTGGCT
GGCGTTCGAG CTGGCGCACG CCAAATTGAA TGCACAATCA ATGGCATCGG CGAGCGAGCT
GGCAATGCTT CGTTGGAAGA AGTGGTGATG GCTTTGGAAA CGCGCAAGCA ATTTTACGGC
CTGAGCACCA ATATCGACAC AACCCAACTA ACGCCTAGCT CACGCTTGTT GAGTGCCTGC
ACCAACACCC AAGTGCCACC CAACAAGGCA ATTGTTGGCG CGAATGCCTT TGCTCACGAA
TCGGGCATTC ACCAAGATGG CGTGTTGAAA CACCGTATGA CCTACGAAAT TATGAGCGCC
GAATCGGTGG GTCAAGATGG CAATGCCTTG GTTTTGGGCA AACACTCAGG CCGCCATGCC
TTCCGCCATC GGGTGCAAGA ATTGGGCTAT AGCTTCGATG AAGCTACGAT CAACCACTTG
TTTGCCCGCT TCAAAGATGT AGCTGATCGC AAGAAATATG TTGATGATCG CGATGTTGAG
GCCTTGATTA GCGATGAAGC AGGCCGCCCA AGCCCAGTTT ATGAATTGGA GCATGTGCAA
TTTGCCTCAG GCGTGAACGC CATCCCAACC GCAACCGTGC GCATGCGCGG CCCCAACGGC
GAAGTCAAAA TCGAATCAGC CCAAGGCACA GGCCCAGTGG ATGCAGTCTA TTCGGCGATC
AACAAAGTTG TGCTAACTCC GGTTACGTTG CTCGAATTTG CGGTCAATGC GATCACCGAA
GGTATCGATG CGGTTGGCGA AGTTAGCGTG AAAGTCGTCG AAGGCAAGCA TAAACATGTG
CGTGGAGTTT TAACCGACAA AGCAACAGAT GCCAAAGTTT GGCGCGGCAA TGGAGTCAAT
GTTGATATCA TCGTTGCGGC TGCCGAAGCC TATGTTAGCG CCTTGAACAA ATTGCTGAAA
GCTCGCCAAG AACGGCTCAA GCAAGAAACC ACCGCCCGCA TGGAAGTTGC CAGCGCTGCG
CCAGGCGTGG ATTTGTTCGG TGGCTCGACC CTTGGACGGT ATGAATAA
 
Protein sequence
MTSDYVRIFD TTLRDGEQSP GCTMHLHEKL EVAKQLARMG VDIIEAGFPA ASPGDWEAVN 
QIAKSVGTAD GPVICGLARA VKSDIEACAT AIAPAAKKRI HTFLSSSDIH IEYQLRSTRE
EVLTKTREMV RLASSLCDDV EFSPMDATRS DIEFMYQMLA IAIEEGATTL NIPDTVGYST
PEEYAALLRG IIENVPGAAD VIISTHCHDD LGLAVANSLA GVRAGARQIE CTINGIGERA
GNASLEEVVM ALETRKQFYG LSTNIDTTQL TPSSRLLSAC TNTQVPPNKA IVGANAFAHE
SGIHQDGVLK HRMTYEIMSA ESVGQDGNAL VLGKHSGRHA FRHRVQELGY SFDEATINHL
FARFKDVADR KKYVDDRDVE ALISDEAGRP SPVYELEHVQ FASGVNAIPT ATVRMRGPNG
EVKIESAQGT GPVDAVYSAI NKVVLTPVTL LEFAVNAITE GIDAVGEVSV KVVEGKHKHV
RGVLTDKATD AKVWRGNGVN VDIIVAAAEA YVSALNKLLK ARQERLKQET TARMEVASAA
PGVDLFGGST LGRYE