Gene Haur_4441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4441 
Symbol 
ID5736292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5681652 
End bp5683067 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content54% 
IMG OID641281604 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001547201 
Protein GI159900954 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAGA CATTATTTGA GAAAATCTGG GATGCCCATG TTGTTCAAGC TGCTGACGCT 
GAATCGCCAG CAACTTTATA TATCGATTTG CACTTGGTAC ACGAAGTAAC CTCGCCGCAA
GCCTTTACGA TGTTGCGCGA ACGTGGTTTG ACCGTGCGTC GCCCAGCCCA AACCCTCGCC
ACCATGGACC ATAGCACCCC AACCACGCCA CGCGGCGCTG ATGGGATTAT TCCAGTGACC
GATGCAATTG CTCGCAAACA GCTTGATCAA TTGATCAAAA ATTGTAGCGA TTTTGGCATA
CCATTGTACA ACTTGGGCAC TGAAAACCAA GGCATTGTGC ACGTAATCGG GCCAGAACAA
GGCTACACCC AACCAGGTAT GACGATTGTG TGTGGCGATT CGCACACCAG CACCCACGGC
GCATTCGGAG CCTTGGCCTT CGGCATTGGC ACCAGCGAAG TTGGTCACGT GTTGGCAACT
CAATGTTTGT TGCAGCAAAA ACCTCAAACC GCCGAAATTC GCATCGATGG CACGCTGCGT
CCAGGTGTGA CCGCCAAAGA CATCATCTTG GCGATTATTG CCAAAATTGG GGTTGGCGGC
GGCACGGGCT ATGTACTTGA ATACACTGGC TCGGCGATTC GCGCCCTGAC CATGGAAGAA
CGCATGACGA TTTGTAATAT GTCGATCGAA GGTGGAGCAC GGGCTGGCTT GATCGCGCCT
GATGAAACCA CCTTTGCTTG GCTGAAAGAT CGGCCACATA CGCCCAAGGG TGAAGCATGG
GATGCTGCGG TCGAATATTG GCGCACGTTG CCCAGCGACG AAGGCGCAAC CTACGATTTG
CAAGTTGTCT TGAATGCTGA CGAATTGGCT CCGATGATCA CCTATGGCAC GAATCCCGGC
ATGGGCATTC CGGTCACTGG CAATGTGCCA GCACCCAGCG ATTTGGCCGA TGATAGCCAA
CGCATGGCCT TGGATAAAGC CTTGAATTAC ATGGGCTTGC AACCAAACCA ATCGCTGATC
GGCCAAAAAG TCGATGTAGT GTTCCTTGGC TCCTGCACCA ACTCGCGGAT TTCGGATTTA
CGGGCGGCGG CCAAAGTGAT CGAAGGCAAA AAAGTTGCCG ACGGCCTACG CATGTTGGTC
GTGCCTGGCT CACAACAAGT CAAGCGCCAA GCCGAAGCTG AAGGCTTAGA CCAAATTTTC
CGCGCGGCAG GCGCTGAATG GCGTGAAGCT GGCTGCTCGA TGTGTATTGC CATGAACGGC
GATCAACTGC AACCAGGCCA ATATGCAGTT AGCACCAGCA ACCGCAACTT TGAAGGCCGC
CAAGGCAAGG GCGGGCGCAC CTTCCTCGCC AGCCCGCTGA CCGCTGCCGC TACCGCAATC
AACGGCCATA TCGTCGATGT ACGCGAAATC TTGTAA
 
Protein sequence
MAKTLFEKIW DAHVVQAADA ESPATLYIDL HLVHEVTSPQ AFTMLRERGL TVRRPAQTLA 
TMDHSTPTTP RGADGIIPVT DAIARKQLDQ LIKNCSDFGI PLYNLGTENQ GIVHVIGPEQ
GYTQPGMTIV CGDSHTSTHG AFGALAFGIG TSEVGHVLAT QCLLQQKPQT AEIRIDGTLR
PGVTAKDIIL AIIAKIGVGG GTGYVLEYTG SAIRALTMEE RMTICNMSIE GGARAGLIAP
DETTFAWLKD RPHTPKGEAW DAAVEYWRTL PSDEGATYDL QVVLNADELA PMITYGTNPG
MGIPVTGNVP APSDLADDSQ RMALDKALNY MGLQPNQSLI GQKVDVVFLG SCTNSRISDL
RAAAKVIEGK KVADGLRMLV VPGSQQVKRQ AEAEGLDQIF RAAGAEWREA GCSMCIAMNG
DQLQPGQYAV STSNRNFEGR QGKGGRTFLA SPLTAAATAI NGHIVDVREI L