Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0607 |
Symbol | |
ID | 5732505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 699118 |
End bp | 700401 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277734 |
Product | imidazolonepropionase |
Protein accession | YP_001543383 |
Protein GI | 159897136 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCACG CTGATCAATT AATTACCAAT ATAGGTCGCT TGGTTACTGG CCCGCAAGCA CCTTTACGCG GCCAACAGCT GGCCCAATTA ACCGCTATCG ACCAAGCTGT TGTTGCTGTC CAAGCAGGCA ACATCGTGGC ACTTGGCAGT CAGGCCGAGC TTAGCGCTTG GACTGCCGAT CAAACAATCG ATGCAGGTGG TTATTTGGCA ATCCCAGGCT TTGTCGATCC CCACACCCAC GCTTGTTATG CAGGCGATCG CGCCCATGAG TTCGAATTAC GAATCAAAGG CGCAAGCTAT AGCGAATTAA TGGCGGCTGG TGGAGGAATT ATGTCAACAG TTCATGCCAC ACGAGCAGCC AGTAAAGCCG AATTAGTTGC CCAAACCCGC CCACGGCTTG ATCAATTATT GGCCCATGGC ACAACTACCG TCGAAATCAA AAGTGGCTAT GGGCTTGATA CCGCTACCGA ATTAACCATG CTCGAAGCGA TCGCTGAGCT AGCCCAAACT CATCCAATTG GCATCGTGCC GACCTTTATG GGGGCACACG CCATTCCCGC CGAATATCGC GACAATCCAG AAGCGTTTGT AGATTTGGTG GTTGATGAGA TGTTGCCTGC AGTAGCAGCT TGGTGGCAAC AGCAAACAAT CTGGCAAGAA CCCTTGGCTT GCGACATTTT CTGCGAAAAT GGAGCCTTTT CAGTTGCCCA AAGCCAACGT ATTTTGGTTA AGGCCAAAGC ATTGGGCTTT CGCTTGAAAT TGCATGTCGA TGAGTTTGAG CCGTTGGGTG GCACGCCGCT AGCGGTCGAA CTAGGGGCAA TCTCGGTTGA TCACTTGGTC GCCACGCCGC CCGAACATAT CGCGATCTTG GCAAATTCAG AAACCGTGGG CGTTTCGTTG CCTGGCACGC CGTTTGGTCT GGGCAAGAGT CAATTCAGCC CAGCTCGCAG TTTAATCGAA GCGAACGGAA TTTTGGCCCT AGCCACCGAT TGTAATCCAG GCACCAGCCC TTGTGAATCG ATGCCGATGG CAATTGCCAT TGCTTGTCGC TATTTACGGC TGACTCCAGC CGAGGCCTTG AACGCGGCCA CGGTTAATTC AGCGTTTGCA ATTCGCCAGC ATGAGCGCGT TGGTAGCTTA GCAGTTGGCA TGCAAGCTGA TCTGGCCTTG CTCAACCTGC CCGACGAACG CCATATCGGC TATAAGTTTG GCACAAATCC AGTCGCCATC GTGATCAAAA CAGGTAGGGT GGTCCGTCGG AACCAACTTC ACGCTGATCG CTAA
|
Protein sequence | MPHADQLITN IGRLVTGPQA PLRGQQLAQL TAIDQAVVAV QAGNIVALGS QAELSAWTAD QTIDAGGYLA IPGFVDPHTH ACYAGDRAHE FELRIKGASY SELMAAGGGI MSTVHATRAA SKAELVAQTR PRLDQLLAHG TTTVEIKSGY GLDTATELTM LEAIAELAQT HPIGIVPTFM GAHAIPAEYR DNPEAFVDLV VDEMLPAVAA WWQQQTIWQE PLACDIFCEN GAFSVAQSQR ILVKAKALGF RLKLHVDEFE PLGGTPLAVE LGAISVDHLV ATPPEHIAIL ANSETVGVSL PGTPFGLGKS QFSPARSLIE ANGILALATD CNPGTSPCES MPMAIAIACR YLRLTPAEAL NAATVNSAFA IRQHERVGSL AVGMQADLAL LNLPDERHIG YKFGTNPVAI VIKTGRVVRR NQLHADR
|
| |