Gene Information |
Locus tag | Haur_1200 |
Symbol | |
ID | 5733093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1382713 |
End bp | 1384191 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278340 |
Product | inosine-5'-monophosphate dehydrogenase |
Protein accession | YP_001543976 |
Protein GI | 159897729 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase; [COG0517] FOG: CBS domain |
TIGRFAM ID | [TIGR01302] inosine-5'-monophosphate dehydrogenase |
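
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the last codon of the CDS is a stop codon. The function names are hypothetical and only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1382713, 1384191)
print(length, protein_length_aa(length))  # prints "1479 492"
```

Both values match the Gene Length and Protein Length rows of the record.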
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATCG ATTGGAGTGC TAAGTTCGCG TTTGAAGGGC TGACCTATGA TGATGTGCTG TTGATTCCAG CCTACTCGGA TGTCTTGCCT TCACAAATCG ACGTTAGCAC GTGGCTAACC CGCGAGATTC GTTTGAACAT TCCAGTTGTC TCATCTGCGA TGGATACGGT GACCGAAGAT CGCATGGCAA TTGCGCTGGC TCGCGAAGGT GGCCTTGGCA TTATTCACAA AAATATGGCT CCTGCCCAAC AAGCAGATTT GGTGCGGCGG GTCAAACGCT CAGAAAGTGG CATGATCACT GATCCCATCA CTTTGCGCCC CGAACAAACC ATCGGCGAAG CCTGGGAACT CATGAGCGAT TACCATATTT CAGGGGTGCC GATTACCAGC GCTGCTGGCG AACTCGTCGG GATACTTACC AACCGCGATT TACGCTTTGA AACCGACCCC AGCCGCAAAA TCAGCGAATT GATGACCAGC GAAGAATTGG TCACCGTGCC AGTTGGCACA ACCCTCGAAC AAGCCAAACA AGCCTTGCAT CAACACCGGA TCGAAAAGGT TTTGGTGGTT GATGAACATG GCAAACTCAA TGGTCTGATC ACGGTCAAGG ATATTCAAAA GCAGATCGAA CATCCTAACG CCACCAAAGA TGCTTATGGC CGCTTGCGAG TTGGCGCTGC CGTCGGCGCT TCGACCAGCG AACTCGAGCG GGTGCGTTTG TTGGTTGAGG CTGGGGTTGA TGTGATTGCG GTCGATACTG CTCATGGCCA CTCCAAAGCC GTGCTCGATG CAATTGCCCG CATCAAACAA CAATATCCTG AGCTGCAAAT CATTGGTGGG AATGTGAGTA CTGGCGAAGG TGCTCGTGCC TTGATCGAAC ACGGCGCTGA TGCGGTTAAG GTCGGACAAG GGCCCGGATC GATTTGTACC ACTCGGGTGG TATCAGGTGC GGGGATGGCC CAAGTTACAG CGGTAATGGA GTGTGTTAAA GCTGCCGAAG AAGCTGGCGT GCCAATCATT GCTGATGGTG GGATTAAGTA TAGTGGTGAT GTGGCCAAAG CACTGGCTGC CGGCGCACAC ACGGTTATGC TGGGTGGCTT ATTAGCAGGA ACCGACGAAT CGCCAGGCGA GATGATTCTC TACCAAGGCC GCTCATTCAA ATCGTATCGT GGCATGGGTT CGATTGGTGC GATGCAGCAA GGCAGTAGCG ACCGCTATTT CCAAAGCAAC CAGCCTGCTC GCAAGTTGGT GGCCGAAGGA ATTGAAGGGA TGGTTCCCTA CAAAGGCGCA TTGGCCGATA CCATTTATCA ATTGGTCGGT GGTTTGCGCT CAGGTATGGG CTATGTTGGG GCGCATAATG TTGATGAATT GCGCAAGAAT GCCCGCTTCT CGCGGATTTC GCCCGCAGGT TTGGCCGAAA GCCATCCCCA CGATGTAACG ATCACCAACG AAGCTCCCAA CTACGAGCGC CGCGGCTAG
|
Protein sequence | MPIDWSAKFA FEGLTYDDVL LIPAYSDVLP SQIDVSTWLT REIRLNIPVV SSAMDTVTED RMAIALAREG GLGIIHKNMA PAQQADLVRR VKRSESGMIT DPITLRPEQT IGEAWELMSD YHISGVPITS AAGELVGILT NRDLRFETDP SRKISELMTS EELVTVPVGT TLEQAKQALH QHRIEKVLVV DEHGKLNGLI TVKDIQKQIE HPNATKDAYG RLRVGAAVGA STSELERVRL LVEAGVDVIA VDTAHGHSKA VLDAIARIKQ QYPELQIIGG NVSTGEGARA LIEHGADAVK VGQGPGSICT TRVVSGAGMA QVTAVMECVK AAEEAGVPII ADGGIKYSGD VAKALAAGAH TVMLGGLLAG TDESPGEMIL YQGRSFKSYR GMGSIGAMQQ GSSDRYFQSN QPARKLVAEG IEGMVPYKGA LADTIYQLVG GLRSGMGYVG AHNVDELRKN ARFSRISPAG LAESHPHDVT ITNEAPNYER RG
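
The sequences above are printed in blocks of ten characters, so any check against them has to strip the spaces first. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the gene sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code; alternative start codons are not handled, which does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGCCAATCG ATTGGAGTGC TAAGTTCGCG"))  # prints "MPIDWSAKFA"
```

Passing the full Gene sequence value to `translate` and `gc_percent` should allow a comparison with the Protein sequence and GC content rows of the record.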