Gene Information |
Locus tag | Haur_1247 |
Symbol | |
ID | 5733125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1455013 |
End bp | 1456161 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278387 |
Product | xylose isomerase |
Protein accession | YP_001544023 |
Protein GI | 159897776 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2115] Xylose isomerase |
TIGRFAM ID | [TIGR02631] xylose isomerase, Arthrobacter type |
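The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Haur_1247).
length = gene_length(1455013, 1456161)
print(length, protein_length(length))  # 1149 382
```

Under these assumptions the recomputed values agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields.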
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAGC ATAAATTTAG CTTTGGATTA TGGACAGTTG GCAATGTTGG GCGTGACCCA TTTGGTGAAC CAGTGCGCAA AACCCTCTCG CCAGTTGAGA TTGTGCATTT GTTGGCTGAA GTTGGAGCAT GGGGCGTAAA TTTTCACGAT AACGATTTAG TGCCGATTGA TGCTAGTGCC AGCCAAAAAG CCCAAATTAT TGCCGATTTC AAACAAGCAC TCAAGGATAC CGGCTTGGTT GTGCCGATGG CAACCACCAA TTTATTCGGT CACCCAGCCT TTAAAGATGG CGCATTTACC AGCAACGATC CGGCTGTGCG GGCTTATGCC TTGCAAAAAA CCATGGCAGC CATGGATTTG GGCGCTGAAT TTGGCGCGAA AACCTATGTG TTTTGGGGTG GCCGTGAAGG CAGCGAAACC GATGCCTCGA AAAATCTGCT CGAAGGCTTG AAGTGGTTCC GCGAAGCGCT CAACTTCTTG TGCGACTATA GCAATGCCCA AGGCTATGGC TATCGTTTTG CCTTGGAAGC CAAGCCCAAC GAACCACGCG GCGATATCTT CTTGCCTACC ACCGGAGCCA TGTTGGGCTT TATTCAGACC CTCGATCAGC CCGAGATGGT GGGGGTAAAT CCCGAAGTTG CCCATGAAAC CATGGCAGGC TTGAATTTTA CCCATGCCGT GGCCCAAGCG CTTGATGCTG GCAAACTGTT CCATATCGAC CTCAACGATC AAAATAGTGG TCGCTACGAC CAAGATTATC GCTTTGGAGC ACAAAACTAC AAACAAAGCT TTTTCTTGGT GAAATTGCTG CAAGATGCTG GCTACGATGG CCCATTGCAC TTCGATGCTC ACGCTTACCG CAGCGAAGAT CTTGAGGGAG TTAAAGATTT TGCCCGTGGT TGTATGCGTA CCTACCAAAT TTTGGCCGAA AAAGTTCAGC GCTTCAATGC TGATGCTGAA ATTCAAGCCT TGTTAGCTCA AATCAACGCC CCAAATGCCG ATGTTGAGCA ATTCCGTGGT GGCTACACGC CAGAACGTGC CGCTGCGCTC AAAGCCTATC AATTTGATCG TCAAGCACTT GGCGAACGCG GCCTCGGCTA CGAAAAGCTT GATCAACTAA CCTTCGAGTT GTTGATGGGA GCCAGATAG
|
Protein sequence | MTQHKFSFGL WTVGNVGRDP FGEPVRKTLS PVEIVHLLAE VGAWGVNFHD NDLVPIDASA SQKAQIIADF KQALKDTGLV VPMATTNLFG HPAFKDGAFT SNDPAVRAYA LQKTMAAMDL GAEFGAKTYV FWGGREGSET DASKNLLEGL KWFREALNFL CDYSNAQGYG YRFALEAKPN EPRGDIFLPT TGAMLGFIQT LDQPEMVGVN PEVAHETMAG LNFTHAVAQA LDAGKLFHID LNDQNSGRYD QDYRFGAQNY KQSFFLVKLL QDAGYDGPLH FDAHAYRSED LEGVKDFARG CMRTYQILAE KVQRFNADAE IQALLAQINA PNADVEQFRG GYTPERAAAL KAYQFDRQAL GERGLGYEKL DQLTFELLMG AR
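The relationship between the two sequence fields can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It assumes the amino acid assignments of translation table 11 match the standard genetic code, and it does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Whitespace is removed first because the record prints the sequence in blocks of 10 bases.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Raises KeyError for codons with ambiguous bases such as N.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two 10-base blocks and the last nine bases of the gene sequence above.
print(translate("ATGACACAGC ATAAATTTAG"))  # MTQHKF
print(translate("GCCAGATAG"))              # AR
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field under the assumptions above, and `gc_percent` gives a value to compare against the GC content field.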