Gene Information |
Locus tag | Haur_1207 |
Symbol | |
ID | 5733100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1390320 |
End bp | 1391339 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278347 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001543983 |
Protein GI | 159897736 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
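
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions, although they agree with the stated values. Variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1390320
end_bp = 1391339

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.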
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0228638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGC GCGTGTTGGT TACCGGAGCG AATGGTCATT TAGGCTCAGT TTTGGCTCAA ATGTTGGTGG AGCGTGGGGT TGATGTGCGG GCCAGCGTGC GCAATCGCAG CCAAATCAAG CCGCAACTGG CCTATGAGCA AGTGTATGCC GATTTGATGG ATATGGATTC GCTGCAACAA GCCTTGGTTG GAGTTGATAC ACTCTATCAA GTAGCGGCGG TATTCAAGCA TTGGTCGCGC AATCCGCAAC GCGAGATCAT TCAACCAAAC GTTGAGGGCA CACGCAATAT TTTGCGAGCA GCGGCGCAGG CGGGGGTCAA ACGGGTGGTG TATGTCAGTT CAATCGCGGC AGTCGATAAA AATAATCCTC AGCGCCAAAT CCCAGCAGAT GAAACAACCT GGAACCAATA TACCTATGGT AATCCGTATT ATCAATCGAA AATCGCCTCG GAACAATTGG CGTGGAAGTT GGCCAAGGAA TATGGCTTGG AGATGATGGC GGGCTTGCCA GGCACGATCA TCGGCGATCC CAATGGCCGT ACCACACCAT CGTTGGGGAT TTTGGAGTTG GTGTTGAGCA ATAAAATGCC ACTAGATATC AATATGGATT TTAATTTTGT CGATGTGGCT GATGTTGCCG AAGGGCTGAT CGCTGCTGAG CGCCAAGGTC GGGCAGGCGA ACGCTATATT TTGGCGAATG ATCAATCGTT ACCATTGCGA CGAATCTTTG AAATCGCCCA AGAGTTTAAT CCCAAAATCA AAGTGCCAAT GCGGGTATCT AAAGGCATTA CCAATGTCGT AGCTGGCATG ATGGAGTTGG TGGCGAATGT CACAGGCCGC GAACCAATGA TTTTGCGCAG CCAAGTTGGG CTATATTGCG GCATAGAACA ACGCTTATCA ATTGCCAAAG CCAAACGTGA GTTGGGCTAC AATCCTTTGC CTGCGGTCGA TGCCGTCCGT AAGACTTTTC AGATTCTGGC TCGGCAAGCA CCAAAGCAGG TTGCCTTGGG TCAAGTCTAG
|
Protein sequence | MSKRVLVTGA NGHLGSVLAQ MLVERGVDVR ASVRNRSQIK PQLAYEQVYA DLMDMDSLQQ ALVGVDTLYQ VAAVFKHWSR NPQREIIQPN VEGTRNILRA AAQAGVKRVV YVSSIAAVDK NNPQRQIPAD ETTWNQYTYG NPYYQSKIAS EQLAWKLAKE YGLEMMAGLP GTIIGDPNGR TTPSLGILEL VLSNKMPLDI NMDFNFVDVA DVAEGLIAAE RQGRAGERYI LANDQSLPLR RIFEIAQEFN PKIKVPMRVS KGITNVVAGM MELVANVTGR EPMILRSQVG LYCGIEQRLS IAKAKRELGY NPLPAVDAVR KTFQILARQA PKQVALGQV
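
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be joined back into a contiguous sequence and given a basic reading-frame check follows. The helper names `strip_blocks` and `looks_like_cds` are hypothetical. The check assumes an ATG start codon (translation table 11 also permits alternative starts, which this check would reject) and the stop codons TAA, TAG and TGA.

```python
# Stop codons used by translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_blocks(display: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence."""
    return "".join(display.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (assumed), stop codon at the end."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input built from the first and last few bases of the gene, not the full sequence.
toy = strip_blocks("ATGAGCAAG GTCTAG")
print(toy, looks_like_cds(toy))
```

To check the full gene, the complete Gene sequence value could be pasted in place of the toy string.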