Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1426 |
Symbol | |
ID | 5733334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1645635 |
End bp | 1646585 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278564 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001544198 |
Protein GI | 159897951 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCAT TACAAGTTAT TTTTGGCACA GGCCCAGCTG GTAGCACGCT TGCTGAAGAG CTTATTAGTC AAGGCCAGCG CGTTCGCTGT ATCAATCGTA GTGGTAAGGC CGATCTGCCA GCAGCGGTTG AGGTCGTCGC TGGCGATTTG CTCAATCAAG GCCAAGTTAA TGAGTTGTGC CAAGGTGCAA AGGTGGTTTA CCACTGCGCA AACGTACATT ATGCTGAACA GACCAAGATT ATGCCGCAAT TTCAGCAAAC GATTATGCAG GCTAGCGCCG CTGCTGGCGC TCGCTTGGTG GTGCTCGATA CGCTTTACGT CTATGGTTCG AGTCAAGGCC GACCAATGAC TGAGGCCACG CCATTTGCGC CGCATACGCG TAAAGGTCGC ATGCGAGCTG ATTTAGTTGA AACCTATTTG GCAGCGCATC GGGCTGGTAC GCTTGAAGTC ACGCTGGGTC GAGCTGCCGA TTTCTTTGGG CCGCGTGTGC TCAACTCAAG CTTAGGCGAT CGGGTGTTTC CGATGCTGTT GCAGCACAAA CCAGCTCAAT TGTTGGGCAA TATCGATTTG CCGCATAGTT TTAGCTATAT CGGCGATGTC GCCCGTGGCT TGGCATTGTT GGGCCAACAT CCGGCGGCGC TAGGCCAAGC TTGGCATTTG CCCGTCATGC CAGCATTAAC CCAACGCGCC ATGCTCCAAA CAATCGGCAC ATTGTTGGAC TACCCGGTGC GCAGCATTGC TTTGCCTAAA ATGGCGATTC AGGCGTTTGG CCTGATGGAT TCGTTTATGC GCGAGTTTGT TGAGATGTTC TATCAATATA CTGAGCCACA AATTGTCGAT GCCCAAGCGA TCGAACGCCA ACTTGGCTTG GCTGCCACGC CTTTGGAGCA AGCGTTGCAG ACAACGATTA ATTGGTATCG TGGTCAAAAT CAGCAAAAAG CCGCTGCCTA A
|
Protein sequence | MSSLQVIFGT GPAGSTLAEE LISQGQRVRC INRSGKADLP AAVEVVAGDL LNQGQVNELC QGAKVVYHCA NVHYAEQTKI MPQFQQTIMQ ASAAAGARLV VLDTLYVYGS SQGRPMTEAT PFAPHTRKGR MRADLVETYL AAHRAGTLEV TLGRAADFFG PRVLNSSLGD RVFPMLLQHK PAQLLGNIDL PHSFSYIGDV ARGLALLGQH PAALGQAWHL PVMPALTQRA MLQTIGTLLD YPVRSIALPK MAIQAFGLMD SFMREFVEMF YQYTEPQIVD AQAIERQLGL AATPLEQALQ TTINWYRGQN QQKAAA
|
| |