Gene Information |
Locus tag | Haur_0376 |
Symbol | |
ID | 5732227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 449487 |
End bp | 450467 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277499 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001543155 |
Protein GI | 159896908 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATCT TAGTCACAGG CGCTACTGGT TTTTTGGGTG CGCACACCGC GTTGGCTCTG CAAAAAGCTG GGCATACGGT GCTGGGTTTG GGCCGCCGCT GGGAGCATGT TCCACAACTA CTAGCCGCTG GCATTCAGCC AATCAAGGCC GATCTGCGCG ATCGCGCTAC GTTGATTGCG GCCTGTGCTA GTTGCGATGT GGTAGTGCAT AGCGCGGCAC TTTCGGCTCC TTGGGGCAGT CGCAGCGATT TTCAAACCAT TAACGTTGAT GGTACAGCCA ATGTACTAGC TGGTTGTGCT GCACAGAAGG TCGGACGCTT GGTGTTTATT TCATCACCAA GCGTTCTTTC CAATGGCCGC GATCAGTTTG ATTTGCTCGA TACAATGCCC TATCCAGCGC GGCCTATTTC GCTGTATTCG GCCAGCAAAC AGCAAGCAGA ACAATTGGTG CTCAAGCATT CCACGCCCAG CGTGATTTTG CGGCCTAAGG CAATTTTTGG CGAGGGCGAC CAAGCCTTGT TGCCACGAAT TATCGCCGCA GCCCGCGCTG GCCGCTTACG TCAATTTGGC AACGGACAAA ATTTGGTTGA TTTGACCTAT GTTGCCAATG TAGTGCATGC AATTGAGTTG GCGTTAACGG CTCCAGCAGC GCTTGGCAAG TGTTATACAA TCACCAATGG CGAACATCCC CAGCTCTGGG CGGTGATTCG GCGGGTACTG GCTGAGTTGG GTTTGCCTAG CCAGTTGCGG CCAATGCCCT TGTCGCTAGC CTTAGCCGTG GCACGAATCA TGGAAAGTAT CAGCTTGCTA ACACGGCGTG AGCCATTACT AACGCGTTAT AGCGTCTTGG CGCTGGCCCG CAGCCAAACT CACTCCTTGG TTGCAGCCCA ACACGATTTG GGTTATCAGC CATTGATCAG CCTTGAAACG GGGATTCAAC GCACCATCGC CGCGCTTAAA CAACCAACCA AGGGGGTTTA A
|
Protein sequence | MQILVTGATG FLGAHTALAL QKAGHTVLGL GRRWEHVPQL LAAGIQPIKA DLRDRATLIA ACASCDVVVH SAALSAPWGS RSDFQTINVD GTANVLAGCA AQKVGRLVFI SSPSVLSNGR DQFDLLDTMP YPARPISLYS ASKQQAEQLV LKHSTPSVIL RPKAIFGEGD QALLPRIIAA ARAGRLRQFG NGQNLVDLTY VANVVHAIEL ALTAPAALGK CYTITNGEHP QLWAVIRRVL AELGLPSQLR PMPLSLALAV ARIMESISLL TRREPLLTRY SVLALARSQT HSLVAAQHDL GYQPLISLET GIQRTIAALK QPTKGV
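
The length fields in this record can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends in a stop codon that is not counted in the protein length. The function names (gene_length, protein_length, gc_percent) are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that does not appear in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and any other characters (such as the block spacing used in
    the Gene sequence field) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Values copied from the Gene Information fields of this record.
    length = gene_length(449487, 450467)
    print(length)                  # compare with the Gene Length field
    print(protein_length(length))  # compare with the Protein Length field
```

To check the GC content field, pass the full Gene sequence string to gc_percent and compare the rounded result with the reported value.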