Gene Information |
Locus tag | Haur_4820 |
Symbol | |
ID | 5736665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6146730 |
End bp | 6147758 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281985 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001547578 |
Protein GI | 159901331 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1089] GDP-D-mannose dehydratase |
TIGRFAM ID | [TIGR01472] GDP-mannose 4,6-dehydratase |
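The coordinates and lengths in the table above are mutually consistent, which a quick check can confirm. The sketch below assumes 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(6146730, 6147758)
print(length)                  # 1029
print(protein_length(length))  # 342
```

Both results agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.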
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.503744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGAATAG GCATAAACAG CTTCTTGCCA ACAATCTCAA CTCGCAGTAT AGTGCGGGCT ATGACACGAG CATTAATTAC TGGCGCTAAT GGCTTTGTTG GCCAACATCT TGTTCGCTAT TTGCAGCAAG CAACCACTTG GGAATTATGG GCTTTGGGGC GTGAAGCTCA CCCACAACTT CCAACAGTGC TGGCTGATCT GCTTGATCGT TCGGCGGTAG CAACGGCGGT GGCAAATGCA GCCCCCGATC TGGTGGTGCA TTTAGCGGCC CAATCGGCGA TTCCTCAATC GTTTCGTGAT CCCGCCGGCA CGTTTAGTAT CAATGTGCTC GGCCAATTGC ACCTATTTGA AGCGATCAAG TCGGCTCAAC TTGATCCAAT TGTGTTGGTG GTTGGCTCGA ATGCGATGTA TGGCATGGCC CATCGTTCAG GCTTGCCCGC CGATGAAAAC ACCATGCTTT GCCCAGCTGA TCCCTATGCT GTTTCCAAGG CAGCCCAAGA TCTGTTGGCG GGGCAATGGT GGTATAGCCA TGGCCTGAAG GTGATTCGTG CTCGGCCATT TAACCATACT GGGCCTGGCC AACGGGCTGA TTTTGTTGTG CCAGCCTTTG CCCACCAAAT TGCTCGCATC GAAGCGGGCT TGCAGCCGCC AGTGATTCAG GTTGGCAACC TTACGCCACA GCGTGATTTT AGCGATGTAC GTGATGTTGT GCGGGCCTAT CATCTGCTGC TTGAACGAGC GCAGCCTGGC GAAATTTATA ATATTGGCGT AGGTCAGAGT GTCTCAATTC AGTCAATTCT TGATCGCCTC ATTGCACTCA GTGGCCAAAC GATCACGGTC GAAGTTGATC CTCAACGCTT GCGTCCAGTT GATGTGCCAA TCGTGGCGTG CGATGCTAGT CGCTTGCGCA GCCAAATCGG GTGGGAGCCG CAGTATTGCC TTGACGATAC GCTGAGCGAT ATTCTGAATG AATGGCGCAG CCACGTTGCA ACCGAGCAAG AGAGAGTTGG TTCCCACATT AACCAATAA
Protein sequence | MRIGINSFLP TISTRSIVRA MTRALITGAN GFVGQHLVRY LQQATTWELW ALGREAHPQL PTVLADLLDR SAVATAVANA APDLVVHLAA QSAIPQSFRD PAGTFSINVL GQLHLFEAIK SAQLDPIVLV VGSNAMYGMA HRSGLPADEN TMLCPADPYA VSKAAQDLLA GQWWYSHGLK VIRARPFNHT GPGQRADFVV PAFAHQIARI EAGLQPPVIQ VGNLTPQRDF SDVRDVVRAY HLLLERAQPG EIYNIGVGQS VSIQSILDRL IALSGQTITV EVDPQRLRPV DVPIVACDAS RLRSQIGWEP QYCLDDTLSD ILNEWRSHVA TEQERVGSHI NQ
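The gene and protein sequences can be cross-checked by translating the DNA. The sketch below is a minimal translator, assuming the amino acid assignments of translation table 11, which match the standard genetic code; the tables differ only in which codons are permitted as starts. It ignores the special handling of alternative start codons, which is safe here because this gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGAATAG GCATAAACAG CTTCTTGCCA"))  # MRIGINSFLP
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` works the same way, since the function strips the spaces between blocks.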