Gene Information |
Locus tag | Haur_0401 |
Symbol | |
ID | 5731969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 471172 |
End bp | 472380 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277524 |
Product | nucleoside-diphosphate-sugar epimerase |
Protein accession | YP_001543180 |
Protein GI | 159896933 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
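The coordinate and length fields in this table are related by simple arithmetic, and a short check can confirm they agree. The Python sketch below assumes 1-based, inclusive coordinates (the values in this record are consistent with that convention) and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(471172, 472380)
print(length)                           # 1209
print(expected_protein_length(length))  # 402
```

Both results match the Gene Length (1209 bp) and Protein Length (402 aa) fields reported above.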
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAAC GAATATGGAT AGTGTTGTGT GGCTTGATGC TTGGCCAGCG CTGCTGGAAA TGGTGGCAAG TATGGCGCTT TTTTGGCAAG CCTACTCCTA CTGCTCAACA CGAGCCTGCG ACCACCTTGG TGAGTTTGCT CCAGCCAATT TTGAGCGGCG ACCCGCATTT GGCCATATGT TTACGCGCCA ATTTGAATGC GCCAAGCAGC TATAAGCGCG AATGGCTATG GTTAATTGAT GATGATGATC GGATCGCTCA ACAGCTTTGC TATGGATTGC AAGCAGAATA TGCCGAGCAA ACGATTCGGA TTATCAGTTT GCCAGCCCCA GCTGAGCGGG TTAATCCCAA AACCTTCAAG CTAATTGCGG GATTACAACA AGCCCAAGGC CAAATTATTT GCGTGCTCGA CGATGATACC AGCCTGCCAG CCTATGGCTT GGAACAATGT TTGCCATGGC TCGATCAAGC GGGAGTAGGC TTGGCCTTTG GCTTGCCCTA CTATCGCTCG TTCGATAATA CTTGGTCGAG TTTGGTGGCA TTGTTTGTTA ATAGCAATAG TTTGCTGACC TATGTACCCT ATAGCCAAGT TAGCGAGCCA TTTACGATCA ATGGAATGTT CTATGCCATG CGCCGCGACG TTTTGGAGCA ATTGCATGGT TTTGTTGGCT TAGAGCATAT TTTGGCTGAC GATTTTGCGG TGGCGCAACG GGTGCAACAG GCAGGCCTGC GCTTGCAGCA AACCAGCATG CGCCATGCAA TTCGTACTAC CGTGACCAAT GCCCAACGCT ATCGCAGCCT GATCCAACGC TGGTTTATCT TCCCACGTGA ATCGCTGCTA CGCCATTTGA ATCGGCGCGA ACGCAGCTTG CTGTTTTGTT TGGCGATTGT GCCCACGCTG TTTCCATTGG TTTTGGCGAT CGTGAGCGTC TTGCGACCAA GCCAACGTCA ACGCTGGTTT GCTGCAACGT ATACTTTGCT TGGCCTGATC AGTTTTATTC AGATTGATCA AGCCTACCTA GAGCAAGCCA CGCCACGGCG CTATTGGCTG TTTGTACCAT TTTTAGAATT GCTCATTCCA GTGCAATTGA TTCAAGCCTT GCTTGCGCCT CAGCGCATTG TTTGGCGCGG CCATGTGATG GATGTGGAAA AAGGCGGCGC ATTTCGCTTT GTGCAACGCA GGGATGATGG TTCGGCGAAT GGGTTCTAG
Protein sequence | MIKRIWIVLC GLMLGQRCWK WWQVWRFFGK PTPTAQHEPA TTLVSLLQPI LSGDPHLAIC LRANLNAPSS YKREWLWLID DDDRIAQQLC YGLQAEYAEQ TIRIISLPAP AERVNPKTFK LIAGLQQAQG QIICVLDDDT SLPAYGLEQC LPWLDQAGVG LAFGLPYYRS FDNTWSSLVA LFVNSNSLLT YVPYSQVSEP FTINGMFYAM RRDVLEQLHG FVGLEHILAD DFAVAQRVQQ AGLRLQQTSM RHAIRTTVTN AQRYRSLIQR WFIFPRESLL RHLNRRERSL LFCLAIVPTL FPLVLAIVSV LRPSQRQRWF AATYTLLGLI SFIQIDQAYL EQATPRRYWL FVPFLELLIP VQLIQALLAP QRIVWRGHVM DVEKGGAFRF VQRRDDGSAN GF
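The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following Python sketch shows one way to join the blocks and run basic consistency checks between a gene sequence and its protein sequence. It is a minimal illustration under stated assumptions: the start-codon set covers only the most common bacterial starts (translation table 11 permits a few more), and `join_blocks` and `check_cds` are hypothetical helper names.

```python
# Assumption: only the most common bacterial start codons are accepted here.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def check_cds(dna_blocks: str, protein_blocks: str) -> list:
    """Return a list of human-readable problems; an empty list means all checks passed."""
    dna = join_blocks(dna_blocks)
    protein = join_blocks(protein_blocks)
    problems = []
    if len(dna) % 3 != 0:
        problems.append("gene length is not a multiple of 3")
    if dna[:3] not in START_CODONS:
        problems.append("gene does not begin with an accepted start codon")
    if dna[-3:] not in STOP_CODONS:
        problems.append("gene does not end with a stop codon")
    if len(dna) // 3 - 1 != len(protein):
        problems.append("protein length does not match codon count minus stop")
    return problems


# Toy input in the same block layout as the record.
print(check_cds("ATGAAA TTTTAG", "MKF"))  # []
```

For this record, the gene sequence begins with ATG and ends with TAG, so the start and stop checks would be expected to pass when the full Gene sequence and Protein sequence fields are supplied.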