Gene Information |
Locus tag | Hhal_1537 |
Symbol | |
ID | 4709130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1671337 |
End bp | 1672356 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639856004 |
Product | UDP-glucose 4-epimerase |
Protein accession | YP_001003106 |
Protein GI | 121998319 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
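
The length fields above can be derived from the coordinates. A minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates taken from the Start bp / End bp fields of this record.
start_bp = 1671337
end_bp = 1672356

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1020
print(protein_length)  # 339
```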
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGTCC TGGTCACCGG CGGCACCGGG TACATCGGCA GCCACGCGGC GGTGGCGCTC ATCGAGGCGG GCCATGAGGC GGTGCTCCTA GACAATCAGC ACAACAGCAC CGGCGCTGTC GCATCTCGCA TCGGCCGGAT CACCCGCACA TCGCCGGAAC TGATCCGAGG CGATGTACGG GATGCCTCGA TGCTGGCGGA GTTATTCAGC GCCCGGCGCA TCGACGCGGT GATGCACTTT GCCGGGCTCA AGGCCGTGGG CGAGAGCGTG CAGCGACCGC TCGACTACTA CGAAACCAAT GTCGGCGGTA CCATTCGCCT CTGCCAGGCG ATGGAAGCCG CCGGAGTCCG CAAGCTGATT TTCAGCTCCT CGGCCACCGT ATACGGCGAC CCGGATCGCG TACCAATCCG CGAGGACGCC CCTACCGGCG GGACGACCAA CCCCTACGGC ACTAGCAAGC ACATGGTCGA GCGGATGCTC ACCGACCTGA GCACAGCCAA CCCGGGCTGG TGCATCGGGA TCCTACGCTA CTTCAATCCC GTCGGAGCCC ATAAGAGTGC GCTGATCGGC GAATCGCCGT CCGGCATCCC CAACAATCTC GTGCCGTATA TCGCTCAGGT GGCCGCCGGG GAGCGCGAGC ACCTGCAGGT CTTCGGCGAC GACTACCCGA CCCGGGACGG CACGGGGGTG CGGGACTACA TCCACGTCGT CGATCTGGCG GCCGGGCATG TACGCGCGCT GGAGTACCTT GACGCCCAAC CGGGTCAGCA CGCCTGGAAC CTCGGCCGCG GCGAGGGGCA CTCGGTCTTG GAGGCCGTGC GTGCCTTCGA GCAAGCGAGC GGGTGCTCCA TCCCGTACCG GGTTACGGAG CGGCGCCCCG GCGATGTCGC CGAGTGCTGG GCCGACCCAA GCAAGGCCGA GCGGGAGCTC GGCTGGCGCG CCGAGCGAGG CCTGGCCACG ATGATGGAGG ATGCCTGGCG CTGGCAGCGT CAGGGCTCGG ATACCGAGAC AGCTTCCTGA
|
Protein sequence | MRVLVTGGTG YIGSHAAVAL IEAGHEAVLL DNQHNSTGAV ASRIGRITRT SPELIRGDVR DASMLAELFS ARRIDAVMHF AGLKAVGESV QRPLDYYETN VGGTIRLCQA MEAAGVRKLI FSSSATVYGD PDRVPIREDA PTGGTTNPYG TSKHMVERML TDLSTANPGW CIGILRYFNP VGAHKSALIG ESPSGIPNNL VPYIAQVAAG EREHLQVFGD DYPTRDGTGV RDYIHVVDLA AGHVRALEYL DAQPGQHAWN LGRGEGHSVL EAVRAFEQAS GCSIPYRVTE RRPGDVAECW ADPSKAEREL GWRAERGLAT MMEDAWRWQR QGSDTETAS
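
The sequence fields can be cross-checked against each other in plain Python. This is a minimal sketch, assuming the spaces between 10-character blocks are display formatting only. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short. The codon assignments are those of NCBI translation table 11, which are the same as in the standard code.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
prefix = clean("ATGCGAGTCC TGGTCACCGG CGGCACCGGG")
print(translate(prefix))  # MRVLVTGGTG
print(round(gc_fraction(prefix), 3))
```

Running the same functions on the full 1020 bp gene sequence is one way to check the listed protein sequence and GC content.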