Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1038 |
Symbol | |
ID | 5732942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1184626 |
End bp | 1185663 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278173 |
Product | galactose-1-phosphate uridylyltransferase |
Protein accession | YP_001543814 |
Protein GI | 159897567 |
COG category | [C] Energy production and conversion |
COG ID | [COG1085] Galactose-1-phosphate uridylyltransferase |
TIGRFAM ID | [TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTCA GTGATACGCC CCATCGCCGC TACAACCCGC TGACTGACGA ATGGGTCTTG GTTTCGCCAC ATCGAACTGC CCGCCCATGG CAAGGCCAGG TCGAAAAAAC CATTCCCGAC CAGCGCCCAG CCTTTGATCC GCAATGCTAC CTTTGCCCTG GAGTAACGCG GGCCAATGGC GAGATCAACC CAGTTTATGC TAGCACCTTT GTGTTCCCTA ATGATTTTGC CGCCTTGCTG CCCGATAGCC CGAATGCAGC GCTTGATGAT GGCCTGTTTC AAGCCCATAG CGAACGTGGT ATTTGTCGGG TGATTTGTTT CTCGCCACGT CACGATTTGA CCTTGGCCGA AATGGAGATT CCCGATATTC GTTTGGTCAT CGATTTGTGG GCTAGTCAAT TTAGCGAGCT AGCTGCGATC GATTGGATCA AGCATGTCGA GATTTTTGAA AATCGCGGCG CAGCGATGGG GGCAAGCAAT CCACACCCGC ATGGCCAAAT TTGGGCCAAC GAAAGCATTC CAACGCTGGT CGCCACCGAA CAGCGTAGCC AAAAGGCTTA CTTTGCTCAA CACCAACGCC CATTACTGAT CGATTACGTG GAGCAAGAAT TAGCGCGGGG CGAACGGGTG GTCTATGCCA ACGATTACTG GGCAGCGGTC GTGCCATTTT GGGCAGTTTG GCCTTATGAA ACCATGCTTT TGCCACGTCG GGCGGTGAGC ACCTTGGCCG AATTAAGCGA AGCTGAACGT GATGGCCTCG CCGATTTACT CAGCCATACC CTGATTCGCT ACGATAATTT GTTCCAAACC TCGTTTCCCT ATACGTTTGG CTGGCACAAT GCCCCCTGCG ATGGCGAACA ATATCCCCAT CATGTGGTGC ATGCCCATAT TTACCCGCCA TTGTTACGCT CGGCCACGGT GCGCAAATTT ATGGTTGGCT ACGAAATGTT GGCCCAACCA CAGCGCGACT TAACCGCCGA AACCGCCGCG CAACGGCTAC GCGATTTACC CAGCCTGCAT TGGACTAAAG CTGAGTAG
|
Protein sequence | MNLSDTPHRR YNPLTDEWVL VSPHRTARPW QGQVEKTIPD QRPAFDPQCY LCPGVTRANG EINPVYASTF VFPNDFAALL PDSPNAALDD GLFQAHSERG ICRVICFSPR HDLTLAEMEI PDIRLVIDLW ASQFSELAAI DWIKHVEIFE NRGAAMGASN PHPHGQIWAN ESIPTLVATE QRSQKAYFAQ HQRPLLIDYV EQELARGERV VYANDYWAAV VPFWAVWPYE TMLLPRRAVS TLAELSEAER DGLADLLSHT LIRYDNLFQT SFPYTFGWHN APCDGEQYPH HVVHAHIYPP LLRSATVRKF MVGYEMLAQP QRDLTAETAA QRLRDLPSLH WTKAE
|
| |