Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4489 |
Symbol | |
ID | 5736340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5748525 |
End bp | 5749778 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281652 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547249 |
Protein GI | 159901002 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTACA CCATCCTTAG CATTGCCTCA ACCTCGTTTT TTGCAGATTA CGGCGCACAC GTGCGGATTT GGGAAGAAAC TCGCGCCCTG CAAAAATTGG GTCATCGCAT TGTTATTGCA ACCTATCATA ATGGCGATAA TATGCCAGGC TTTGAAATTC GCCGCTCATG GGACGTGCCA TGGGTTAAGC GCACGATGGT CGGGGCTTCG CATCATAAAA TGTATTTGGA TGTGGCACTT TCGTGGCGAG CTTTACGGGT TGCCATGGAA ATCAAGCCCG ATTTAATCCA TGCCCATATT CACGAATCGG CTTTGATTGG CAGTGTGCTT TCGCGCATGT TCAAGATTCC GCTGGTGTTC GATTATCAAG GCAGCCTCAC TGCCGAAATG CTTGATCATG GCTTTCTTAA ACGCGATGGC ATGTTTTATA AGCCATTCCA CTGGCTCGAA GATAAAATTA ATCGCACTGC CGACGCTGTT TTGACCAGCT CCTTCAACGC TGCCAATATG CTCCGCGATG ATTGGAAGTT TCCGGCGGAG CGGCTTTACA CTGTGCCAGA TAGCGTTAAC ACCGATCGCT TCAAGCCCTT CGATGGCTCG GCAGAGTGGC ATGCTGAGCG TGAACGCATT CGTAGCGAGC TAGGAATTCC GGCAGGCCGC AAGATCGTGG CCTATTTAGG CTTGCTGGCA GCCTATCAAG GCACGAATGT CTTGCTTGAA GCCGCCCAAA TTATTCGCCA ACAACGCGAT GATGTGCATT TCTTGATTAT GGGCTACCCC GATGTGCGTT CGTATTTGGC CTTGGCTGAA TCGCTGGGCG TTGCTGATAT TGTGACCATG CCAGGCCGCA TTTTATACAA AGATGCCCAT GCCTACTTGG CCTTAGGTGA TGTGGCGGTT GCTCCCAAAA TGTCGGCAAC CGAAGGCGCT GGCAAAATCC CCAATTATAT GGCAGTTGGC TTGCCTGTGA TCACGTTTGA TACGCCAGTT AGCCATGAAA TTTTGGGCGA TGCTGGGGTG TATGCCAAGT TTGGCGATGC TCAATCGCTA GCCGACGAAA TTCTAGGTTT GATCGATAAC CCTGAGCGAC GGCATAATTT GGCGCAAACG GTGCGCACTC GGGCAGTCAA CGAACATTCT TGGGAACTCG CCGCCCGCCA GATCGAAGCA ATTTATGAGC GGGTGTTGGC CAAACGGGCA GGCAATCCGC TACCCGAATT TCCAACGAGC TTGCAACGCG AACAAGGTTC ATAA
|
Protein sequence | MGYTILSIAS TSFFADYGAH VRIWEETRAL QKLGHRIVIA TYHNGDNMPG FEIRRSWDVP WVKRTMVGAS HHKMYLDVAL SWRALRVAME IKPDLIHAHI HESALIGSVL SRMFKIPLVF DYQGSLTAEM LDHGFLKRDG MFYKPFHWLE DKINRTADAV LTSSFNAANM LRDDWKFPAE RLYTVPDSVN TDRFKPFDGS AEWHAERERI RSELGIPAGR KIVAYLGLLA AYQGTNVLLE AAQIIRQQRD DVHFLIMGYP DVRSYLALAE SLGVADIVTM PGRILYKDAH AYLALGDVAV APKMSATEGA GKIPNYMAVG LPVITFDTPV SHEILGDAGV YAKFGDAQSL ADEILGLIDN PERRHNLAQT VRTRAVNEHS WELAARQIEA IYERVLAKRA GNPLPEFPTS LQREQGS
|
| |