Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1032 |
Symbol | |
ID | 5732936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1177640 |
End bp | 1178881 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278167 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001543808 |
Protein GI | 159897561 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCAT TAAAGATTCT GGTGTTGACC AGCTTATTTC CACCGCACTA TATCGGTGGC TACGAGGTTG GTTGTGGCGA TGTGGTTGCT GGTTTACGCG CCAAAGGCCA CGATGTAACG GTGCTAACCA GCACCTACGG CGTGGATCGC GAAGTGCGCG AAGGCTTTAT CGAGCGTTCG CTGACGATCG ATATTCACCC CGTGGCTCAA ACTACTTTAG CCCACCAACG CCGTTTATAC GAACGTGAAC TCCACAATCA GCCACGCATT TTAGCACTTG CCGAAGCGCT AAAACCCGAT ATTGTGTTTG TTTGGAGTTT CTACCAAGCC TCGATGTCGG CAGTGCGTAT GCTCCAAGCA CGCGGCTATC GCACAGTCTT TTTCATCTCC GATCATTGGC CAATTCGGAT TGGAATTTTC GATCCATGGC ACTATATGCC GCGCCATCCT GTGCGTTGGC TTGGGCGCAA ACTGCTTGGC CGAAAGCTTG GGCGCAAAAC TGGTTTTCGC AATCATTTGC CATTGAGCTA TGCCAATGCA ATTTTTGCTA GCGACTTTTT GCTCAAGCAA ACCCAACCCA TCGGCTCCAT GCCCAATGCT CAAGTGATTC GCTGGGGTGT TGATTACGAT TTGTTTGTGC GCCGTGAACG ATCAGCTAGC ATGGGCCAGC GCTTATTGTT TGTTGGTCAA ATTTCCGGCC ACAAAGGCAT GGAAACCTTA CTTGAGGCCT TTGCCTTGGT GCAACAAAAG CATGGCTTTG AACAAGCGAC GCTCTCGATT GTTGGGGCTG GGCTTTCCGC TGAATATACC AGCGAGATGC AGCGCTACGC CGAAACCCTT GGTATCAGCC AAAGCGTTGC TTGGCGGGGC AAATTGCCGC GCCAAGCACT ACCCGACATC TACGCCAGCC ACGATATTTT GATCTTCCCA TCACGTTGGG ATGAGCCATT TAGCATTACT GTCGTTGAGG CGCTGGCAAT GGGGTTGGTG GTAGTGGCCT CGGATACTGG CGGCACAACC GAAATTATCA AGCATGAACA AACTGGCATG GTGTTTGCCC GCGACGATGC TGCAAGCTGC GCCGACCATA TTGCAACGCT CTTTACCCAA CCAGCCTTAG CCCAAAAACT CCACAACCAA GCCACCCAGA TTGTCGATCG CGAGTTTCGC ATGAGCACCA TGATCGATCG GGTTGAAGCA GCGCTCTATG CTGCCTGTAA CAAGGATTCG CACCATGACT AA
|
Protein sequence | MQPLKILVLT SLFPPHYIGG YEVGCGDVVA GLRAKGHDVT VLTSTYGVDR EVREGFIERS LTIDIHPVAQ TTLAHQRRLY ERELHNQPRI LALAEALKPD IVFVWSFYQA SMSAVRMLQA RGYRTVFFIS DHWPIRIGIF DPWHYMPRHP VRWLGRKLLG RKLGRKTGFR NHLPLSYANA IFASDFLLKQ TQPIGSMPNA QVIRWGVDYD LFVRRERSAS MGQRLLFVGQ ISGHKGMETL LEAFALVQQK HGFEQATLSI VGAGLSAEYT SEMQRYAETL GISQSVAWRG KLPRQALPDI YASHDILIFP SRWDEPFSIT VVEALAMGLV VVASDTGGTT EIIKHEQTGM VFARDDAASC ADHIATLFTQ PALAQKLHNQ ATQIVDREFR MSTMIDRVEA ALYAACNKDS HHD
|
| |