Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1027 |
Symbol | |
ID | 5732931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1172859 |
End bp | 1173992 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278162 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001543803 |
Protein GI | 159897556 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGC GACTCTGGTT CGTCACCGAT AGCACAGCCC TTGGGGGAGC CGAGGGCTAT TTAGAAACCC TGTTGCTCAA TGCCGACCAA CAGCAATTTG AACTTGGCTT GTTGCTGCCG CCGCGCCCAG CCACCCAACC CTTGATCGAT CGAGCCAAAG CCCATGGAGC GAGCATTGCA ACCTTGGATG TGGTGCACGA GCATGGGTTA TCGCTGAAGG CAGTCAATCA AGCGGCACGA CTTTTTCGCC AATTACAGCC TGATATTGTG CATTTCGTTG TGCCTTCACC GCGCCGCGCC GCAGAATTGG TGCTTGGGGC GGCCTTGGCG CGAGTGCCAC GCCGAGTTAT CACCTTCCAG TTGGTCACGC CAATTCCCCG CTTCAATTGG CTTTCCCATC ATCTGCGGCT GCTCAATCGG CGTTGGCAAT ACGCCACCTT ACACGCTGGC ATCGCGGTTT CGCAAGGCAA TGCTCAATTG TTATTAGAGC AATTTGGCTT TCCCAAGCGG CGTTTGCATA CCATTTATAA TGCGGTTGAT AGCCAACGCT GGCAGCCGCA ACCGCGCGAT CCTGCAACTC GTGCAGCGTG GCAAATTCCC GCCGATGTGC CACTTTTAGG CGTGGTTGGG CGTTTGAGCC GCCAAAAAGG CCACCAAATT TTATTCGAGG CCTTACCAAC GTTGTGGCAA GCGCAGCCGA ATTTGCATGT CGCATTAATC GGCGAGGGCG ATTTAGCTGA CGAATTACGT CAAGCTGCCC AACAACTACC CAAGCCAAAT CAAGTGCATT TTGTCGGCCA GCAAACTAAT ATGCCTGCGG CTTTGGCCGC ACTTGATGTT TTTGTCTTGC CATCGCTGTA CGAAGGCTTA TCGTTTGCCT TGCTCGAAGC CATGGCCAGT GGGCAAGCAA TTGTTGCCAG CAGCACCGAT GGTACACGCG AAGCAATCAG CGATGGAATC CAAGGTCTAT TGGTTGAGCC AGGCCAAAGT GCTGCGCTGG CGCAGGCAAT CGGGCGCATG CTCAGCGATC AATCATTAAA CCAAGCCTGT CGCCAAGCCG CCCGCCAACG CATTCAACAA CAATTTGAGT TGCAAACGAT GTTGCAACGC ACGTTTGATT TGTATCGAGC ATAG
|
Protein sequence | MKRRLWFVTD STALGGAEGY LETLLLNADQ QQFELGLLLP PRPATQPLID RAKAHGASIA TLDVVHEHGL SLKAVNQAAR LFRQLQPDIV HFVVPSPRRA AELVLGAALA RVPRRVITFQ LVTPIPRFNW LSHHLRLLNR RWQYATLHAG IAVSQGNAQL LLEQFGFPKR RLHTIYNAVD SQRWQPQPRD PATRAAWQIP ADVPLLGVVG RLSRQKGHQI LFEALPTLWQ AQPNLHVALI GEGDLADELR QAAQQLPKPN QVHFVGQQTN MPAALAALDV FVLPSLYEGL SFALLEAMAS GQAIVASSTD GTREAISDGI QGLLVEPGQS AALAQAIGRM LSDQSLNQAC RQAARQRIQQ QFELQTMLQR TFDLYRA
|
| |