Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1760 |
Symbol | |
ID | 5733648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2046869 |
End bp | 2048155 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278903 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001544531 |
Protein GI | 159898284 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCGC TGCATGTGCT GTTACCAACC GACGTATTTC CACCACGCAC CGGTGGCGCA GGCTGGAGTA GTCATGCCTT GGCGTTGGCG TTGCTCGAAC GCGGCCATCA GGTGACGGCA TTGGTTCCCA AGGCTGGAGT GCGTGGGCTG CATCGGCGGG TTGAAGCAGG TGTTCCGGTG GTTGAGGTCG GCTATCAACC TGCTCGTTTG CCGTTTGTGG CCAATTGGTC GCGCTTTGAA CTGTTCTGGC CCCAATTTGC CCAAGCAATT GTCAAGACGA TCGGCAAGCA GCGCGAACAC GTAATCATTC ATGGACAGCA CGTCCAAGGG ATTGGTGCAG CCGTCTTGGC GGGGCAACAG CTGAACATTC CGGTTGTGGC AACCGTGCGT GATCATTGGC CAAATCATTA TTTTGGCACA AATTTACATG GCGATCAATT CCCGCTTGAA GATTTTGATT GGGCGGCGGC GGCCACCGAT TTGGTTGCCC GCCGCAAACC ATTACTTGGG ATACTTTCGC TCTTGGCCTT ACCCTATGTC CAAGCGCATA TGCAACGGCG ACGACAATTG TTACAAGCCT GTGATGCGGT TATTTCGCTG AGTAGCTATA TCACCCAACG GCTTAGCAGC GTAGTTGCAC CCGCCAAATT ATGGCCAATT CCCAACATGG TCAATGTAGC AGCGATAAGC AAAGTGCTAG CAACGCCTGC GCAAACGACC ATTAATCAGC CATATATCTT GTTTACTGGC AAATTGGCTC GCAACAAAGG AGCCTATTTA CTACCCGAAA TTATGGCTAG TTTTCGGGCA GCTGGTGGTG AGGCAACCCT CGTCATCGCT GGCGGCAAGA ATTCAGAGCT GGTTGCAGCA ATTCAAGCTC AAGGCATTGA GGTGCTAGCC CTCGATTGGG TTGAGCACGA TGAGGTTTTG CGCTTGATGG CGGGGGCCAA GCTCTTAATT TTCCCCTCAA CGTGGGGCGA ACCACTCAGT CGGGTTTTGC TTGAAGCTTG TGCTGTGGGC ATGCCGATTG TGGCAATGGC AACGGGCGGC ACACCGGATC TAATTCAGCA TGGCCTGAAT GGCTATCTGG CTCGCTCAGC CAAGCAACTA GGTGTATTGG CGGCAGAATT GCTGCACAAC CCGCAACGAG CCGAGCAATT GGGCCAAGTT GCCTATCAAA CCGCCCAAAC CCGTTTAGCT AGCACAGTGG TTGCTGAGCA AGTGGAACAA CTCTATTGGA CACTACTTAC CCAACAACCA CAGCGTGCGC TGACTGGGTA TGATTAA
|
Protein sequence | MKPLHVLLPT DVFPPRTGGA GWSSHALALA LLERGHQVTA LVPKAGVRGL HRRVEAGVPV VEVGYQPARL PFVANWSRFE LFWPQFAQAI VKTIGKQREH VIIHGQHVQG IGAAVLAGQQ LNIPVVATVR DHWPNHYFGT NLHGDQFPLE DFDWAAAATD LVARRKPLLG ILSLLALPYV QAHMQRRRQL LQACDAVISL SSYITQRLSS VVAPAKLWPI PNMVNVAAIS KVLATPAQTT INQPYILFTG KLARNKGAYL LPEIMASFRA AGGEATLVIA GGKNSELVAA IQAQGIEVLA LDWVEHDEVL RLMAGAKLLI FPSTWGEPLS RVLLEACAVG MPIVAMATGG TPDLIQHGLN GYLARSAKQL GVLAAELLHN PQRAEQLGQV AYQTAQTRLA STVVAEQVEQ LYWTLLTQQP QRALTGYD
|
| |