Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4004 |
Symbol | |
ID | 5735865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5110163 |
End bp | 5111359 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281154 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546764 |
Protein GI | 159900517 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAAC CTTTAGTGCT GCATATTGCA ACTGCCGATA TGGGCTTGCG CTTTTTACTG CTCGAACAGA TGCAGGCAAT TCATCATGCA GGCTATCAGG TACGCGGAGT TGCCAGCGAT GGCCCCTATC GCGCCGAGGT TGAGGCCGCT GGAATTCCGG TCGATGTGAT CAAGATGCCT CGCGCAATTA CTCCAAACCG CGATTTGTTA GCGCTAACCC AGCTTGTGCG TTTGTTTCGT GAACTCAAAC CAACGATTGT CCATACCCAT AATCCTAAAC CTGGTTTGCT TGGCCAGCTA GCAGCACGCA TTGCTGGCGT GCCAATTATT ATCAACACCA TTCATGGCTT TTATTTTCAT GAGCATTCGA GCGCCAATCA GCGGCGCTTC TACATTGCCA TGGAGAAAAT AGCCGCCCGT TGTTCGCATG CAATTCTTTC GCAAAACCGC GAAGATCTCA ACGCAGCGCT TGCGCTCAAG ATTGCGCGGC CAGAGCAAAT TAGCTTTTTG GGCAATGGCA TCAATTTACA AGTGTTTGAT CGGCGGGCCG TGAGCCAAGC CGACATTCAA GCTGCCCGCC AAGAACTGGG TATTCCTGCT GATGCCCAAG TGATCGGAGC AGTTGGGCGT TTGGTTGCCG AAAAGGGCTA TCACGAGTTG TTTCAGGCGT GCCAACAACT GATGGCAACT CGCCCCAATT TACATTTGCT GGTGGTTGGC CCCGAAGAAC CAAATAAAGC CGATGGCCTG ACCGCCGCAA CCGCCGCCAA ATATGGCATT GCTGAGCGCA CCCATTTTGC AGGCCTGCGC CGTGATATGC CGGTGCTGTA TCGGTTGATG GATGTTTTGG CCCATCCTTC CTATCGCGAG GGCTTTCCGC GTGCGCCAAT GGAAGCGACC GCAATGGGTG TGCCAGTGGT TGCCAGCGAT ATTCGCGGTT GCCGCGAAAC CGTGGTGCAT AGCCTTAACG GCATGTTGGT GCCAGTGCGC GATGTAGCGG CCTTAGCACA TAGCCTTGGC CGCATGATCG ACGATCGGGT GTTACGTGAG GCCTTTGCGC GGCTAACTCG GCGGGTTGCT CAGCGCGAGT TTGATCAACA ACGGGTTTTT GATCGAGTGC TGTTGACCTA TGCCAAGCAA TTACAAGCCC ATGGAATGGC CGTGCCCGAA CCAATTCAGA GCCAAGCCTC AACCTAA
|
Protein sequence | MNQPLVLHIA TADMGLRFLL LEQMQAIHHA GYQVRGVASD GPYRAEVEAA GIPVDVIKMP RAITPNRDLL ALTQLVRLFR ELKPTIVHTH NPKPGLLGQL AARIAGVPII INTIHGFYFH EHSSANQRRF YIAMEKIAAR CSHAILSQNR EDLNAALALK IARPEQISFL GNGINLQVFD RRAVSQADIQ AARQELGIPA DAQVIGAVGR LVAEKGYHEL FQACQQLMAT RPNLHLLVVG PEEPNKADGL TAATAAKYGI AERTHFAGLR RDMPVLYRLM DVLAHPSYRE GFPRAPMEAT AMGVPVVASD IRGCRETVVH SLNGMLVPVR DVAALAHSLG RMIDDRVLRE AFARLTRRVA QREFDQQRVF DRVLLTYAKQ LQAHGMAVPE PIQSQAST
|
| |