Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4438 |
Symbol | |
ID | 5736289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5678504 |
End bp | 5679649 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281601 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547198 |
Protein GI | 159900951 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATAG CAATTGATTA CAATGCAGCG GTACGTCAGG GCGGCGGAAT TGGCCGCTTT GTTCGCGAAA TAACTCAGGT CGCCGCGCAG GCAGCGCCTC AGCATCGCTT CTCGCTGTGG TATGCTGCGC GTGGGCTTGA CCCAAAAAGT GCTCAGATGC AGGCCTTGCA TGAGCTTCAA CGGCGTTTGC CCAATATCAA GCCGCGCCCA ATTCCAATTA ATGAGCGCTT GTTGACGATT CTTTGGCAAC GCTTGCGCAT GCCCTTGCCT GTCGAGACGA TTGTTGGGGC GGTTGATGTG GTGCATGGCA CTGATTTTGT GTTGCCGCCG ACCAAGGCTA AAACGCTGCT CTCGATTCAC GATTTTGCCT ATATTATTCA CCCTGAAACT GCGCCACCCG AGTTGCGGCG TTATTTGGGT GGGGTTGTAC CGCGCAATGT GCGCCGCGCC GACCATATTC ACGTTAATTC GCGGGCAACC AAAGCCGATA TGGAGCGCTT GCTGGGCACA GCACCCTCTA AATCGACAAT CGTTTATTCG GGTAGTGGCA GCGATTTTTA TCCTCGGCCT GCGGCGGAAA TTGCCGAAAT GCGCCAACGC TTGGGCTTGC CCGAACGCTA CCTTTTAAAT GTAGGCACGG TGCAGCCGCG CAAAAATGTT GAGCGTTTGA TCGAAGCCTT TGGTCAATTG CCCGCTGAGT TGCGCAGCCA GCCCTTGGTG ATCGGCGGCA AACGGGGTTG GTTGGCCGAG CCAATTTATG CAGCGGTACA ACGCCATGGC CTTGAGCAAG CAGTCATCTT CTTGGATTTT GTCAGCGACA GCGATTTGCC CAAGCTCTAT AGCGGCGCGA CCGCCATGGT TTATCCCTCG TTGTATGAAG GATTTGGCGT GCCGATTGTT GAAGCTCAAG CATGTGGCAC ACCCGTGATT ACCTCAACCA TCTCCAGTTT GCCCGAAATT GCTGGCAACG CGGCCTTGCT GGTCGATCCA CATGATACAG CGGCACTAAC AGCGGCTTTA CAAAAAATTT TAACTGAGCC TGATGTTTGC CAAAGCTTGG CTGAAGCAGG CCCACGCCAA GCCGCTAAAT TTACGTGGGA AGGCACTGGT TTGGGCGTTT TGGGGTTATA CGAATTGCTG GGGTAA
|
Protein sequence | MHIAIDYNAA VRQGGGIGRF VREITQVAAQ AAPQHRFSLW YAARGLDPKS AQMQALHELQ RRLPNIKPRP IPINERLLTI LWQRLRMPLP VETIVGAVDV VHGTDFVLPP TKAKTLLSIH DFAYIIHPET APPELRRYLG GVVPRNVRRA DHIHVNSRAT KADMERLLGT APSKSTIVYS GSGSDFYPRP AAEIAEMRQR LGLPERYLLN VGTVQPRKNV ERLIEAFGQL PAELRSQPLV IGGKRGWLAE PIYAAVQRHG LEQAVIFLDF VSDSDLPKLY SGATAMVYPS LYEGFGVPIV EAQACGTPVI TSTISSLPEI AGNAALLVDP HDTAALTAAL QKILTEPDVC QSLAEAGPRQ AAKFTWEGTG LGVLGLYELL G
|
| |