Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3357 |
Symbol | |
ID | 5735227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4235142 |
End bp | 4236308 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280504 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546121 |
Protein GI | 159899874 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.656502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAAAA TAACTGTCGC TCAAGTGATT ACCGGCTTTG CGAGTGCCGA AGGTGCTGGC GGCTCAGCCT TATTTGGCAT CGAAGTAGCA CGAGCTTTAG ATAAAAGCCG TTTTCGGCCA ATTTTGTGTG GAATTCATCG GTTTAATGCA CCTTCGGAGC AGCGTTGGCT CAAAACCTTG GCCGATGAGG GCATTGAAAC CAGAATTATG GTGCAAGAAC GCAGCAAATT GCGCTACGAT ATGGTGCGCT TCAGTGCGTT GCTCAATCAA CTGATTCAAG CACAAGCCGT TGATATCATT CACACCCATG TTGAGCGAGT TGAATTTTTC ATTAGTTTGC AAAAATTACT CCACCCCAGC CACTATCCCA AACTTGTCCG CACCATTCAT GTCAATGCCA TGTGGGTTAC GCGGCCATTA GTACGACGCT TGATGAACAT TGTCTACACC CAACTATTTG GCGAGGAAAT CGCAATTTCC CAAGCCACCA AAACCATGCT TGATCAACGC ATGGCAGCCA AGGTCTTTGG GCGCTCGGCC AGCTTAATTC AAAATAGCCT ACCGCTCGCA CGCCTGCAAA AATTCGATCT ACCCAAACAG CACCAGCGAT TTAGCCCACC CCGTTTTTTA GTGATTGGCC GGCTAGAAAT CCAAAAAGCT CAAGATATTT TTATTCAAGC GGCGGCGTTG GTGTTGCAAC AATACCCTGA AGCCGAGTTT TGGTTGGCAG GCGAAGGCAC CCAAGAGGCC AATTTTCGCC AATTGACGGC CAATTTAGCG ATTGAGCATG CAGTTAAATT CCTTGGGCCA CGCGGTGATA TTCCCGAAGT GTTGAGCCAA GTCGATGTGC TGGTCTCAAC CTCACGCTGG GAAGGCTTTG CAACGGTAAT TTTAGAGGCA ATGGCAGCAC GCACGCCAGT GATTGCTACC GATATTGGCG GCAATAACGA ACAAATCGTT GATGGCGAAA ATGGGCGTTT GGTCGCAAGC GAAAATCCTA GCGCAGTCGC CGATGCCATG ATCTGGATGC TTGAACATCC TCAAGCAACT GCGCTGATGG CACAGCGCGG CTACGAATGG GGGCAGCAGT TTACGATGGA ACGCACTGCT GCCCAGTATG GCGAACTGTA CGAGCGTTTG CTTAGGGAGC AAAAATATCG ACCTTAA
|
Protein sequence | MRKITVAQVI TGFASAEGAG GSALFGIEVA RALDKSRFRP ILCGIHRFNA PSEQRWLKTL ADEGIETRIM VQERSKLRYD MVRFSALLNQ LIQAQAVDII HTHVERVEFF ISLQKLLHPS HYPKLVRTIH VNAMWVTRPL VRRLMNIVYT QLFGEEIAIS QATKTMLDQR MAAKVFGRSA SLIQNSLPLA RLQKFDLPKQ HQRFSPPRFL VIGRLEIQKA QDIFIQAAAL VLQQYPEAEF WLAGEGTQEA NFRQLTANLA IEHAVKFLGP RGDIPEVLSQ VDVLVSTSRW EGFATVILEA MAARTPVIAT DIGGNNEQIV DGENGRLVAS ENPSAVADAM IWMLEHPQAT ALMAQRGYEW GQQFTMERTA AQYGELYERL LREQKYRP
|
| |