Gene Information |
Locus tag | Haur_4282 |
Symbol | |
ID | 5736141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5468219 |
End bp | 5469436 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281442 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547042 |
Protein GI | 159900795 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
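
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5468219
end_bp = 5469436

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1218 405
```

Under these assumptions the result matches the reported Gene Length (1218 bp) and Protein Length (405 aa).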
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.870736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAATTG CTTATATTGC TTATCCAACC AGTTTGATGC TGGCTTCGGC CAATGCGATT CAAACCTGGA CAACCCTACG CGAATTGCGC CAACAAGCGC CCAACACCTT GATTATTATT CCGCGCTGGT TGCGTGAACC AAGCCGATTT AAAGAGGTCG GCGCAACACA CTTGCAACGC CCAGCAATTG GCAAGCTCTC GCGTTTTAAA AAATCGACTT TGTGGTATTA CGCTGAACGT AGCGTTTTTG CCGCCATGAG TGCTGCTGTC GTAGCCAGCC AACGTTGGCG CGGCGAGGCC GTCGATGTGG TCTATGTGCG CGAGGTGATT GCCGCTGGTT GGTGGGCCAC GCTTTGGGGG CCGTTGCTCA ACATTCCGGT GATCTACGAG GCCCATGATC TGGAAAGCTG GAATCCGTCA CGCGCCAAAG AATCATGGGT GCAGCCCGTG CTCAATTTGC TTGATCGCTT GACGCTTGGG CGGAGCGCTG CTGTAGCTTC ATTGACCGAT GATTTTCGCC AACTCTTGGC ACGTTTGGGC TGGCGTAAAC CCAGCGATGT GGCCGTAATC CCCGATGCGT TTGATGATTC GCTGTATCAG CCCCACGATC GCCAACAGGC GCGGGCGCAG CTTGGGCTTG ATCCAACTGC ACCATTAATT GTCTATGCTG GAATGACCTT CTCCTATCGT GGGATTGATC GCTTGATTGC TGCTTTTGCG AGCCTACGCC AAGCGATGCC AAACGCTCAA TTATTGTTCA TCGGCGGGCG GCCAGCTGAA ATTGCCCAAT TTAGCCAGCA GGCCAACCAT TTGGGGCTTG GCGAGAGCGT GCGTTTTCTG GGAGCCTTGC CGCAAAGCGC CACGCCAGCC TATTTACATG CTGCCGATGT TTTGGTCATT CCCGATACAG TCACCGACGT AACCGCCTCG CCGCTCAAAT TATTTGAATA TTTGGCGGTT GAGCGGGCGG TCGTTTTGCC GAATATTCCA GCCTTGCGCG AAATTTTGCC CGAACAGATC GGCTATTATT TTGAGCGTGG CAGCATCCAA GGCTTAGAGC AAGCTCTCGT CGATGCCTTA ACCGATCCGC TGCGCCCTGA GCGTGAGCAG GCTGGCCGCC AATGTGTGCA AGAGCATACC TATCGCGCCC GCGCTGGCAG GATCAAGGCC TTGTGCCAAC AAATTAGCCA AACAACCAGT AGTAGTGCAT TAGATTAA
|
Protein sequence | MRIAYIAYPT SLMLASANAI QTWTTLRELR QQAPNTLIII PRWLREPSRF KEVGATHLQR PAIGKLSRFK KSTLWYYAER SVFAAMSAAV VASQRWRGEA VDVVYVREVI AAGWWATLWG PLLNIPVIYE AHDLESWNPS RAKESWVQPV LNLLDRLTLG RSAAVASLTD DFRQLLARLG WRKPSDVAVI PDAFDDSLYQ PHDRQQARAQ LGLDPTAPLI VYAGMTFSYR GIDRLIAAFA SLRQAMPNAQ LLFIGGRPAE IAQFSQQANH LGLGESVRFL GALPQSATPA YLHAADVLVI PDTVTDVTAS PLKLFEYLAV ERAVVLPNIP ALREILPEQI GYYFERGSIQ GLEQALVDAL TDPLRPEREQ AGRQCVQEHT YRARAGRIKA LCQQISQTTS SSALD
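
As a rough cross-check of the two sequence fields, the sketch below translates a gene sequence with the codon assignments of translation table 11 and computes a GC percentage. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of the record. The sketch also assumes an ATG start; table 11 permits alternative start codons, which this code does not handle specially.

```python
from itertools import product

# Codon assignments for translation table 11, codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First three blocks of the gene sequence above.
first_blocks = "ATGCGAATTG CTTATATTGC TTATCCAACC"
print(translate(first_blocks))  # MRIAYIAYPT
```

The translated prefix agrees with the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow comparison against the complete protein sequence and the reported GC content.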