Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0774 |
Symbol | |
ID | 5732658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 875254 |
End bp | 876411 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641277904 |
Product | glycosyl transferase family protein |
Protein accession | YP_001543550 |
Protein GI | 159897303 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00080815 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCCTG TTGTGATTGG GCTGCTTTTA GGGCTAGCCT CATTGATTTT ATTGGTGCGT GATGCCTTGT TGCTCAACAA GATTCCCAAG ATTGAGCCAC GACCAAGCCC TGAACCTATG CCCAGCGTTG CTGTGTTGGT GCCAGCCCGC AATGAAGCGC AAAATATTGG CCATGTGTTG CGTGGCATGG CTCAGCAAAC CCGTAGTGAT TGGCAACTAA CCATTCTTGA TGATCATTCA ACTGATGCCA CGGCTGCGAT TGTAGCCGAT GTTGCGGCGC AGGATCAACG GGTACATTTG CTGCAAGGCC AGGCATTGCC CGCTGGCTGG ACAGGTAAGT GCTGGGCATG TTGGCAATTG GCCGAGGCTA GCACTAGCGA ATGGTTGCTG TTTCTTGATG CTGATACCAA GCCGCAGCCT GAGATGTTGC AACAAGCCCT AGCCTATGCC GAGGCCGAAA AACTCGATCT GCTGACGTTT TTGCCCTTCT CGGAGCTAGG CAGTTTTTGG GAGCAAACTT TGCTGCCAGC CTTTTTCTCA ATCATTCAGG CGGCCTATCC GGTCAGCAAA GTTAATACGC CTGGTTCGGG CGTGGTGCTG GCGAATGGTC AATTTATTTT GGTGCGACGC AGCGCTTACC AACGAGCAGG CGGCCATGCG GCAGTGCGTG ATCGAGTGCT TGAGGATGTT GAGTTAGCCC AAGCGATTGT GCGGGCTGGC GGGGTGATGC GGGCGGTGTA CGCTGGTGAG TTGTTGCGCG TGCGAATGTA CACCAAGGGC AGCGAAGTTC GCGAGGGCTT GGTCAAAAAT GCGATTGCGG GCTTACGTAA TGGTGGCGTG CGTTCTTCAT GGGCTGGTTT GCGCCAGATT TTGGTTGGAG TTGTGCCATT CGGGCTGGGT TTGCTAAGCC TATGGGCTTG GCTGCGGCGC TGGACATTCT GGCCAAAAAT CTTATTGAGT GCGATGGCGG TGCTGCTCAA TGGCTTTGCT TTGTGGAGTT GGGGCCGTTT TATGCAGCAA TTGTATGGCT TATCGCGCCG TCACGCCCTG CTTTTTCCCT TGGGGATTGT CTGCTATATG CTGCTGGCGG CTGAAGCGGC TTGGCGGATC TGGTCGGGGC GCGGGGTGAC GTGGAAAGGC CGCACCTACA AAGAGTAA
|
Protein sequence | MLPVVIGLLL GLASLILLVR DALLLNKIPK IEPRPSPEPM PSVAVLVPAR NEAQNIGHVL RGMAQQTRSD WQLTILDDHS TDATAAIVAD VAAQDQRVHL LQGQALPAGW TGKCWACWQL AEASTSEWLL FLDADTKPQP EMLQQALAYA EAEKLDLLTF LPFSELGSFW EQTLLPAFFS IIQAAYPVSK VNTPGSGVVL ANGQFILVRR SAYQRAGGHA AVRDRVLEDV ELAQAIVRAG GVMRAVYAGE LLRVRMYTKG SEVREGLVKN AIAGLRNGGV RSSWAGLRQI LVGVVPFGLG LLSLWAWLRR WTFWPKILLS AMAVLLNGFA LWSWGRFMQQ LYGLSRRHAL LFPLGIVCYM LLAAEAAWRI WSGRGVTWKG RTYKE
|
| |