Gene Information |
Locus tag | Haur_3583 |
Symbol | |
ID | 5735444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4505328 |
End bp | 4506515 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280732 |
Product | glycosyl transferase family protein |
Protein accession | YP_001546347 |
Protein GI | 159900100 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
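The coordinate, length, and GC fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; both assumptions agree with the values listed here. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * (bases.count("G") + bases.count("C")) / len(bases)


# Values taken from the Gene Information table for Haur_3583.
length = gene_length(4505328, 4506515)
residues = protein_length(length)
print(length, residues)
```

With the coordinates from this record, the sketch reproduces the listed Gene Length (1188 bp) and Protein Length (395 aa). Passing the full Gene sequence string to `gc_percent` would give a value to compare against the listed GC content.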
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCGG CGTTATTTTG GCTGTTTACA GGCAGCGTCA TCTATACCTA TGCTGGTTAT CCAGCGCTGT TGACTTTACT AGCACGCTTT CGACGTGAGC GCGGGCCTTA TCCGCACGCT GAGCCAGCAG TTACTTTGCT GATTGCGGCC TACAACGAAG AGGCTGAAAT CGCTCGCAAA ATTGAAAACT CGTTGGCCTT GGATTATCCA CGTGAGCAAC TGCAACTATT AATTATTGCC GATGGCTCAA GCGACCGCAC CCCCGAAATC GTGCAGCAAT ATGCCGATCG CGGCGTGGAA TTGCTGTATG AAGCTCCACG GCGCGGCAAA ATGGCGGCGA TCAATCGGGC AATTCCATTT GTGCGCGGCG AAATTATTGT TTTCTCTGAT GCGAACAATC GCTATGATGC CAATGTGATT CGCGAATTGG TTGCACCATT TGCCGATGCT GATGTTGGCG CTGTTTCTGG CGCAAAGGTA ATCGAAAAGG GCGACGGCGC TTTGGGCGAA TCCGAAGGGC TGTATTGGCG CTACGAATCG TTCGTTAAAA AGCAAGAAAG CAAATTGGCT GGTTGTACCG CCGTTGCTGG CGAAGTGCTA GCGATTCGCA GCGATTTATT TGAGTCGCCG CCCGATGAAA TTATCAACGA CGACACCCAT ATGGGCATGC GGATCATCGC CCGTGGCTTT CGGATTCACT ATGCGCCCCA GGCTCGCTCA CACGAACGGG TTTCGCAATC GGCGCAGGAT GAAATTGCCC GTCGCGCTCG AATTTTCGCT GGACGCTATC AAGCCATTAC TTATGCTCGT AGTTTGTGGC CATTCCGCCG CCCACTCGCC TTGTGGCAAT TACTCTCGCA CCAAACCTTA CGCACCTTGG TAGCGGTCAA TATGGTCGGC GCTTTGGTGA TGAACATTCT GGCAGTGCTC TCGCCAAGTA AGGCAAAATC GCATAAACTA CTACGTTTAG CAGCACCATT CAATTGGCTG TTGTTAATCT TCCAAATTGT GTTCTACACC ATGGCTTGGC TGGGTGGCCG AGTTGACTCA AATAGTAAAC TAGGCAAGGT GGTTTACATT CCAACCTTCC TAGTCAATAG CAATTGGGCT GGCTTGATTG GCCTATATCG CTTTGTCCGA CGACGACAAA CCACGCTATG GCAGCGAGTG GGGCGGCGCA CGAACTAA
|
Protein sequence | MIAALFWLFT GSVIYTYAGY PALLTLLARF RRERGPYPHA EPAVTLLIAA YNEEAEIARK IENSLALDYP REQLQLLIIA DGSSDRTPEI VQQYADRGVE LLYEAPRRGK MAAINRAIPF VRGEIIVFSD ANNRYDANVI RELVAPFADA DVGAVSGAKV IEKGDGALGE SEGLYWRYES FVKKQESKLA GCTAVAGEVL AIRSDLFESP PDEIINDDTH MGMRIIARGF RIHYAPQARS HERVSQSAQD EIARRARIFA GRYQAITYAR SLWPFRRPLA LWQLLSHQTL RTLVAVNMVG ALVMNILAVL SPSKAKSHKL LRLAAPFNWL LLIFQIVFYT MAWLGGRVDS NSKLGKVVYI PTFLVNSNWA GLIGLYRFVR RRQTTLWQRV GRRTN