Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5097 |
Symbol | |
ID | 5737055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 123439 |
End bp | 124461 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641282262 |
Product | glycosyl transferase family protein |
Protein accession | YP_001547853 |
Protein GI | 159901607 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.912612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGTAA GTATCATTAT TGTCAGTTAT AACAGTCAGG ATGATCTCGT TGAGTGTCTC GACTCCGTGA TCCAAGCCTG TCCTGATCAA ACACGGTATG AAATTCTTGT TGTTGATAAT GACTCACAGG ATGCAAGTCG TCTCGTCGTC CAGCAGCAGT ACCCGATGGT TCGACTCCTT GAAAACAGTA ATACTGGCTA TGCAGGGGGG AATAACTATG GTGCAGCGAT GGCGCGTGGT GAATACCTTC TCTTTCTTAA TCCCGATACG GTGGTCATGC CAGGTGCCAT TGATGCGCTC GTAGCCCCCT TCAAGACTGA TCCGACCATT GGTCTTACCA CGGCATGTCT TGTCCACCAT CAGCACCCCC AATCCATTAA TGCCTGTGGA AATGAGATGC ACTATACCGG GTTAACGTAC TGTCGAGGTG CCAATCAACC CCGGACTGCC TATCAAACAA GTTCCTATGT TGATGCCGTG TCAGGCGCTG CATGTGCGAT CCGTCGCAGC CTATTTACCA CGTTGGGGGG GTTTGATCAG CAGTTTTTTA TGTATGTCGA AGATAGTGAT TTATCGTTAC GGGTGCGGCT CTATGGATTG CAGTGTTTTT ATGTTGCGGA TGCGGTTATC CAGCATAAGT ATCACATGAA GTATACCGCC CAAAAAGCAT TTTTGATTGA ACGCAACCGC TATTCGATGC TCATTAAAAA TTTCTCACCG AGCGTCTTAG GTCGCTTACT GCCAGGACTT CTTCTGGCCG AAGTGATTAC CGGAAGTTAT TTTTTGCTGC GTGGACCACA GTATTGGAGT ATCAAACCGC GACTCTATCA GCATATCTGG CGGTATTGGA GGACTACATC GACACCAGCG ACCAGTCCGA TCCAAGAACT GGCCGTTGTC AAAGCATTAA CCAGTCAGCT TAATTTTCAG TCACTCCATC AGGGCAGGGT TACGCGGTTG CTTGCAGGGA TGGTCAATCC GCTATTGGGG CTGGCTCATC GATTTGCAGG AGGGTGGGCA TGA
|
Protein sequence | MDVSIIIVSY NSQDDLVECL DSVIQACPDQ TRYEILVVDN DSQDASRLVV QQQYPMVRLL ENSNTGYAGG NNYGAAMARG EYLLFLNPDT VVMPGAIDAL VAPFKTDPTI GLTTACLVHH QHPQSINACG NEMHYTGLTY CRGANQPRTA YQTSSYVDAV SGAACAIRRS LFTTLGGFDQ QFFMYVEDSD LSLRVRLYGL QCFYVADAVI QHKYHMKYTA QKAFLIERNR YSMLIKNFSP SVLGRLLPGL LLAEVITGSY FLLRGPQYWS IKPRLYQHIW RYWRTTSTPA TSPIQELAVV KALTSQLNFQ SLHQGRVTRL LAGMVNPLLG LAHRFAGGWA
|
| |