Gene Information |
Locus tag | Haur_1795 |
Symbol | |
ID | 5733697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2083356 |
End bp | 2084276 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278938 |
Product | glycosyl transferase family protein |
Protein accession | YP_001544566 |
Protein GI | 159898319 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
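The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2083356
end_bp = 2084276
protein_length_aa = 306

# Assumption: coordinates are 1-based and inclusive, so the length
# is the difference plus one.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose final codon is a stop
# codon, so the protein has one residue fewer than the codon count.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 921
print(expected_protein_length)  # 306
```

Under these assumptions the result agrees with the listed Gene Length (921 bp) and Protein Length (306 aa).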
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCATA TTGCTGTTGT GATTCTCAAT TTTAATTCGG GCGATGTCCT GAAGCCGTGT TTAGCCTCAC TTGCCGACCA AGATTGGAAT GGTCGGCTTG ATGTGTGGGT GGTCGATAAT GCTTCATCCG ACGAGAGCGT GGCCATGGTG CGCGAGGCGT TTCCATGGGT GCGGTTAATT GCCTCGCCGC AAAATAGTGG CTTTTCGGCG GGCAATAATC TTGCTTTGCG CCAGATTTTG GCCGAAACTC CAACTCCTGA GGCTGTGTTG GTGCTTAACC CCGATACGGT TGTGCCAACG AATGGTATTG CGGGCATGGT TGAGGCCTTA GTTGAACGCC CAAAAGCAGG CATTGTCGGG CCAAAATTGG TGCTAGCCGA TGGCTCGCTC GATTTAGCAT GTCGGCGGGC GTTTCCTTCA GCCGAGGTAG CACTCTACCG TATGATTGGT TTATCGAAGC TATTTCCCCG TTCGCCGCGC TTTGGCCGCT ACAACATGAC CTACCTCGAC CCTGATCAAG CCACCGAGGT TGATTCGATT GTTGGGGCGG CGATGTTGTT GCGGACTGAG GTTTTGCGCG ATGTAGGCTT GCTCGATGAA GCCTTTTTTA TGTATGGCGA AGATATTGAT TGGTGCTATC GAACAAAAAG CTATGGCTGG CAAGTTTGGT ATGATCCACG GGTGACGATT TTGCACTATA AACGGGTTTC GAGCACTCGT CGCGCCGTGC CATCAATTCG GGCTTTTTAT GATGCCATGC GGATTTTTCA CCGCAAACAT TATGAAGCAA CTACATTGGC CCCATTGAAT TGGCTGATTT ACCTTGGAAT TACCCTAAAA GAATGGCAAG CGCTGGTGCG TAATCGATTG CGCCCATTAG CCTCACGACG AGCAACCCAT GGCAACAACC ACAATGCTTA A
|
Protein sequence | MNHIAVVILN FNSGDVLKPC LASLADQDWN GRLDVWVVDN ASSDESVAMV REAFPWVRLI ASPQNSGFSA GNNLALRQIL AETPTPEAVL VLNPDTVVPT NGIAGMVEAL VERPKAGIVG PKLVLADGSL DLACRRAFPS AEVALYRMIG LSKLFPRSPR FGRYNMTYLD PDQATEVDSI VGAAMLLRTE VLRDVGLLDE AFFMYGEDID WCYRTKSYGW QVWYDPRVTI LHYKRVSSTR RAVPSIRAFY DAMRIFHRKH YEATTLAPLN WLIYLGITLK EWQALVRNRL RPLASRRATH GNNHNA
|
| |
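The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, third fastest).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the display spacing (groups of 10) and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three 10-base groups of the gene sequence above.
first_bases = "ATGAACCATA TTGCTGTTGT GATTCTCAAT"
print(translate_cds(first_bases))  # MNHIAVVILN
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate_cds` would allow the same comparison against the GC content and Protein Length fields.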