Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0775 |
Symbol | |
ID | 5732659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 876412 |
End bp | 877449 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277905 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001543551 |
Protein GI | 159897304 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0449927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGA TTCTTGTTTG TACAGCCCAA GTGCCTTTTG CCCGTGGTGG AGCCGAGTTG TTAGCCGAAG GCCTCCTGCA AGCCTTGCGC AAGGCAGGCC ATGAAGCCGA TTTAGTCGCC TTGCCCTTTA CTCGCACACC ACATCGCGAG TTGCTCAATA GCGCTTTGGC CTGGCGCATG CTCGATCTCA GCCAGGTTGA AGATCGACCA GTCGATCAAG TAATATGTAC CAAATTTCCT TCATATGCGG TGGCCCACCC CAAAAAAGTC GTTTGGTTGG TACATCAACA TCGGCAACTC TACGATTGGC GCGGCACAAA CTGGAGCGAT TGGGGCAGTC AACCTGATGA TGATCAACTC GCTCGCAGCC TGACGCGGCT CGATCAACAA GCCTTAGCCG AAGCCAAACG CCGCTTTAGC ATCTCCAAAA TTGTCAGCCA ACGCTTGCAA CGCTTCAATG GACTCGCCAG CACCCCGCTG TATCCACCGT CGATTTATAG CGGGCGCTTA CGTCAAGGCC GCTACGAACC GTATATTCTC AGCATTTCGC GGCTTGACCC CGCCAAACGA CTCGATTTAT TGCTGCATGC CCTAACGCAT ACCGAACAAC CAGTTAAGGC GATTATCGGC GGGCGCGGCC CAGCTTTGGT AGAACTCCAA GGGCTAACCA AGCAACTTGG GCTTGAACAA CGGGTTGAGT TTCGCGGCTG GATGGATGAT CAAACGCTGA TCGATGTATA TGCCGATGCC CGCGCCGTGT TCTATGCCCC GATCGACGAG GATTTTGGCT TTGCCACGAT CGAAGCGCTT GAGGCGGCCA AGCCAGTGCT GACCGCCCAA GATTCGGGCA CAGTTTTAGA ATTTATTCAC GATGGCACAA CCGGCTTTGT TGCGCCAGCC GAACCGCGGG CCATGGCCGC CCGCCTCGAC GCATTGTGGG CTTCTGCCGA TTTAGCAGCC CAACTTGGCA GCAATGGACC AGCGATGGTT GCCAACATTC GCTGGGAACA TGTCGTCAAT CAATTAGTTT TAGCTTAA
|
Protein sequence | MKRILVCTAQ VPFARGGAEL LAEGLLQALR KAGHEADLVA LPFTRTPHRE LLNSALAWRM LDLSQVEDRP VDQVICTKFP SYAVAHPKKV VWLVHQHRQL YDWRGTNWSD WGSQPDDDQL ARSLTRLDQQ ALAEAKRRFS ISKIVSQRLQ RFNGLASTPL YPPSIYSGRL RQGRYEPYIL SISRLDPAKR LDLLLHALTH TEQPVKAIIG GRGPALVELQ GLTKQLGLEQ RVEFRGWMDD QTLIDVYADA RAVFYAPIDE DFGFATIEAL EAAKPVLTAQ DSGTVLEFIH DGTTGFVAPA EPRAMAARLD ALWASADLAA QLGSNGPAMV ANIRWEHVVN QLVLA
|
| |