Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SYO3AOP1_0086 |
Symbol | |
ID | 6331277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfurihydrogenibium sp. YO3AOP1 |
Kingdom | Bacteria |
Replicon accession | NC_010730 |
Strand | - |
Start bp | 95491 |
End bp | 97029 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642656366 |
Product | glycosyl transferase family 39 |
Protein accession | YP_001930288 |
Protein GI | 188996037 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00741033 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATACTA AAAAAATAGA TTACAATCTC ATATTGATAG GCTTATTTTC TTTTTTTCTA TTTTCTTTAA ACATCGGCGG AGTATCAATT TTCAGTCTTG ATGAAGCAAA AAACGCATCC TGTGCAAGAG AGATGTTAGA ATCTAAAAAC TTCATTGTCC CAACATTCAA CTACGAACTT AGAACAGACA AGCCTCCACT ACATTATTAC TTTATGATGC TTTCTTATCT TATCTTTGGA GTAAGTGAAT TTTCAGCAAG ATTTTTCTCA TCAGTCTTTG GAAGTTTAAC GGTAATGATT ACATACTTCT TTGCAAAAAA AATATTTAGC ACAAAAACAG CCATTTTATC AGCCATTGTT CTGATTTCAT CACTGCATTT TGTTTTTCAA TTTCATATGG CAGTTCCAGA CCCGTATTTG ATATTTTTCA TAAACTTGGC ATTTTTTAAT TTTTATCTTT TTTATAAATT CAAAGAAGAA AAATATCTTT GGACTTTATA CACAGCCCTT GGCTTTGGGA TGTTAGCAAA AGGAATAGTT GCGATTGTTT TACCATTTTT TATAATTTTT GCGTTTTTAT TTTCAGTAAA AAAAAGCATA TCAGCCATTA AAAGCTTGAA ACTTCATAAA GGTTTAATTT TAACAGCTTT AATATCTCTT CCTTGGTATA TATTAGTTAG CATAGAAACA AACTTTCAAT GGACGAAAGA ATTTTTCTTA AAGCACAACA TTAGTAGATT TACAGACTCA ATGGAAGGAC ACGGGGGAAT ATTTCTAATT ACAATCATTT TCGTTCTTAT CGGAATGCTT CCATTTAGCA TTTTTACTTA CCAATCGGTC AAAGAGACTA TCAAAAACAG ACTAAATCCA GATTACCTTT ACTTAGGTCT GATAGTATTA ATATACACAG GATTTTTTAG CATTTCTAAA ACAAAACTGC CAAACTACAC AGTGCCCGTA TATCCAGCTT TTGCAATACT TCTATCTCTT ACGCTTTTAA AGATTAGAAA CTATCTATTT TCTTTAATCT TTTATCTTAT CTTTACAGCA TCACTGCCTT TTGTTTTATA CACAACATTA AAGAATGACA AAAATCTATA TCTACTTGCA AATTACAGTT TTTACTTTCT TATTTTGGCA GCTGGTGGAT TGTTTGCTTT AATCTACTTC AAGGATATTA AAAAGGTTGT TCTTTCTCTT TTTATTTCAA GCGTTGCAAT GAGCATTACA CTTTACACAG TTATTTTGCC AGAGATTGAC AAATACTCAT CAGTTAGAAT TATACTAAAT TACATGGAAA AAGACAGACC CGTAGGCTAT TATAAACGAT ACAATCCAGC TTTTTCTTTT TACTTAAAGA AGAAGATAAT TCCTTTAAAC TCTAAGCAAG ATGTTGAAAA TTTTATTAAA TCAGGCAGAG TTTATATCCT AACAAGGGAT GAGTATTTAG AAGAGTTGAA AGACATTAAA GACTTAAAAG TTATCATTCA AAAAAAAGAT TTGTTTGAAA ATTCAGTATC GGTTTTGATT TCAAATTGA
|
Protein sequence | MDTKKIDYNL ILIGLFSFFL FSLNIGGVSI FSLDEAKNAS CAREMLESKN FIVPTFNYEL RTDKPPLHYY FMMLSYLIFG VSEFSARFFS SVFGSLTVMI TYFFAKKIFS TKTAILSAIV LISSLHFVFQ FHMAVPDPYL IFFINLAFFN FYLFYKFKEE KYLWTLYTAL GFGMLAKGIV AIVLPFFIIF AFLFSVKKSI SAIKSLKLHK GLILTALISL PWYILVSIET NFQWTKEFFL KHNISRFTDS MEGHGGIFLI TIIFVLIGML PFSIFTYQSV KETIKNRLNP DYLYLGLIVL IYTGFFSISK TKLPNYTVPV YPAFAILLSL TLLKIRNYLF SLIFYLIFTA SLPFVLYTTL KNDKNLYLLA NYSFYFLILA AGGLFALIYF KDIKKVVLSL FISSVAMSIT LYTVILPEID KYSSVRIILN YMEKDRPVGY YKRYNPAFSF YLKKKIIPLN SKQDVENFIK SGRVYILTRD EYLEELKDIK DLKVIIQKKD LFENSVSVLI SN
|
| |