Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SYO3AOP1_1565 |
Symbol | |
ID | 6331160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfurihydrogenibium sp. YO3AOP1 |
Kingdom | Bacteria |
Replicon accession | NC_010730 |
Strand | + |
Start bp | 1614465 |
End bp | 1617377 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 642657840 |
Product | glycosyl transferase family 2 |
Protein accession | YP_001931717 |
Protein GI | 188997466 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000603283 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGATT TAAAAATATC AGTTTTAATT CCAACTTATA ACAGACCGGA GTATTTAAGA GAAGCTTTAA AAAGTATTTT AGATCAATCA ATTAAACCAT ATGAAGTTAT AGTAGCAGAT GATAATCCAA ATGAAGAGAT AAACAAAAAG AATTTTGAAG TTGTAAACGA GTTTGCTAAA GATTATCCTT TTATAAAATA TCATAAAAAT GAGAAAAATC TTGGACCGGC TGAAAATTAC AAAAATCTGT TTTACTTAGC GTCAGGAGAT TTAATCCATT TTTTATGTGA TGATGATATC CTTGCACCTT ATACATTAGA AGAGTTAAGC AAACCTTTTA AAGAAGATGA TGATATAGTA ATATCTGCCG GAAAAACTTT ATTTGTAGAT GAAAAATTAA GGGTTATTTA TTTGCCGGCA TTAAGAAACC ATTTAGAATT TTATGAAAAA TATTTTAATG ATTCTTATGT CAATGGTAAG TGGCTGATAG AAAAATCCTT AGAAAAGCTT TTTAACATTA TGGAAAGTTT TTCAGGTTTT ATGTTTAGAA AAGATAAAGT AGATTTTGAG CTATTTAAAT TTGATGCTAT AGATTTTGAA GCTAATGCAG ATTGGTTTTT ATGGATGAAT TTAGCAAAAA AAGGAAAAGT TTATTTAAGT AATAAGGCCT CTAACCTGCA TAGAATGCAT GAAACAAATA ATCAGCTTAA CATAAAAACT ATGCTTAAAG AAGCAATCGA GAGGCAAACT TTTTTAACAA AAGAATTTAT TGGTAGTCTT GGTATAGATG AAAATTTATT TAGATACAAA GATGGGTTTA CAAGAAATTT ATATTACATT GATGCCTTTT ACCAATACAT AAAAACAGAC AGAAAACTTT CCAAAGATTT AAAGCAATTG CTAAACAAAA TCTACAATGC AAATTTAGAT TTAAAAATTG AAAGAGAACC ATTTTCTATA ATTATAGTCT CTTACACCTC TTCAGATACT ATCAAAGAAT GCATTCAATC AGTTTTGGAT TCTTATTTAA TAGAAGATGA TGAAATAATT GTTGTTGACA ATAATTCGAA AGATAACACG GTTGAAATTG TAGAATCTTT TAATGATAGT AGAGTAAAGA TAATAAAAAA TTCAGAAAAT CTTGGATATT CTAAAGCTAT AAATCAAGGA GTTGATTTAT CAAAGAACCC TTATTTGGTA TTTTTAAATC CAGATACTGT TGTAATCTCT AAAGACTGGC TAAACAGATT TTACAAAGAG CTTCAAGATA AACATGTTGC AATGGTTGGT CCGGTTTCTG ACATAGCAAT GTATAAAAAT AATTTATCTA AATATATTCT TCCAAACCTG AGATTTTTAG ACTTAAAAAG ATATGAAAAA TTTTTAAAGT ATATTTACAA CCAATATCAT GAAGATACAA CTCTTTTGTC AGGATTTTGT ATTGGAATAT CAAAAGAAAG ATTTTTAGAA TTTGGAAAAT TTGACGAAGA CCTTATCCTT GGATTTGATG ACTTTGATTT TTCATTAAAA GCACAAGAAA AAAATCTAAA ACAGTATGTT TTACCTTCAG TATTGATAGC ACATAAAAAT CATAAATCTT TTGAAAAAGA TATAAAAAAA GCTAATTACT TAAATAAATT AAGTTTATAC AACTTTTTAA AAAAACTAAT CAAAAAATAC GGATATGGAA ACGTTCCAAA TCCAATAGAT TTGTTTATGA AAGATTTAAA GAAAACAGAG CCATTTTATA CATTTGACTT GTCCGACGGA AGATACAGAT ATGTTTTCAA ATTTACTGAT AAAGAAAAGC CGAAAGACTT TTTTAGACAA AAGGCTAAAA TAATAAAATT AAAGCCAAGA ATAGCAATCA TAACGGTTAA TTATTTCTCT TCTGAATATA TAAAAAGTCT TTCCAAATCT ATCGTAGATT CTGATTATCC TAACATAGAT TTTATAGTTG TAGATAATTC TGAAAATGAA GAAGAGTTTA ATAAGCTTGA GAATATCTTA AAAGAAGTGT TTGGAGAAAC TACAACAAAA CAATGTTATC TACTAAAAAA TATTAACAAT GGCTATGCAG GCGGAAACAA TTTAGGCATA AGATATGCTA TAGAAAATTT AAAATCAGAA TATATATGGC TGCTAAATCC GGACACGAAA ATAGAAAAAA ATACACCTTT GGAGCTTTTA AAGACCTTAG AGTATACAGA CATTCCGGTA GTTACTTGTA AGATAAAAAA TTACGATGAT GAGAATGTTC AATATGTAGG GGACAATGTA TTCTTAGGTG GAGCTGAAAA CTTAAAAGAT TACGGCTTAA GATATGTAAA GTTTTTATCA GGTGCCAACA TCTTTATGAA AAGTGAAGTT ATTGAAAAGG TTGGATATAT AAACGAAGAT TTCTTTTTAT ACTTTGAAGA TAATGAATTT CTTATAAGAT TAAGAAAAAA CGGAATAAAT CCAATTTATA CACCTTTTAC ATTTATTAGA CACAAAGTTG GAGGTACTAC AGATAAAAGT AATAAATATC CTTTATGGTA TTATTATGAG ATAAGAAATA GATTGTTAAT TCTTGATTAT GCAGCAGATA ATAAATATGA TTTTATGGAT TATCTGACAA GCACTTTAGG TGAGATTAGA AAAAGCTATA TCGAATCAGA GCATAACAAG AAACAAGCTA TTTTGGATGG TATTTTTGAT TTTAGTAAAA GGATAAAAGG GAAAGTAAAT AGAGGTAATT TTGACAAAGA AGAAATAGGA AATAGACTGA TAGAGTTTGA AAGTTATATA GAAGATGTTA TTATAGACTA TGATATTGAA AAGATGTATT TAAAATTAAA ATTGGACTCG GAGAAAATGG ATTTATTTGA TGAGTTTTTG GAGTTGGTGA AGATAAAGGA AATAGGAAGA TAA
|
Protein sequence | MTDLKISVLI PTYNRPEYLR EALKSILDQS IKPYEVIVAD DNPNEEINKK NFEVVNEFAK DYPFIKYHKN EKNLGPAENY KNLFYLASGD LIHFLCDDDI LAPYTLEELS KPFKEDDDIV ISAGKTLFVD EKLRVIYLPA LRNHLEFYEK YFNDSYVNGK WLIEKSLEKL FNIMESFSGF MFRKDKVDFE LFKFDAIDFE ANADWFLWMN LAKKGKVYLS NKASNLHRMH ETNNQLNIKT MLKEAIERQT FLTKEFIGSL GIDENLFRYK DGFTRNLYYI DAFYQYIKTD RKLSKDLKQL LNKIYNANLD LKIEREPFSI IIVSYTSSDT IKECIQSVLD SYLIEDDEII VVDNNSKDNT VEIVESFNDS RVKIIKNSEN LGYSKAINQG VDLSKNPYLV FLNPDTVVIS KDWLNRFYKE LQDKHVAMVG PVSDIAMYKN NLSKYILPNL RFLDLKRYEK FLKYIYNQYH EDTTLLSGFC IGISKERFLE FGKFDEDLIL GFDDFDFSLK AQEKNLKQYV LPSVLIAHKN HKSFEKDIKK ANYLNKLSLY NFLKKLIKKY GYGNVPNPID LFMKDLKKTE PFYTFDLSDG RYRYVFKFTD KEKPKDFFRQ KAKIIKLKPR IAIITVNYFS SEYIKSLSKS IVDSDYPNID FIVVDNSENE EEFNKLENIL KEVFGETTTK QCYLLKNINN GYAGGNNLGI RYAIENLKSE YIWLLNPDTK IEKNTPLELL KTLEYTDIPV VTCKIKNYDD ENVQYVGDNV FLGGAENLKD YGLRYVKFLS GANIFMKSEV IEKVGYINED FFLYFEDNEF LIRLRKNGIN PIYTPFTFIR HKVGGTTDKS NKYPLWYYYE IRNRLLILDY AADNKYDFMD YLTSTLGEIR KSYIESEHNK KQAILDGIFD FSKRIKGKVN RGNFDKEEIG NRLIEFESYI EDVIIDYDIE KMYLKLKLDS EKMDLFDEFL ELVKIKEIGR
|
| |