Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2388 |
Symbol | |
ID | 3683231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 2965749 |
End bp | 2968772 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637717734 |
Product | glycosyl transferase family protein |
Protein accession | YP_322901 |
Protein GI | 75908605 |
COG category | [M] Cell wall/membrane/envelope biogenesis [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.257411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000141924 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGAAAA ATATCATTGC TAATGATGAG CCATTGGTTA GTGTTTGTAT TCCGACATAT AATGGTGAAT TTTTTATTGA TCTGGCGCTT CAAAGTATTG ATTCACAAAC ATATAATAAT ATAGAACTTA TCATTTCAGA TGATGACTCA GAAGATAAAA TAATAGAAAA GATTAATATT TTTCGAGAAA AATCAAAAAA AAAGATTTAT TTATTCACCC ATGAAAGATT AGGTTTGGTT AACAACTGGA ATTTCTGCAT CTCTCAGACT CAAGGTAAAT ACATTAAGTT TTTATTTCAA GACGACATTT TAGAACCAAA TGCCATTAGA GAAATGGTTA CCTTAGCAGA ACAAGATGAA GAGATAGGTT TAGTATTCTC ACCACGTAAA CTATTTTCTG TTTATAAAGA TGTTACCTAC AATCCAAAGT CTTTAGAATA TCATGAAGCC AAGGATATAC ATAAATATTG GTCAAACTTA AAAAGAATTC AATTAGGCAA AGAGCTTCTA GAAGACCCAA ATATACTTGA TGCTCCTATT AATAAGATTG GTGAACCAAC TACTGTTTTA ATCAAAAAAG AAGCTTTTGA GAAAGTAGGC TTATTTAATC CCGAACTGTG TCAGATTGTA GATTTAGAAA TGTGGCTCAG AATTATGAGC CGATACAAAA TCGGATTTAT TGATCAATAT TTATCACAAT TTCGCATTCA TCACCAACAA CAAACCCACC GGAATGCATC TTTAAAAGAT GTGATTTTCT TAGATTATCA AAAACTTTTT TATTTTATTG CTAATGATAG CCGTTATCCT GGCTTCACAA GACAAATGGC TGCTTGTAAA TATGCTATTT TGAGCAGGGA TAATGCTGAA CTAAATCGGT TGCAAAAACA AACAGCAGAA CAATGGCTTA GTCTTCCAGA TGAAAAATTA GCTGAGATGT ATGCTGGTTT GTTTGGAAAA ATACACAAAA TACTCCTCAG GAATAGCATT AATGATAAAA GTTTAACTAA GAAAGATGGA ATCCTTTTTA ATGAGATATT TATTTCTCAA GAATTAAATC GCCCCAAAGC TATCCAGAAT TTATTAGCAG CTATGCTGTT TGGTGATTTT AATCAATTAC TACTATCGTC TAACTTTTCA CAAGTACCTG AATGGCTGTT ATATGACTAT CTGCAATTTT TATTGTCGCC ACAAGGTTAT TTTAAAGCAT TGGGAGATTC AAAAAAATAT CATGAATACC TCGAAAAATG TACTTATTCC TTACATGAAT ATATTTTTAA GGAGTTAGGT TCATCTTCGT CTTATCAAAT CACTAATTAT TTTACTCAGA TTGCTAATTT TACCCATATT TATTTTAATG ACAATAATCT GAAGGATATA TATGTTAAAC GGGCAGAAAT AATAGAATGT TACCTGAAAC TCAATGGCAA TAAAATTGAT TATAAATTTG TAGAACGACC TGTAAATATC AAAAGAATTA GGCTTGGTAT ACTTGCATCG CATTTTAGAC CTTCAGCCGA AACATTTGCT TGTCTTCCTG TTTATGAACA TATTAGTCGA GATTTTGAGG TAATTTTGTA CTCACTTACA GAAACAAGTC ATCGACTAGA GCAATATTGT CAACGTTCTG CGAATTCTTT TAAACTGTTG CCACAGGAAT TATCTGCACA GGTAAGTACC ATTCGTGCTG ATGACTTAGA TATATTGTTC ATAGCTACCA ATGTCACCGC AGTAACCAAT CAAATATGCC TGTTAGCAAT TCATAGGTTA GCCAGAATAC AAGTTACTAG TGGTGCTTCA GTTGTGACAA CCGGAATGCG AAATATAGAT TATTATATTT CCGGCACATT AACTGATCCT TCACCAATAG CACAAGACCA TTATCAAGAA AAACTAATTA AACTAGAAGG AACTGCTCAC TGTTTTAGTT ACGGTACGGA AGAGGGAAAA TTAACAATTC TAGTCAAGAG GAATAGTTTA GGTATTCCTG AAAATGCTGT TGTTTTTATC TCTGGTGCTA ACTACTTCAA AATAGTTCCA GAATTGGTAG CAACCTGGGC AAACATCATT TCTAGAGTAC CAAATTCAGT TTTAGTGCTG TTACCATTTG GGCCAAATTG GTCAAATGCT TATCCAAAAG CAAATTTTAT CGATCACCTA AATTCTATAT TTTCTCAGCA TGGGTTAGCT ACTGAACGTT TAATAGTATT AGATATTCAA CCCATTCCAG ACCGGGAGGA CATGAAAGAA TACTATAAAA TTGCTGATGT TTACTTAGAT TCCTATCCAT TTGCAGGGAC GACTTCATTA ATAGAACCAT TACAGGTGAA TCTACCTGTC ATCGCTAGAC AAGGAAATTG CTTCCGTTCG GCAATGGGAG CAGCGATTAT ACAAACATTG AATATTCCTG ATTTAGTTGC AGATAGTGAA GAGTCCTATA TTGAATTAGC AGTTGCATTA GGTACTAATT CTGAACTGCG GCGACAGAAG AGTGACCAAA TTAGGGAAAA AATGCAGGAT AATCCTAGTT TTTTAGATAG TCGCTCTTAT GCAAGTAAAA TAGAAAGTCT ATTCAAAGAA CTTTTCAATA ATTATCTTGC AGATACACTA AGTCAAAATT TACGGTTAGA AGATATTAAC CTAATTATTT TTCCTGATTG GTCACAACCA GAGGAATTAA TAAGTTTAGA AGTGAAACAG GTAATTAAAA CAGTTGTAAC TAGTCCTAAT GGCGGAAAAA CTATGTTAAT GGTCAACATT ACTAATGTTG CTGTTGATCA TGTTGAACTG TTGTTATCGT CTATAACCAA TAATCTGCTG ACACAAGAGG GTTTAGATGT GACTGAGAGA TTAGAAATCG CTTTGGTAGA AAGTTTGGGT GATGTTCAAT GGAAGGCTTT ACTATCTCGC CTTCATGGAC GAGTTGTTTT GGAACATGAA AATCAAGATG CAATCAGACA AGCTAAAGCA GAAGCTTTGT TAACTTACGA ATTAGAAACC TTTACCCAGG TGCGAGAGGA ATAG
|
Protein sequence | MMKNIIANDE PLVSVCIPTY NGEFFIDLAL QSIDSQTYNN IELIISDDDS EDKIIEKINI FREKSKKKIY LFTHERLGLV NNWNFCISQT QGKYIKFLFQ DDILEPNAIR EMVTLAEQDE EIGLVFSPRK LFSVYKDVTY NPKSLEYHEA KDIHKYWSNL KRIQLGKELL EDPNILDAPI NKIGEPTTVL IKKEAFEKVG LFNPELCQIV DLEMWLRIMS RYKIGFIDQY LSQFRIHHQQ QTHRNASLKD VIFLDYQKLF YFIANDSRYP GFTRQMAACK YAILSRDNAE LNRLQKQTAE QWLSLPDEKL AEMYAGLFGK IHKILLRNSI NDKSLTKKDG ILFNEIFISQ ELNRPKAIQN LLAAMLFGDF NQLLLSSNFS QVPEWLLYDY LQFLLSPQGY FKALGDSKKY HEYLEKCTYS LHEYIFKELG SSSSYQITNY FTQIANFTHI YFNDNNLKDI YVKRAEIIEC YLKLNGNKID YKFVERPVNI KRIRLGILAS HFRPSAETFA CLPVYEHISR DFEVILYSLT ETSHRLEQYC QRSANSFKLL PQELSAQVST IRADDLDILF IATNVTAVTN QICLLAIHRL ARIQVTSGAS VVTTGMRNID YYISGTLTDP SPIAQDHYQE KLIKLEGTAH CFSYGTEEGK LTILVKRNSL GIPENAVVFI SGANYFKIVP ELVATWANII SRVPNSVLVL LPFGPNWSNA YPKANFIDHL NSIFSQHGLA TERLIVLDIQ PIPDREDMKE YYKIADVYLD SYPFAGTTSL IEPLQVNLPV IARQGNCFRS AMGAAIIQTL NIPDLVADSE ESYIELAVAL GTNSELRRQK SDQIREKMQD NPSFLDSRSY ASKIESLFKE LFNNYLADTL SQNLRLEDIN LIIFPDWSQP EELISLEVKQ VIKTVVTSPN GGKTMLMVNI TNVAVDHVEL LLSSITNNLL TQEGLDVTER LEIALVESLG DVQWKALLSR LHGRVVLEHE NQDAIRQAKA EALLTYELET FTQVREE
|
| |