Gene Ava_2388 details


Gene Information

Locus tag: Ava_2388
Symbol:
ID: 3683231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 2965749
End bp: 2968772
Gene Length: 3024 bp
Protein Length: 1007 aa
Translation table: 11
GC content: 33%
IMG OID: 637717734
Product: glycosyl transferase family protein
Protein accession: YP_322901
Protein GI: 75908605
COG category: [M] Cell wall/membrane/envelope biogenesis
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
        [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
TIGRFAM ID:
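
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this); the variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2965749
end_bp = 2968772
gene_length_bp = 3024
protein_length_aa = 1007

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1
codons = span // 3

# The span should match the reported gene length and be a whole number of codons.
assert span == gene_length_bp
assert span % 3 == 0
# One codon is the stop codon, so the protein is one residue shorter than the codon count.
assert codons - 1 == protein_length_aa

print(span, codons)
```

Under that assumption the fields agree: 3024 bp is 1008 codons, which is 1007 amino acids plus one stop codon.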


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.257411
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000141924
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGAAAA ATATCATTGC TAATGATGAG CCATTGGTTA GTGTTTGTAT TCCGACATAT 
AATGGTGAAT TTTTTATTGA TCTGGCGCTT CAAAGTATTG ATTCACAAAC ATATAATAAT
ATAGAACTTA TCATTTCAGA TGATGACTCA GAAGATAAAA TAATAGAAAA GATTAATATT
TTTCGAGAAA AATCAAAAAA AAAGATTTAT TTATTCACCC ATGAAAGATT AGGTTTGGTT
AACAACTGGA ATTTCTGCAT CTCTCAGACT CAAGGTAAAT ACATTAAGTT TTTATTTCAA
GACGACATTT TAGAACCAAA TGCCATTAGA GAAATGGTTA CCTTAGCAGA ACAAGATGAA
GAGATAGGTT TAGTATTCTC ACCACGTAAA CTATTTTCTG TTTATAAAGA TGTTACCTAC
AATCCAAAGT CTTTAGAATA TCATGAAGCC AAGGATATAC ATAAATATTG GTCAAACTTA
AAAAGAATTC AATTAGGCAA AGAGCTTCTA GAAGACCCAA ATATACTTGA TGCTCCTATT
AATAAGATTG GTGAACCAAC TACTGTTTTA ATCAAAAAAG AAGCTTTTGA GAAAGTAGGC
TTATTTAATC CCGAACTGTG TCAGATTGTA GATTTAGAAA TGTGGCTCAG AATTATGAGC
CGATACAAAA TCGGATTTAT TGATCAATAT TTATCACAAT TTCGCATTCA TCACCAACAA
CAAACCCACC GGAATGCATC TTTAAAAGAT GTGATTTTCT TAGATTATCA AAAACTTTTT
TATTTTATTG CTAATGATAG CCGTTATCCT GGCTTCACAA GACAAATGGC TGCTTGTAAA
TATGCTATTT TGAGCAGGGA TAATGCTGAA CTAAATCGGT TGCAAAAACA AACAGCAGAA
CAATGGCTTA GTCTTCCAGA TGAAAAATTA GCTGAGATGT ATGCTGGTTT GTTTGGAAAA
ATACACAAAA TACTCCTCAG GAATAGCATT AATGATAAAA GTTTAACTAA GAAAGATGGA
ATCCTTTTTA ATGAGATATT TATTTCTCAA GAATTAAATC GCCCCAAAGC TATCCAGAAT
TTATTAGCAG CTATGCTGTT TGGTGATTTT AATCAATTAC TACTATCGTC TAACTTTTCA
CAAGTACCTG AATGGCTGTT ATATGACTAT CTGCAATTTT TATTGTCGCC ACAAGGTTAT
TTTAAAGCAT TGGGAGATTC AAAAAAATAT CATGAATACC TCGAAAAATG TACTTATTCC
TTACATGAAT ATATTTTTAA GGAGTTAGGT TCATCTTCGT CTTATCAAAT CACTAATTAT
TTTACTCAGA TTGCTAATTT TACCCATATT TATTTTAATG ACAATAATCT GAAGGATATA
TATGTTAAAC GGGCAGAAAT AATAGAATGT TACCTGAAAC TCAATGGCAA TAAAATTGAT
TATAAATTTG TAGAACGACC TGTAAATATC AAAAGAATTA GGCTTGGTAT ACTTGCATCG
CATTTTAGAC CTTCAGCCGA AACATTTGCT TGTCTTCCTG TTTATGAACA TATTAGTCGA
GATTTTGAGG TAATTTTGTA CTCACTTACA GAAACAAGTC ATCGACTAGA GCAATATTGT
CAACGTTCTG CGAATTCTTT TAAACTGTTG CCACAGGAAT TATCTGCACA GGTAAGTACC
ATTCGTGCTG ATGACTTAGA TATATTGTTC ATAGCTACCA ATGTCACCGC AGTAACCAAT
CAAATATGCC TGTTAGCAAT TCATAGGTTA GCCAGAATAC AAGTTACTAG TGGTGCTTCA
GTTGTGACAA CCGGAATGCG AAATATAGAT TATTATATTT CCGGCACATT AACTGATCCT
TCACCAATAG CACAAGACCA TTATCAAGAA AAACTAATTA AACTAGAAGG AACTGCTCAC
TGTTTTAGTT ACGGTACGGA AGAGGGAAAA TTAACAATTC TAGTCAAGAG GAATAGTTTA
GGTATTCCTG AAAATGCTGT TGTTTTTATC TCTGGTGCTA ACTACTTCAA AATAGTTCCA
GAATTGGTAG CAACCTGGGC AAACATCATT TCTAGAGTAC CAAATTCAGT TTTAGTGCTG
TTACCATTTG GGCCAAATTG GTCAAATGCT TATCCAAAAG CAAATTTTAT CGATCACCTA
AATTCTATAT TTTCTCAGCA TGGGTTAGCT ACTGAACGTT TAATAGTATT AGATATTCAA
CCCATTCCAG ACCGGGAGGA CATGAAAGAA TACTATAAAA TTGCTGATGT TTACTTAGAT
TCCTATCCAT TTGCAGGGAC GACTTCATTA ATAGAACCAT TACAGGTGAA TCTACCTGTC
ATCGCTAGAC AAGGAAATTG CTTCCGTTCG GCAATGGGAG CAGCGATTAT ACAAACATTG
AATATTCCTG ATTTAGTTGC AGATAGTGAA GAGTCCTATA TTGAATTAGC AGTTGCATTA
GGTACTAATT CTGAACTGCG GCGACAGAAG AGTGACCAAA TTAGGGAAAA AATGCAGGAT
AATCCTAGTT TTTTAGATAG TCGCTCTTAT GCAAGTAAAA TAGAAAGTCT ATTCAAAGAA
CTTTTCAATA ATTATCTTGC AGATACACTA AGTCAAAATT TACGGTTAGA AGATATTAAC
CTAATTATTT TTCCTGATTG GTCACAACCA GAGGAATTAA TAAGTTTAGA AGTGAAACAG
GTAATTAAAA CAGTTGTAAC TAGTCCTAAT GGCGGAAAAA CTATGTTAAT GGTCAACATT
ACTAATGTTG CTGTTGATCA TGTTGAACTG TTGTTATCGT CTATAACCAA TAATCTGCTG
ACACAAGAGG GTTTAGATGT GACTGAGAGA TTAGAAATCG CTTTGGTAGA AAGTTTGGGT
GATGTTCAAT GGAAGGCTTT ACTATCTCGC CTTCATGGAC GAGTTGTTTT GGAACATGAA
AATCAAGATG CAATCAGACA AGCTAAAGCA GAAGCTTTGT TAACTTACGA ATTAGAAACC
TTTACCCAGG TGCGAGAGGA ATAG
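
The GC content field reported for this gene can in principle be recomputed from the gene sequence. The sketch below is one possible way to do that, assuming GC content means the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper, not a function of the source database, and only the first row of the sequence is used here for brevity.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence above, with its 10-base spacing left in place.
first_row = "ATGATGAAAA ATATCATTGC TAATGATGAG CCATTGGTTA GTGTTTGTAT TCCGACATAT"
print(f"{gc_fraction(first_row):.2%}")
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the figure to compare against the reported GC content.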
 
Protein sequence
MMKNIIANDE PLVSVCIPTY NGEFFIDLAL QSIDSQTYNN IELIISDDDS EDKIIEKINI 
FREKSKKKIY LFTHERLGLV NNWNFCISQT QGKYIKFLFQ DDILEPNAIR EMVTLAEQDE
EIGLVFSPRK LFSVYKDVTY NPKSLEYHEA KDIHKYWSNL KRIQLGKELL EDPNILDAPI
NKIGEPTTVL IKKEAFEKVG LFNPELCQIV DLEMWLRIMS RYKIGFIDQY LSQFRIHHQQ
QTHRNASLKD VIFLDYQKLF YFIANDSRYP GFTRQMAACK YAILSRDNAE LNRLQKQTAE
QWLSLPDEKL AEMYAGLFGK IHKILLRNSI NDKSLTKKDG ILFNEIFISQ ELNRPKAIQN
LLAAMLFGDF NQLLLSSNFS QVPEWLLYDY LQFLLSPQGY FKALGDSKKY HEYLEKCTYS
LHEYIFKELG SSSSYQITNY FTQIANFTHI YFNDNNLKDI YVKRAEIIEC YLKLNGNKID
YKFVERPVNI KRIRLGILAS HFRPSAETFA CLPVYEHISR DFEVILYSLT ETSHRLEQYC
QRSANSFKLL PQELSAQVST IRADDLDILF IATNVTAVTN QICLLAIHRL ARIQVTSGAS
VVTTGMRNID YYISGTLTDP SPIAQDHYQE KLIKLEGTAH CFSYGTEEGK LTILVKRNSL
GIPENAVVFI SGANYFKIVP ELVATWANII SRVPNSVLVL LPFGPNWSNA YPKANFIDHL
NSIFSQHGLA TERLIVLDIQ PIPDREDMKE YYKIADVYLD SYPFAGTTSL IEPLQVNLPV
IARQGNCFRS AMGAAIIQTL NIPDLVADSE ESYIELAVAL GTNSELRRQK SDQIREKMQD
NPSFLDSRSY ASKIESLFKE LFNNYLADTL SQNLRLEDIN LIIFPDWSQP EELISLEVKQ
VIKTVVTSPN GGKTMLMVNI TNVAVDHVEL LLSSITNNLL TQEGLDVTER LEIALVESLG
DVQWKALLSR LHGRVVLEHE NQDAIRQAKA EALLTYELET FTQVREE