Gene Ava_0349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0349 
Symbol 
ID3682723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp448307 
End bp451438 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content34% 
IMG OID637715677 
Productglycosyl transferase, group 1 
Protein accessionYP_320870 
Protein GI75906574 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0126249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0342304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAA AAAATACTAA TTCTGCTTTA GATAATTTAA TTCCTCCTGA AATCAAAAAT 
GATGAATTTT ATGTAGCTAT TCAGGATATT GTTAAGAATG AAGAAATTAA AACAATTTTA
GAAATTGGTT CATCTTCGGG AGAAGGAAGT ACAGAGGCAT TTGTTACAGG AATCCGACAA
AATACGAATA ATCCCATCTT GTTTTGCATG GAAGTATCGA AGACAAGATT TAATGAGCTA
AAAAATAGGT ATAAAAATGA AAATTTTGTA AAAGTATATA ACACATCATC TGTTCCTATC
GAAAGCTTTC CTAATGAACA GGAAGTGATA GATTTTTATA AAAACACTAC TAACAATCTT
AAGCTTTATC CATTAGAAAG CGTTCTTAAC TGGCTGTATC AAGATATTGA ATATGTCAAA
GAATGTGGAT ACTCTGAGAA TGGAATTAAA ACAATTAAAA ATGAGAATAA CATTGATTAT
TTTGATTTAG TATTAATTGA CGGCTCAGAA TTTACAGGTA GTGCAGAATT AGATGAAGTT
TATGGAGCAA AATATATTCT TTTAGATGAT ATTAATACAT TTAAAAATCA TAATAACTTT
CATAAATTAT TAAAAGATCA TAATTATTCA ATCATTAAAT ACAACCAAGA AATACGTAAC
GGCTATGCTA TTTTTAGACG CAATAATAAG ATAGAATTAC CTATCCATTT TTTCACTATT
GTTCTCAACG GAGAACCCTT TATCCGTTAT CACATCGACA TTTTTAAACA GCTACCTTTT
AAATGGCATT GGCATATTGT TGAGGGTGTT GCCGACTTAA AACATGATAC TAGCTGGAGT
GTTAAGTTAG GTGGACATAT TAGCAATGAC TTTCATAAAA ATGGACGTAG TTGTGATGGC
ACTACAGAAT ATATAGATGA ACTACTGCAA CTTTATCCAG ACAATATTAC AGTTTACCGC
CAACCAGAGG GTACTTTCTG GGACGGAAAG CGAAATATGG TAAATGCACC ACTTACAAAT
ATTCAAGAAG AATGTTTGTT GTGGCAAGTG GATGTTGATG AACTATGGAC TTTAGAGCAG
ATTTGTACTG CTAGAGAAAT GTTCATTAGT AACCCAGATA AAACAGCTGC TTTCTATTGG
TGTTGGTACT TTGTTGGCGA AAATTTAATT ATTAGCACTC GTAACTGTTA CGCACATAAT
CCTCAGCAAG AGTGGTTAAG AACTTGGCGA TTTAAACCAG GATGTATTTG GGCTGCACAC
GAGCCACCTG TGTTAGTAGA ACCCTTAGCC AATGGTGAAT ATAAAAATTT AGCTACTGTT
AATCCTTTTC TGCATCCAGA AACAGAAGCA TATAATTTAG TATTCCAGCA TTTTGCTTAT
GTCACACCAG AACAATTAAG CTTCAAAGAA AAATACTATG GTTATAAAAA TGCTGTTGAA
CGATGGAGTA ATTTACAAGA AAACAGCAAA TTTCCTATTT TATTAAGAGA ATATTTTCCT
TGGGTTTACG ATGAAACTCA GGTAGATACT GTAAATCACT CTGGAATAGT TCCGATTGCT
CAAAGAGACG AGAACGATAA AAGCTGGCGA TTTTTACAAC CAGAGGAAGT ACAGCAGCAA
ATTAATAAAA TCAGTAAACC ATCACCGATG ATTCTCATTG ATGGGATATT TTTTCAACTT
TACCAAACTG GGATTGCTCG TGTTTGGAAA TCACTTTTGG AAGAATGGTC AAATAAAGAA
TTTGCTAAAC ATATTTTATT CATTGACCGG GCTGGAACTG CGCCTAAAGT ATCTGGAATT
AAGTATTTAA ATTTACCTCG TTATAACTAT AAAGACACCA ATCATGAACG AGAACTATTA
CAGCAAGTAT GTGATCAAGA GGGTGTAGAT TTATTTATTT CATCTTACTA CACAACGCCA
ATCACAACAC CTTCTGTATT CATGGCTTAT GACATGATTC CAGAAGTCAT GAAATGGGAT
GTGAGTAATC CCATGTGGCA AGATAAACAC CAAGCAATAG AACACGCATC TGCTTATATA
GCTATTTCTA AAAATACAGC ATTTGATTTA ACACAATGCT TTAATCAAAT ATCTTTAGAG
TCAGTTATTC TAGCCTATTG CGGTGTTAGT AGCACCTTTG CGCCATCCAC ATTGGATGAC
ATCAGTCTTT TCAAGACAAA GTATGGCATT ACCAAGCCTT ACTTCTTATT ACCTGGGGTT
GGTTCTGGCT ATAAGAATAG TATTTTATTC TTCCAAGCTT TTTCAGAACT TGTGAGTAGC
TATGGTTTTG ATATTGTGGT TACAGGTGGC GGAGGTGGAT TAGATGCTCA GTTTAGAAAC
TACACATTTG GTAGTGTGGT TCATAGTTTA CAACTGAGTG ACGAAGAGTT AGCAATAGCT
TACTCTGGTG CTGTGGCTTT AGTTTATCCT TCTAAATATG AAGGTTTTGG GATGCCTGTA
ATTGAAGCAA TGGCTTGTGG TTGTCCTGTG ATTACCTGTC CTAATGCTTC AATTCCAGAA
GTAGCTGGAG AAGCCGCAAT CTATGTCAAG GATGATGATA TAGATGAACT AGCAAATGCA
CTATGCGAAG TACAAAAACC TGCTATACGT CAATCATTAA TTACTGCTGG TTTAGCCCAA
GCGCAAAAAT TTTCTTGGTC AACAATGGCA GAAATTGTCA GTTCTACTTT AATTAATACA
ACTCTTTTAT CATTAAATTT AAGAGAAATT AATTTAATTA TTTTCCCAGA TTGGTCAGAG
TCGGAAGATT TAATTGGTTT AGAATTGACG CAGATAATTA AGACACTGGC AACTCATCCT
GATAGCGACA AAACTACTTT ATTGATTGAT ACTACTAATT TTTTAACTGA AGATGCTGAG
TTGTTGTTAT CTAGTGCGAC TATGAATCTT CTGATGGAAG AAGACTTAGA TATTACTGAT
GGAATAGAAA TTTCTTTGGT GGCAAATTTG TCTGATATTC AGTGGGAAGC TTTACTACCT
CGCATTCACG GCAGAATTAC TCTAGAACAT GAAAATCAAG AAGCACTGAA ACAAGTAAAA
GCAGAAAATC TCACATCTTA TGAGTTAACA AGTTTTAGCC AAGTATGCGA GGAAGAGTTT
TTTTTTACCT AA
 
Protein sequence
MNKKNTNSAL DNLIPPEIKN DEFYVAIQDI VKNEEIKTIL EIGSSSGEGS TEAFVTGIRQ 
NTNNPILFCM EVSKTRFNEL KNRYKNENFV KVYNTSSVPI ESFPNEQEVI DFYKNTTNNL
KLYPLESVLN WLYQDIEYVK ECGYSENGIK TIKNENNIDY FDLVLIDGSE FTGSAELDEV
YGAKYILLDD INTFKNHNNF HKLLKDHNYS IIKYNQEIRN GYAIFRRNNK IELPIHFFTI
VLNGEPFIRY HIDIFKQLPF KWHWHIVEGV ADLKHDTSWS VKLGGHISND FHKNGRSCDG
TTEYIDELLQ LYPDNITVYR QPEGTFWDGK RNMVNAPLTN IQEECLLWQV DVDELWTLEQ
ICTAREMFIS NPDKTAAFYW CWYFVGENLI ISTRNCYAHN PQQEWLRTWR FKPGCIWAAH
EPPVLVEPLA NGEYKNLATV NPFLHPETEA YNLVFQHFAY VTPEQLSFKE KYYGYKNAVE
RWSNLQENSK FPILLREYFP WVYDETQVDT VNHSGIVPIA QRDENDKSWR FLQPEEVQQQ
INKISKPSPM ILIDGIFFQL YQTGIARVWK SLLEEWSNKE FAKHILFIDR AGTAPKVSGI
KYLNLPRYNY KDTNHERELL QQVCDQEGVD LFISSYYTTP ITTPSVFMAY DMIPEVMKWD
VSNPMWQDKH QAIEHASAYI AISKNTAFDL TQCFNQISLE SVILAYCGVS STFAPSTLDD
ISLFKTKYGI TKPYFLLPGV GSGYKNSILF FQAFSELVSS YGFDIVVTGG GGGLDAQFRN
YTFGSVVHSL QLSDEELAIA YSGAVALVYP SKYEGFGMPV IEAMACGCPV ITCPNASIPE
VAGEAAIYVK DDDIDELANA LCEVQKPAIR QSLITAGLAQ AQKFSWSTMA EIVSSTLINT
TLLSLNLREI NLIIFPDWSE SEDLIGLELT QIIKTLATHP DSDKTTLLID TTNFLTEDAE
LLLSSATMNL LMEEDLDITD GIEISLVANL SDIQWEALLP RIHGRITLEH ENQEALKQVK
AENLTSYELT SFSQVCEEEF FFT