Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0349 |
Symbol | |
ID | 3682723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 448307 |
End bp | 451438 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637715677 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_320870 |
Protein GI | 75906574 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0126249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0342304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA AAAATACTAA TTCTGCTTTA GATAATTTAA TTCCTCCTGA AATCAAAAAT GATGAATTTT ATGTAGCTAT TCAGGATATT GTTAAGAATG AAGAAATTAA AACAATTTTA GAAATTGGTT CATCTTCGGG AGAAGGAAGT ACAGAGGCAT TTGTTACAGG AATCCGACAA AATACGAATA ATCCCATCTT GTTTTGCATG GAAGTATCGA AGACAAGATT TAATGAGCTA AAAAATAGGT ATAAAAATGA AAATTTTGTA AAAGTATATA ACACATCATC TGTTCCTATC GAAAGCTTTC CTAATGAACA GGAAGTGATA GATTTTTATA AAAACACTAC TAACAATCTT AAGCTTTATC CATTAGAAAG CGTTCTTAAC TGGCTGTATC AAGATATTGA ATATGTCAAA GAATGTGGAT ACTCTGAGAA TGGAATTAAA ACAATTAAAA ATGAGAATAA CATTGATTAT TTTGATTTAG TATTAATTGA CGGCTCAGAA TTTACAGGTA GTGCAGAATT AGATGAAGTT TATGGAGCAA AATATATTCT TTTAGATGAT ATTAATACAT TTAAAAATCA TAATAACTTT CATAAATTAT TAAAAGATCA TAATTATTCA ATCATTAAAT ACAACCAAGA AATACGTAAC GGCTATGCTA TTTTTAGACG CAATAATAAG ATAGAATTAC CTATCCATTT TTTCACTATT GTTCTCAACG GAGAACCCTT TATCCGTTAT CACATCGACA TTTTTAAACA GCTACCTTTT AAATGGCATT GGCATATTGT TGAGGGTGTT GCCGACTTAA AACATGATAC TAGCTGGAGT GTTAAGTTAG GTGGACATAT TAGCAATGAC TTTCATAAAA ATGGACGTAG TTGTGATGGC ACTACAGAAT ATATAGATGA ACTACTGCAA CTTTATCCAG ACAATATTAC AGTTTACCGC CAACCAGAGG GTACTTTCTG GGACGGAAAG CGAAATATGG TAAATGCACC ACTTACAAAT ATTCAAGAAG AATGTTTGTT GTGGCAAGTG GATGTTGATG AACTATGGAC TTTAGAGCAG ATTTGTACTG CTAGAGAAAT GTTCATTAGT AACCCAGATA AAACAGCTGC TTTCTATTGG TGTTGGTACT TTGTTGGCGA AAATTTAATT ATTAGCACTC GTAACTGTTA CGCACATAAT CCTCAGCAAG AGTGGTTAAG AACTTGGCGA TTTAAACCAG GATGTATTTG GGCTGCACAC GAGCCACCTG TGTTAGTAGA ACCCTTAGCC AATGGTGAAT ATAAAAATTT AGCTACTGTT AATCCTTTTC TGCATCCAGA AACAGAAGCA TATAATTTAG TATTCCAGCA TTTTGCTTAT GTCACACCAG AACAATTAAG CTTCAAAGAA AAATACTATG GTTATAAAAA TGCTGTTGAA CGATGGAGTA ATTTACAAGA AAACAGCAAA TTTCCTATTT TATTAAGAGA ATATTTTCCT TGGGTTTACG ATGAAACTCA GGTAGATACT GTAAATCACT CTGGAATAGT TCCGATTGCT CAAAGAGACG AGAACGATAA AAGCTGGCGA TTTTTACAAC CAGAGGAAGT ACAGCAGCAA ATTAATAAAA TCAGTAAACC ATCACCGATG ATTCTCATTG ATGGGATATT TTTTCAACTT TACCAAACTG GGATTGCTCG TGTTTGGAAA TCACTTTTGG AAGAATGGTC AAATAAAGAA TTTGCTAAAC ATATTTTATT CATTGACCGG GCTGGAACTG CGCCTAAAGT ATCTGGAATT AAGTATTTAA ATTTACCTCG TTATAACTAT AAAGACACCA ATCATGAACG AGAACTATTA CAGCAAGTAT GTGATCAAGA GGGTGTAGAT TTATTTATTT CATCTTACTA CACAACGCCA ATCACAACAC CTTCTGTATT CATGGCTTAT GACATGATTC CAGAAGTCAT GAAATGGGAT GTGAGTAATC CCATGTGGCA AGATAAACAC CAAGCAATAG AACACGCATC TGCTTATATA GCTATTTCTA AAAATACAGC ATTTGATTTA ACACAATGCT TTAATCAAAT ATCTTTAGAG TCAGTTATTC TAGCCTATTG CGGTGTTAGT AGCACCTTTG CGCCATCCAC ATTGGATGAC ATCAGTCTTT TCAAGACAAA GTATGGCATT ACCAAGCCTT ACTTCTTATT ACCTGGGGTT GGTTCTGGCT ATAAGAATAG TATTTTATTC TTCCAAGCTT TTTCAGAACT TGTGAGTAGC TATGGTTTTG ATATTGTGGT TACAGGTGGC GGAGGTGGAT TAGATGCTCA GTTTAGAAAC TACACATTTG GTAGTGTGGT TCATAGTTTA CAACTGAGTG ACGAAGAGTT AGCAATAGCT TACTCTGGTG CTGTGGCTTT AGTTTATCCT TCTAAATATG AAGGTTTTGG GATGCCTGTA ATTGAAGCAA TGGCTTGTGG TTGTCCTGTG ATTACCTGTC CTAATGCTTC AATTCCAGAA GTAGCTGGAG AAGCCGCAAT CTATGTCAAG GATGATGATA TAGATGAACT AGCAAATGCA CTATGCGAAG TACAAAAACC TGCTATACGT CAATCATTAA TTACTGCTGG TTTAGCCCAA GCGCAAAAAT TTTCTTGGTC AACAATGGCA GAAATTGTCA GTTCTACTTT AATTAATACA ACTCTTTTAT CATTAAATTT AAGAGAAATT AATTTAATTA TTTTCCCAGA TTGGTCAGAG TCGGAAGATT TAATTGGTTT AGAATTGACG CAGATAATTA AGACACTGGC AACTCATCCT GATAGCGACA AAACTACTTT ATTGATTGAT ACTACTAATT TTTTAACTGA AGATGCTGAG TTGTTGTTAT CTAGTGCGAC TATGAATCTT CTGATGGAAG AAGACTTAGA TATTACTGAT GGAATAGAAA TTTCTTTGGT GGCAAATTTG TCTGATATTC AGTGGGAAGC TTTACTACCT CGCATTCACG GCAGAATTAC TCTAGAACAT GAAAATCAAG AAGCACTGAA ACAAGTAAAA GCAGAAAATC TCACATCTTA TGAGTTAACA AGTTTTAGCC AAGTATGCGA GGAAGAGTTT TTTTTTACCT AA
|
Protein sequence | MNKKNTNSAL DNLIPPEIKN DEFYVAIQDI VKNEEIKTIL EIGSSSGEGS TEAFVTGIRQ NTNNPILFCM EVSKTRFNEL KNRYKNENFV KVYNTSSVPI ESFPNEQEVI DFYKNTTNNL KLYPLESVLN WLYQDIEYVK ECGYSENGIK TIKNENNIDY FDLVLIDGSE FTGSAELDEV YGAKYILLDD INTFKNHNNF HKLLKDHNYS IIKYNQEIRN GYAIFRRNNK IELPIHFFTI VLNGEPFIRY HIDIFKQLPF KWHWHIVEGV ADLKHDTSWS VKLGGHISND FHKNGRSCDG TTEYIDELLQ LYPDNITVYR QPEGTFWDGK RNMVNAPLTN IQEECLLWQV DVDELWTLEQ ICTAREMFIS NPDKTAAFYW CWYFVGENLI ISTRNCYAHN PQQEWLRTWR FKPGCIWAAH EPPVLVEPLA NGEYKNLATV NPFLHPETEA YNLVFQHFAY VTPEQLSFKE KYYGYKNAVE RWSNLQENSK FPILLREYFP WVYDETQVDT VNHSGIVPIA QRDENDKSWR FLQPEEVQQQ INKISKPSPM ILIDGIFFQL YQTGIARVWK SLLEEWSNKE FAKHILFIDR AGTAPKVSGI KYLNLPRYNY KDTNHERELL QQVCDQEGVD LFISSYYTTP ITTPSVFMAY DMIPEVMKWD VSNPMWQDKH QAIEHASAYI AISKNTAFDL TQCFNQISLE SVILAYCGVS STFAPSTLDD ISLFKTKYGI TKPYFLLPGV GSGYKNSILF FQAFSELVSS YGFDIVVTGG GGGLDAQFRN YTFGSVVHSL QLSDEELAIA YSGAVALVYP SKYEGFGMPV IEAMACGCPV ITCPNASIPE VAGEAAIYVK DDDIDELANA LCEVQKPAIR QSLITAGLAQ AQKFSWSTMA EIVSSTLINT TLLSLNLREI NLIIFPDWSE SEDLIGLELT QIIKTLATHP DSDKTTLLID TTNFLTEDAE LLLSSATMNL LMEEDLDITD GIEISLVANL SDIQWEALLP RIHGRITLEH ENQEALKQVK AENLTSYELT SFSQVCEEEF FFT
|
| |