Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3348 |
Symbol | |
ID | 3680224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4164993 |
End bp | 4168217 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637718698 |
Product | glycosyl transferase family protein |
Protein accession | YP_323850 |
Protein GI | 75909554 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0861977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0615574 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTATAT CTAAAGTTTT ATTAACAAAT CATCACCTAA AAAGCTACGC AGGTTCAGAA CTTGTCACTC TCGATTTAGC AATTGAATTT CAACAAAAAG GCTGGTCAGT AACTGTTGCA ACTTTCTTGT TTGGTGGTGA TCTAGCAAGG CATTTTTATG CACGGGGTAT AGATGTAGTT AATGTTTTAG AGAAACCTTT GACAGAAAAT GAATTTGATT TAGTATGGGG TCATCATTTC CCCGTTTTAA TTAAATGCTT AATAGAAGAC TCTGTTAAAA CAAAATATTT AGTTTTAAGT AGTTTATCTC CATATGAACC TTTGGAAGCA ATTCCTTTTT TCTATTCTAA GTCTGATTTA ATACTGTGTA ATTCAGAAGA GACTAAGAAA GAAATCATAG AACAGAATCA TTTACAAGAA ATTGATAAGA ATAAATTGTT CGTATTTAAT AATTCTGTTC CTGCAAATTG GTTTAATCTA CCAGTAGATA TCAAGGAGAC AGAGTTAAGA AAGGTAGGAG TGATTTCTAA CCATCCACCA ACAGAAGTTT TAACTGCTAT AGATATATTA AAATCTAAAA ATATAGATGT TGATTTAATA GGGATATTAG AAAGCCCTCA ACTAGTAAAT ATTGATATTC TTAATTCATA CGATGCAATC ATAACTATTG GGCGTACTGT CCAGCATTGT ATGGCTTTAG GAAAACCCGT GTTCTGCTAC GATCATTTTG GTGGGCCTGG TTGGTTAACT CCAGATAACT TCAAACTTGC TGAGTGGTTT AACTATTCAG GAAGATGCTG TTACCAAAAA ATGTCGGGTG AACAAATAGT AGAAAACTTA ATTAATGGTT TTCTTGCAAA TAATCAACAC ATACATTTTT TTAAAAACTA CTCTTTGGAG AATTATTCAC TAACAAGAAA TGTTGAGAAT GTTTTAAGTT GCATCAATAA CATTAATAAA GATTACGTTG ACTTTAATTC TGAGCAAGTA ATTGGAAAGG TAGGAGAAGC TTACCGACGT GTATTTACTG AAAATGGTTT CTTAAAGCTT GAACGGGAGC GATCGCAGTC TCAACTGCAA CAAACCCAGA CAGAATTGGA GCGATCGCAG TCGCAACTAC AACAAACTCA GACAGAATTG GAGCGATCGC AGTCGCAACT ACAACAAACT CAGACAGAAT TGGAGCGATC GCAGTCGCAA CTACAACAAA CCCAGACAGA ACTGGAGCGA TCGCAGTCGC AACTGCAACA AACCCAGACT GAGTTGACTT TGTCTCAGTC CCAGCTATAT ATAACTCAGA CAAAGTTAGA ACATTGGAAA AACCTCGTCT CTTGGATGGA AGGGAGTAAG TTTTGGAAGT TACGAGCCTT ATCTTTGAGT ATAAAGAAAG GAATAATAAA TTTACCTATC CTTAAGCTTG CCTTTAACAG GTTGCCTGTT TTTCAGAAAA AAAACTATCC CATATTTATC AAACGTATAG GTAAGTGGCT CAAGCATCAG ATCATGAACT TTCGTACTAG AAATGAATTC GCTGATGCAC TATTAGCAAA AGTTTTACAA AATTTCTCTG GCTTTCCCGA ATATAGTCAA TGGATTAAAG ATTATGAAGC AAAAGATGAA GAACTGACGC AACAAAAAAA GAATTCTTTA TTATTCCATT ATCAACCTGT ATTAAGTATA GTTTTTCCCG TTTATAAACT ACCACTAACC GTCTTACAAG AGACAATCAA TAGTGTTATT CAACAAACTT ACTCCAATTG GGAGCTATGT ATTGCTTTTG CAGATATAGA TAATTATCAA ACCATTGATT ATTTAAAAAC CTTGAGTTTA CAGGAAAAAC GCATAAAACT TAAGGTAATG GCGGAAAATA AGGGAATTTC TGGTAACTCT AATGTATCTC TAGATATGGC TTCGGGCGAA TTTGTAGCTT TGCTAGATCA TGATGACTTG TTGGCACCTT TTGCATTTTA TGAAGTCATC AGTGAATTGA ACAAGCAACC CGATTTAGAC TTTATTTACT CTGATAAAGA CTGTATTAGT GCCAACAGCA TGGTAAGGTC AAGGCTCTTA CTCAAACCAG AGTGGAGTCC AGAAATCCTA TATTCTGCTA ATTATTTAAC CCATCTGTGT ATTGCTCGTC GCACACTTTT GGAAAAGATT GGTGGCTTTC GTCCAGAAAC AGATGGCGCG CAAGACTGGG ATTTATTTTT ACGTATCACA GAAAATACAT CACGTATCGC TCGAATTAAT TCTGTGCTTT ACCATTGGCG GATTATACAA GGTTCAACCT CCTTGGGAAT AGATTCTAAG CCATATGCAC TTGAAGGACA ACTGCGATCT ATCCAAGATC ATCTTACTAG AACAAAATTG CCGGCCACAG TTTCACCACA CCCTGAATCT GGTTTTCGCT TAGAATGGCA AGCCTCGCCT GCAACAGTAT CAATTTATAT TGATGGTGAT GTACCTTGGG ATTCTCTATT AGCTTGCATT AATGCTGTTG CCCAATTTTC TGACCCCAAA TTACACAAAG CAAAAATTAC TTTACCCGAA CACACATATA CTTCCAAAGC TACTGAAAGA GAAAATCTGA TAAAAGCAAT CAGTTTACCT ATTGATTGGC TACCAATTAA AGAAGGAAAT TCTAAATTAG TTACTTTAGC TAACAATATA AAACAAGATA AGACTGATGT AGTTGTGTTT GTATCAGGTC TTGTTAAGAG GTTTACCGAA GGATGGATAC AAGAACTAAG CGGATGGGTG CTTAACCATC CAGATATTGG ATTTGTGTCC TCACTTATTT TAACCGATAA CAATTTAGTC GTAGAAGCTG GATTAGTTGT TGACAAATAT GATAACGGCT CTCCCTTAAT GCGAGGAAAT TTCTTATATT CCTGGGATAT ATTTGGTGGG GCTTTATGGT ACAGAAATTG TAGCGCTAGT TCTCCTTGGG CTATAGCTTT TAGTTATAAA AATTACTTAG AAGTCGGAGG ATTATCTTCT AACTCTCCTT CTTTATCACA CGCCATGATT AAGCTTTGTC AAGCTATTCG CGCAAATAAT AAAAGAGGCT TAGTTAACCC TCATGCTCGC GCATTTTTGC AGGATTTACC AAAAAATGAT ATTCCGGAAT TTGACGATTC TCTAGGAAAT GATCCTTATT TTCATCCCGC CTTTGCTTCA GTTGTTCCTT TAAAATTAAG AGTTAAAAAT GGTAAAAAAA ATTAA
|
Protein sequence | MGISKVLLTN HHLKSYAGSE LVTLDLAIEF QQKGWSVTVA TFLFGGDLAR HFYARGIDVV NVLEKPLTEN EFDLVWGHHF PVLIKCLIED SVKTKYLVLS SLSPYEPLEA IPFFYSKSDL ILCNSEETKK EIIEQNHLQE IDKNKLFVFN NSVPANWFNL PVDIKETELR KVGVISNHPP TEVLTAIDIL KSKNIDVDLI GILESPQLVN IDILNSYDAI ITIGRTVQHC MALGKPVFCY DHFGGPGWLT PDNFKLAEWF NYSGRCCYQK MSGEQIVENL INGFLANNQH IHFFKNYSLE NYSLTRNVEN VLSCINNINK DYVDFNSEQV IGKVGEAYRR VFTENGFLKL ERERSQSQLQ QTQTELERSQ SQLQQTQTEL ERSQSQLQQT QTELERSQSQ LQQTQTELER SQSQLQQTQT ELTLSQSQLY ITQTKLEHWK NLVSWMEGSK FWKLRALSLS IKKGIINLPI LKLAFNRLPV FQKKNYPIFI KRIGKWLKHQ IMNFRTRNEF ADALLAKVLQ NFSGFPEYSQ WIKDYEAKDE ELTQQKKNSL LFHYQPVLSI VFPVYKLPLT VLQETINSVI QQTYSNWELC IAFADIDNYQ TIDYLKTLSL QEKRIKLKVM AENKGISGNS NVSLDMASGE FVALLDHDDL LAPFAFYEVI SELNKQPDLD FIYSDKDCIS ANSMVRSRLL LKPEWSPEIL YSANYLTHLC IARRTLLEKI GGFRPETDGA QDWDLFLRIT ENTSRIARIN SVLYHWRIIQ GSTSLGIDSK PYALEGQLRS IQDHLTRTKL PATVSPHPES GFRLEWQASP ATVSIYIDGD VPWDSLLACI NAVAQFSDPK LHKAKITLPE HTYTSKATER ENLIKAISLP IDWLPIKEGN SKLVTLANNI KQDKTDVVVF VSGLVKRFTE GWIQELSGWV LNHPDIGFVS SLILTDNNLV VEAGLVVDKY DNGSPLMRGN FLYSWDIFGG ALWYRNCSAS SPWAIAFSYK NYLEVGGLSS NSPSLSHAMI KLCQAIRANN KRGLVNPHAR AFLQDLPKND IPEFDDSLGN DPYFHPAFAS VVPLKLRVKN GKKN
|
| |