Gene Ava_3348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3348 
Symbol 
ID3680224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4164993 
End bp4168217 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content36% 
IMG OID637718698 
Productglycosyl transferase family protein 
Protein accessionYP_323850 
Protein GI75909554 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0861977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0615574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATAT CTAAAGTTTT ATTAACAAAT CATCACCTAA AAAGCTACGC AGGTTCAGAA 
CTTGTCACTC TCGATTTAGC AATTGAATTT CAACAAAAAG GCTGGTCAGT AACTGTTGCA
ACTTTCTTGT TTGGTGGTGA TCTAGCAAGG CATTTTTATG CACGGGGTAT AGATGTAGTT
AATGTTTTAG AGAAACCTTT GACAGAAAAT GAATTTGATT TAGTATGGGG TCATCATTTC
CCCGTTTTAA TTAAATGCTT AATAGAAGAC TCTGTTAAAA CAAAATATTT AGTTTTAAGT
AGTTTATCTC CATATGAACC TTTGGAAGCA ATTCCTTTTT TCTATTCTAA GTCTGATTTA
ATACTGTGTA ATTCAGAAGA GACTAAGAAA GAAATCATAG AACAGAATCA TTTACAAGAA
ATTGATAAGA ATAAATTGTT CGTATTTAAT AATTCTGTTC CTGCAAATTG GTTTAATCTA
CCAGTAGATA TCAAGGAGAC AGAGTTAAGA AAGGTAGGAG TGATTTCTAA CCATCCACCA
ACAGAAGTTT TAACTGCTAT AGATATATTA AAATCTAAAA ATATAGATGT TGATTTAATA
GGGATATTAG AAAGCCCTCA ACTAGTAAAT ATTGATATTC TTAATTCATA CGATGCAATC
ATAACTATTG GGCGTACTGT CCAGCATTGT ATGGCTTTAG GAAAACCCGT GTTCTGCTAC
GATCATTTTG GTGGGCCTGG TTGGTTAACT CCAGATAACT TCAAACTTGC TGAGTGGTTT
AACTATTCAG GAAGATGCTG TTACCAAAAA ATGTCGGGTG AACAAATAGT AGAAAACTTA
ATTAATGGTT TTCTTGCAAA TAATCAACAC ATACATTTTT TTAAAAACTA CTCTTTGGAG
AATTATTCAC TAACAAGAAA TGTTGAGAAT GTTTTAAGTT GCATCAATAA CATTAATAAA
GATTACGTTG ACTTTAATTC TGAGCAAGTA ATTGGAAAGG TAGGAGAAGC TTACCGACGT
GTATTTACTG AAAATGGTTT CTTAAAGCTT GAACGGGAGC GATCGCAGTC TCAACTGCAA
CAAACCCAGA CAGAATTGGA GCGATCGCAG TCGCAACTAC AACAAACTCA GACAGAATTG
GAGCGATCGC AGTCGCAACT ACAACAAACT CAGACAGAAT TGGAGCGATC GCAGTCGCAA
CTACAACAAA CCCAGACAGA ACTGGAGCGA TCGCAGTCGC AACTGCAACA AACCCAGACT
GAGTTGACTT TGTCTCAGTC CCAGCTATAT ATAACTCAGA CAAAGTTAGA ACATTGGAAA
AACCTCGTCT CTTGGATGGA AGGGAGTAAG TTTTGGAAGT TACGAGCCTT ATCTTTGAGT
ATAAAGAAAG GAATAATAAA TTTACCTATC CTTAAGCTTG CCTTTAACAG GTTGCCTGTT
TTTCAGAAAA AAAACTATCC CATATTTATC AAACGTATAG GTAAGTGGCT CAAGCATCAG
ATCATGAACT TTCGTACTAG AAATGAATTC GCTGATGCAC TATTAGCAAA AGTTTTACAA
AATTTCTCTG GCTTTCCCGA ATATAGTCAA TGGATTAAAG ATTATGAAGC AAAAGATGAA
GAACTGACGC AACAAAAAAA GAATTCTTTA TTATTCCATT ATCAACCTGT ATTAAGTATA
GTTTTTCCCG TTTATAAACT ACCACTAACC GTCTTACAAG AGACAATCAA TAGTGTTATT
CAACAAACTT ACTCCAATTG GGAGCTATGT ATTGCTTTTG CAGATATAGA TAATTATCAA
ACCATTGATT ATTTAAAAAC CTTGAGTTTA CAGGAAAAAC GCATAAAACT TAAGGTAATG
GCGGAAAATA AGGGAATTTC TGGTAACTCT AATGTATCTC TAGATATGGC TTCGGGCGAA
TTTGTAGCTT TGCTAGATCA TGATGACTTG TTGGCACCTT TTGCATTTTA TGAAGTCATC
AGTGAATTGA ACAAGCAACC CGATTTAGAC TTTATTTACT CTGATAAAGA CTGTATTAGT
GCCAACAGCA TGGTAAGGTC AAGGCTCTTA CTCAAACCAG AGTGGAGTCC AGAAATCCTA
TATTCTGCTA ATTATTTAAC CCATCTGTGT ATTGCTCGTC GCACACTTTT GGAAAAGATT
GGTGGCTTTC GTCCAGAAAC AGATGGCGCG CAAGACTGGG ATTTATTTTT ACGTATCACA
GAAAATACAT CACGTATCGC TCGAATTAAT TCTGTGCTTT ACCATTGGCG GATTATACAA
GGTTCAACCT CCTTGGGAAT AGATTCTAAG CCATATGCAC TTGAAGGACA ACTGCGATCT
ATCCAAGATC ATCTTACTAG AACAAAATTG CCGGCCACAG TTTCACCACA CCCTGAATCT
GGTTTTCGCT TAGAATGGCA AGCCTCGCCT GCAACAGTAT CAATTTATAT TGATGGTGAT
GTACCTTGGG ATTCTCTATT AGCTTGCATT AATGCTGTTG CCCAATTTTC TGACCCCAAA
TTACACAAAG CAAAAATTAC TTTACCCGAA CACACATATA CTTCCAAAGC TACTGAAAGA
GAAAATCTGA TAAAAGCAAT CAGTTTACCT ATTGATTGGC TACCAATTAA AGAAGGAAAT
TCTAAATTAG TTACTTTAGC TAACAATATA AAACAAGATA AGACTGATGT AGTTGTGTTT
GTATCAGGTC TTGTTAAGAG GTTTACCGAA GGATGGATAC AAGAACTAAG CGGATGGGTG
CTTAACCATC CAGATATTGG ATTTGTGTCC TCACTTATTT TAACCGATAA CAATTTAGTC
GTAGAAGCTG GATTAGTTGT TGACAAATAT GATAACGGCT CTCCCTTAAT GCGAGGAAAT
TTCTTATATT CCTGGGATAT ATTTGGTGGG GCTTTATGGT ACAGAAATTG TAGCGCTAGT
TCTCCTTGGG CTATAGCTTT TAGTTATAAA AATTACTTAG AAGTCGGAGG ATTATCTTCT
AACTCTCCTT CTTTATCACA CGCCATGATT AAGCTTTGTC AAGCTATTCG CGCAAATAAT
AAAAGAGGCT TAGTTAACCC TCATGCTCGC GCATTTTTGC AGGATTTACC AAAAAATGAT
ATTCCGGAAT TTGACGATTC TCTAGGAAAT GATCCTTATT TTCATCCCGC CTTTGCTTCA
GTTGTTCCTT TAAAATTAAG AGTTAAAAAT GGTAAAAAAA ATTAA
 
Protein sequence
MGISKVLLTN HHLKSYAGSE LVTLDLAIEF QQKGWSVTVA TFLFGGDLAR HFYARGIDVV 
NVLEKPLTEN EFDLVWGHHF PVLIKCLIED SVKTKYLVLS SLSPYEPLEA IPFFYSKSDL
ILCNSEETKK EIIEQNHLQE IDKNKLFVFN NSVPANWFNL PVDIKETELR KVGVISNHPP
TEVLTAIDIL KSKNIDVDLI GILESPQLVN IDILNSYDAI ITIGRTVQHC MALGKPVFCY
DHFGGPGWLT PDNFKLAEWF NYSGRCCYQK MSGEQIVENL INGFLANNQH IHFFKNYSLE
NYSLTRNVEN VLSCINNINK DYVDFNSEQV IGKVGEAYRR VFTENGFLKL ERERSQSQLQ
QTQTELERSQ SQLQQTQTEL ERSQSQLQQT QTELERSQSQ LQQTQTELER SQSQLQQTQT
ELTLSQSQLY ITQTKLEHWK NLVSWMEGSK FWKLRALSLS IKKGIINLPI LKLAFNRLPV
FQKKNYPIFI KRIGKWLKHQ IMNFRTRNEF ADALLAKVLQ NFSGFPEYSQ WIKDYEAKDE
ELTQQKKNSL LFHYQPVLSI VFPVYKLPLT VLQETINSVI QQTYSNWELC IAFADIDNYQ
TIDYLKTLSL QEKRIKLKVM AENKGISGNS NVSLDMASGE FVALLDHDDL LAPFAFYEVI
SELNKQPDLD FIYSDKDCIS ANSMVRSRLL LKPEWSPEIL YSANYLTHLC IARRTLLEKI
GGFRPETDGA QDWDLFLRIT ENTSRIARIN SVLYHWRIIQ GSTSLGIDSK PYALEGQLRS
IQDHLTRTKL PATVSPHPES GFRLEWQASP ATVSIYIDGD VPWDSLLACI NAVAQFSDPK
LHKAKITLPE HTYTSKATER ENLIKAISLP IDWLPIKEGN SKLVTLANNI KQDKTDVVVF
VSGLVKRFTE GWIQELSGWV LNHPDIGFVS SLILTDNNLV VEAGLVVDKY DNGSPLMRGN
FLYSWDIFGG ALWYRNCSAS SPWAIAFSYK NYLEVGGLSS NSPSLSHAMI KLCQAIRANN
KRGLVNPHAR AFLQDLPKND IPEFDDSLGN DPYFHPAFAS VVPLKLRVKN GKKN