Gene Ava_5045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_5045 
Symbol 
ID3678901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6335812 
End bp6337059 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content42% 
IMG OID637720405 
Productglycosyl transferase, group 1 
Protein accessionYP_325537 
Protein GI75911241 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.462239 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACATA TCTCGCAACT GGCTACAAAA ATCAGAGACA CGACTGCATC CCCAAATATC 
TTAGTAATAT CGCGTTTTTT CTTGCCTAAA GAAGCTGTAA TTGGGGAATA TCTCTACAAT
CGCTGTCTTC AAGATCCAGA ACAAGTAATA GTTCTGGCAG CTAGTTGTAG AGGTGATAAA
GTGTTTGACC AAGCACAAAA ATTTCCTGTA TATCGCTGGC CTAACCCTAA ATACTGGCTT
GGTGGTTTTT TAGGAAGTTG TCTACAACCT TTGTTTAATT TGGTTGCACC ATTTGTTCTG
GCGCTCAAAC TTTATTTTCG TTATCACTAC CGCTATATTG AATGGGGACA TGGCTATAAC
TTCCCATCAC TCCTATTATT AAGCTATATC TTACCCATAC GCTTCTTTAT TTACTTACAC
GGTAATGACC TTGGTAGCGT TGTAGACAAT CCTGTATGGC GATCGCTCTT TAAGTTAACA
CTGTCACGAG CTGAAGGCAT TGTGTGTAAC AATTCCTTAA CCCAAGATTA TCTGAGAAAT
ACCTTCCGGT TACAAACCCC TACCCACGTT ATCCATCCAG TAGTTAGACC AGAAAAATTT
GGACTGGGAA GCAATAGCCA GAGTCTAGAT GAATTAGGTG ATGGCATTCT ACCCACTTTA
GGCGATCGCC TGCGTCAAGC TTATAATATC CCCCAAAGTG CCATAGTCAT TCTTTCCGTA
GGGCGCTTAG TCAAACAAAA AGGCTTTGAC CGTGTTATTG AAAACCTACC ACTACTTTTA
ACAATTGGTG TAGATGTTCA TTACATCATT TGTGGTCAAG GTCCTTGTGA GTCTGAACTA
AAAGCCTTAG CAGAGAGGTT GCGTGTAGAT AAGCGGGTAC ACTTTGCCGG GTATGTGAAC
AATCGAGAAT TAGCAGGTTA TTACGCAGCT TGCGATATAT TTGCCATGCT CGCTTTGTCA
AATACTCCAG CTAGCCGATT AGAAGGATGT GGTAGTGTCT ACTTAGAAGC TAGCTATTTT
GGTAAACCTG TAATTGCTTC TCGTCACCCG TCCTTAATTG ATACGGTACG CCATGAAGAA
AATGGCTTGC TGGTAAATCC CAAGTCTGGT TACGAAGTTT TTCAAGTATT CAAACAATTG
TGCCAAAATC AGCAATTGCG TGAACAACTT GGTCGTCAGG GAAAAGAATT AGCCAAGCGC
AAAACCTTAC ACCGTTCTTT ATACCTTGGG CAAGGGGTTA GAAGCTAG
 
Protein sequence
MEHISQLATK IRDTTASPNI LVISRFFLPK EAVIGEYLYN RCLQDPEQVI VLAASCRGDK 
VFDQAQKFPV YRWPNPKYWL GGFLGSCLQP LFNLVAPFVL ALKLYFRYHY RYIEWGHGYN
FPSLLLLSYI LPIRFFIYLH GNDLGSVVDN PVWRSLFKLT LSRAEGIVCN NSLTQDYLRN
TFRLQTPTHV IHPVVRPEKF GLGSNSQSLD ELGDGILPTL GDRLRQAYNI PQSAIVILSV
GRLVKQKGFD RVIENLPLLL TIGVDVHYII CGQGPCESEL KALAERLRVD KRVHFAGYVN
NRELAGYYAA CDIFAMLALS NTPASRLEGC GSVYLEASYF GKPVIASRHP SLIDTVRHEE
NGLLVNPKSG YEVFQVFKQL CQNQQLREQL GRQGKELAKR KTLHRSLYLG QGVRS