Gene Ava_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1109 
Symbol 
ID3678524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1353009 
End bp1354046 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content35% 
IMG OID637716445 
Productglycosyl transferase family protein 
Protein accessionYP_321628 
Protein GI75907332 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.945444 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAT TATTAACAAT TGCCATTCCT ACTTATAATC GTGCTAAGCT GCTTGACCAA 
CAGTTAGCAT GGCTGGCAAA TGCTATCAAG GGATTTGAAG AATATTGTGA GATATTAGTT
TCTGATAATT GTTCTACTGA TAATACTCAA GGCATTATCC AAAAATGGCA GCATCAACTA
AATAACGTCA CCTTTAAGTC TAACAAACAT TCAAAAAATT TAGGCGTAAT GAAAAACATT
ATGTACTGCC TAAGTTCTGC GGAAACACAA TATGTTTGGA CAATTGGCGA TGATGATCCT
ATACAAGATA GGGCTATTGC CTATGTCATC AATAAGATCA AACAGCATCA AAATTTAGGA
TTAATCTTTC TCAACTTTTC TGGTCGCAAT AAAATTACTA ATCAACCAGT ATTCCCACCA
ACAATTATTG GGAATCGTTG GTTTGATGCT GATTGTGAAG ATGGGTGTAG AGATAGTAAA
GAAGTTTTTG AACATTGTTT TGCTAAAAGT GTAGGTGCAG TTATCTTCCT GAGTGCTAGC
ATCTATCGTA CTGATTTGGT TAAGCAAGCT CTACAAAATT GGCCAGATGC AGCCAATAAT
TGGATATCTT TAGCATATCT GGCTGGTTAT TGTGCTGCTA ACGGTAATGT AATTGTCACA
AAAGAAAATT TCTTAGAGTG TATTGTTGGT GTGAGCTATT GGCAGAAAGA GCCGAAATCA
GCATTATTAA TGCAATACAA ACACATCCCC GAAGTGATTT TAAAACTTCG AGAAATTGGA
TATTCTAACC AATTTTGTCG GCAGATGCTT TTGCATAATT CTAAAGAAGT AGACTTGAAA
GTTTTCTTGG GTGCTTTAAG AAGATGGCCA ATATCTGCTG TTAAAACAGT TATCCCCTTT
GTGGCTTTAG TGAGTTTGTC TGCTTTTGAA ATGATGCCTT TTAAAGAGAT AAGGGTTGCT
GAAAATAGTG AGCCAATATC TCAACAATCC TCGGCAAACA ACGATAGAAA ATCACTCCAA
AAATTACTGA ATAAATAA
 
Protein sequence
MSKLLTIAIP TYNRAKLLDQ QLAWLANAIK GFEEYCEILV SDNCSTDNTQ GIIQKWQHQL 
NNVTFKSNKH SKNLGVMKNI MYCLSSAETQ YVWTIGDDDP IQDRAIAYVI NKIKQHQNLG
LIFLNFSGRN KITNQPVFPP TIIGNRWFDA DCEDGCRDSK EVFEHCFAKS VGAVIFLSAS
IYRTDLVKQA LQNWPDAANN WISLAYLAGY CAANGNVIVT KENFLECIVG VSYWQKEPKS
ALLMQYKHIP EVILKLREIG YSNQFCRQML LHNSKEVDLK VFLGALRRWP ISAVKTVIPF
VALVSLSAFE MMPFKEIRVA ENSEPISQQS SANNDRKSLQ KLLNK