Gene Ava_3571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3571 
Symbol 
ID3679517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4447156 
End bp4449147 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content39% 
IMG OID637718922 
Producthypothetical protein 
Protein accessionYP_324072 
Protein GI75909776 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.68126 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00704751 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGTACG CAATACCAGA TGATATCCAA AAGTTACAAT TGAGGATAAA AAGACGATTC 
AATAAAACAT TTAAGTATTT AAATTATAAA TTTTTCAAAT CATGGATTTA TTTATTTATT
GCTACTATCT CATTTCTATT TATTACTATT GTTTCAACAG ATTTAACATT CAGTCAAGTT
CCAGAGGATA CCTATAAAAG AGATTTGATC AGTGATATTT TTGCTGGCAG TCAGAAATCT
ATACAAAACT ATATTGAGGG AATGTATGTT GCGCCTAATG GCACAGTTTA TACAAATAGT
CATGAGGATA AAGCAGGAGC CGAAGCATCA ATATATAAAG ATGGTAATGT AATTGGGGTT
CTCAATGATC TTCATGGTTG GAGTCGTCAT GGTGGTAAAG CTGTCACAGC AAATAGTAAG
TATATTTACA TTGCTATGAG CCAAGGATTT GTCGGCAAAA TAGACAAAGA TTATCCACCA
GAGGGAACAA CTTGGTATTG TGTGAGACGT TATAACTTAT CTGGTAAACC AGCGCCCTTT
GCTAATGGAC GTGGTTGGGA TAAGAGTATG TTAATCGTCA ATACTAAAAG TGAGGTGACT
GGATTAGCAA CGAAAGATAA GAATTTATAT GTCAGTGATG CAGCGAATAA TAAAATTCGC
GTCCATAACA CTGAGACAAT GAAAGAATTA CGCAACTTTA GTTTCCTGAG ACCAGGGGAT
ATAACTATTG ATAAGCAGGG GAATTTGTGG ATTATTCAAA AGCAAGATGC CAATAATCCT
GCTAAAATTT TGCGTTATTC TTCATCAGGA AAACAATTAC CGCAAAAAAT AACTAATGTT
GTTTTACCCG GAGCGATCGC ACTTGATAAT CAAGGCAGAC TGTTAGTAGC GGAAAATGGG
ACACTTCAGC AAGTATTGAT TTATAACATT CAGGATCAAC CAGTCCAGGT CGGCACTTTT
GGCCACACAG GGGGGATTTA TGGAGGTGTT CCTGGTGAAG TCCAAGATTT AAAGTTTTAT
GGACTCACAG GAGTCGGCGC AGATACTCAG GGGAATTTGT ATATCAATAA TAATGGGTTC
AATAATTCCG GCACAGATTT AAGAAAATTT TCGCCATCAG GAAAGCAACT ATGGCGATTA
TTAGGATTAT CCTTAGTCGA AAATGCCGAC GTTGACCCCA CTACAGATGG ACGAGAAGTT
TTTACCAAGT ATGAACACTT TTTGATAGAT AGTAGCAAGC CTAAGGGTCA ACAATTGACT
TATAAAGCTT ACACCTTAAA CGGTTTTAAA TATCCTCAAG ATCCACGACT GCATATATCC
CCCGATGCTA CCTTTGTACG TCGAATCAAT GGTGAAAGAT TTTTGTTTCT CTTAGATCCG
TACAGTAACG CCTTAGAGAT TTATCGATTT AATCCCAGCA AAGATGGGAA TATAGCTATT
CCAGCAGGGA TGTTTGTTGG TAAAAATAAC GAAGGTAAAT CAGCAATCTC AGGGACTTGG
CCACCTCAGC AACCAAAAAC AGGAAAATGG ATTTGGCGCG ATCGCAACGG TAACGGCGCT
TTCGATAAAG AAGAATATGA TCATAGTGAA GATTCTTCCT CTGTTACAGG ATGGTGGGTG
GATAGTAAAG GCGATGTGTG GACAACTCTG GGCGATCGAC AAGGTATTCG CCATTATTTC
TTGCAAGGAC TAGACACTAA CGGTAATCCC ATCTATACCT ATAGTTCCAT GCAAAAAGAA
ACAACTCCCC GCATTTTTAC AGATTTACGG CGAATCAAAT ACTTTCCTGA AAGTGACACC
ATGTATCTGT CCGGCTTTAC AGTTAAAAAT CCAGCAACCT TTGTAGATCC CCTAGCCGCA
GGCTCTGAAA TTGCCCGTTT TGACAATTGG AGTCAAGATA ATCGTATTCT CCGTTGGCGA
ATTGTAGTTC CTAACGACAC CATCCGTAAA CGTGAAATTA TTACTGCGTC TACGCACTCA
GTCCCAGATT AA
 
Protein sequence
MKYAIPDDIQ KLQLRIKRRF NKTFKYLNYK FFKSWIYLFI ATISFLFITI VSTDLTFSQV 
PEDTYKRDLI SDIFAGSQKS IQNYIEGMYV APNGTVYTNS HEDKAGAEAS IYKDGNVIGV
LNDLHGWSRH GGKAVTANSK YIYIAMSQGF VGKIDKDYPP EGTTWYCVRR YNLSGKPAPF
ANGRGWDKSM LIVNTKSEVT GLATKDKNLY VSDAANNKIR VHNTETMKEL RNFSFLRPGD
ITIDKQGNLW IIQKQDANNP AKILRYSSSG KQLPQKITNV VLPGAIALDN QGRLLVAENG
TLQQVLIYNI QDQPVQVGTF GHTGGIYGGV PGEVQDLKFY GLTGVGADTQ GNLYINNNGF
NNSGTDLRKF SPSGKQLWRL LGLSLVENAD VDPTTDGREV FTKYEHFLID SSKPKGQQLT
YKAYTLNGFK YPQDPRLHIS PDATFVRRIN GERFLFLLDP YSNALEIYRF NPSKDGNIAI
PAGMFVGKNN EGKSAISGTW PPQQPKTGKW IWRDRNGNGA FDKEEYDHSE DSSSVTGWWV
DSKGDVWTTL GDRQGIRHYF LQGLDTNGNP IYTYSSMQKE TTPRIFTDLR RIKYFPESDT
MYLSGFTVKN PATFVDPLAA GSEIARFDNW SQDNRILRWR IVVPNDTIRK REIITASTHS
VPD