Gene Ava_B0006 details


Gene Information

Locus tag: Ava_B0006
Symbol:
ID: 3677747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007410
Strand:
Start bp: 6452
End bp: 8989
Gene length: 2538 bp
Protein length: 845 aa
Translation table: 11
GC content: 34%
IMG OID: 637714715
Product: filamentous haemagglutinin-like protein
Protein accession: YP_319909
Protein GI: 75812290
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
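
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. The function names below are illustrative and not part of the record. The sketch assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6452, 8989)
print(length)                  # 2538
print(protein_length(length))  # 845
```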


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGCA AATTTACTAT TTTATTTACA GAAATATCTT TTTTAATTAC AATTTTCACA 
ACTTTCATTC CATTGTGTCA AGCTCAAATA AATGCTGATA ATAGCTCACA AAATAATAGT
GCCGTAAATG TCCAAGGTAA TACAATATTT ATTGAGGGCG GCACTCAATT AGGAGGGAAT
TTATTTCATA GCTTTAAAGA TTTTTCAATT AATTATGGAG ACACTGCATA TTTTAATAAC
GCTTCTATTA TACAAAATAT TATTAGCAGA GTAACTGGCA ATTCTATTTC TAATATTAAT
GGAACAATCC GAAACAATGG TCAAGCTAAT TTATTTATTC TCAACCCCAA CGGATTTTTT
TTTGGTAATA GTGCTAGATT AAATATTGGT GGTTCTTTTT TGGCTACTAC AGCAAATAAT
ATTAAGTTTT CTAACAATAT AGATTTTAGT GCTAATCGCA ATCAGCTTGA CCCTTTGTTA
AAAGTTAGCG TTCCTGTAGG GTTAGTTTTT ATAGGTAATT CTGGAGAGAT TAGACTTCAA
GGTAGCGGTC ATAATATAAA TAGAATTGCT AATAGGAATT TTGATTTTAT TACTCCTATA
GACGCTAGTA GATTTTCTGG ATTACAAGTA CAACCAGGAA ACAATATTTC CTTAATAGGC
GGAAATGTTT CACTTGAAGG GGGGATACTA AGCACCAGAA ATGGAAAACT CCAGATTGGC
AGTGTACTAG ATGGATACGT TGGATTAGAG CAGCAGGACA ATGATATAGA TTTTGATTTC
AATAATGTTA ATACATTTAG GAATATAATT CTAACAAAAA ATTCATTATT ATTCATTAAT
AGTAATGCAG GCACAGGTAA TACTATTGAT ATTCAAGGTA AAAATATTAA CATTTTAGAT
GGTTCATTAG TATTTACACA AAATCATGGC TTTAAAACTG GTGCTGTTAA AATTGATGCT
CAATCTCTAA ATATTCAAGG ATCATCTAAT CTAGCTTTAA GTGCAATATA TACAAGCAAT
TTTGGCTCTA CTCCTGGCGA GTCTATTCAA CTTGACGTAA AAGATGTAAC AATTCAAGGT
GGACAAATAG CAACTACTAC TTTTACTAAT GCCCCTAGTG GATTAATCAC TATTAATTCA
AATTCTTTAA AAATTTCCGG TGACACCCCT TCTTATGCTA ATCCTGATGG CTTAGGTGGA
ATCAATACTT TCAGTTATAG TTCTGGAAAG GGTGGAGATA TTGCAGGTAA AATTAATAAT
ATAATCATAG GGCTAGATGG AGTTTTAAAT ACTGTTGCAT CTGGTTCAGG TGCAGGAGGA
AATTTATTTT TAGAATTGGA AAATCTTGTC ATTAAAGATG GTGGAGCTTC ATTAGGGTCT
AGCACTATTC GTAGTGGGCA AGGCGGAAAT GTTTTTATCA AGAGTCAAAA CATAGATATA
TCAGGACAAT CAGCATTATT ACGTCCTAGT AATATTACGT CATCCACTTT TGGTTATGGC
AATGGTGGTA ACATAGACAT AAACACTTTA AATTTAATTA TTAGCAACGG AGGAGGTATA
AGTAGTAGCA CACTATCTGC GGGAAAGGCT GGTAATATTT CCATTAACTC AAGTAATTCA
ATAAATGTAG TAGGTACGAA TCTTAATTCA AACTCACCAA GTTTTATTAA TTCATCAAAT
TTTCTTTTAA TCGATCCGAA TTTGCAAAAA TTATTGTATC GACAACCTCC TTTACTTATT
GGTCAGGCAG GCAATATATT TTTAAATACA AATACTATAA ATATAAGTAA TGGTGGGCTG
ATTAATGCGA GAAATGAAGG TGTCAATGAT GCAGGAAATA TTAGGATTAG TGCGAACACA
ATAAACATTA ACTCTCAAGG AGAAGTTAAC GCCACCACTA CGATTGGAGA AGGTGGCAAT
ATTATTCTCA ATTCTAGAAA TTTATTTTTA AATAATAGCA GGATTACAAC AACGGCTGGA
GGCATGGGCA ACGGCGGTAA CATTAGAATT AACACAGGCA TTTTGGTAGG TTCAAACAAT
AACCAAATCG TAGCTAATGC CTTCGAGGGT AGAGGTGGCA ATATTCAAAT TAATGGTCAA
GGCGTTTTTC TTTCACCTAA TACTCAAGTA AACTCCAAAT CACAAAGAGG TATTGACGGA
ACAGTTGACA TCAATGCAAA TGTCTTTTTA GCCCAAACTC CAGTTAAATC ACAGGGATTT
CAAGAGTCAC CGCAGATTGT TTCCACCTGC CAAGGAAGGT CAAATACAAA AGGAAATAAT
GAATTTATCA TGACTGGTAC AGGAGGATTA CCAGCAAGTT CAGAACAATT ACCCGATGTT
GAATCTACTT GGCAAGCAAA CTCGACTGAG AACATCTCTT ACATCGAACC AACAGTAGAT
AATGAAATCA TAGAAGTGCA AGGATGGGTA AGAAATTCAG ATGGCTCAAT CACATTAACT
GCCCAAGCAA ATCGGGTAAG TGCCAATGCA AATCAATCTG CAAGTTCTTG TAATTATCAA
CCAAAACCCA AGGTCTGA
 
Protein sequence
MNSKFTILFT EISFLITIFT TFIPLCQAQI NADNSSQNNS AVNVQGNTIF IEGGTQLGGN 
LFHSFKDFSI NYGDTAYFNN ASIIQNIISR VTGNSISNIN GTIRNNGQAN LFILNPNGFF
FGNSARLNIG GSFLATTANN IKFSNNIDFS ANRNQLDPLL KVSVPVGLVF IGNSGEIRLQ
GSGHNINRIA NRNFDFITPI DASRFSGLQV QPGNNISLIG GNVSLEGGIL STRNGKLQIG
SVLDGYVGLE QQDNDIDFDF NNVNTFRNII LTKNSLLFIN SNAGTGNTID IQGKNINILD
GSLVFTQNHG FKTGAVKIDA QSLNIQGSSN LALSAIYTSN FGSTPGESIQ LDVKDVTIQG
GQIATTTFTN APSGLITINS NSLKISGDTP SYANPDGLGG INTFSYSSGK GGDIAGKINN
IIIGLDGVLN TVASGSGAGG NLFLELENLV IKDGGASLGS STIRSGQGGN VFIKSQNIDI
SGQSALLRPS NITSSTFGYG NGGNIDINTL NLIISNGGGI SSSTLSAGKA GNISINSSNS
INVVGTNLNS NSPSFINSSN FLLIDPNLQK LLYRQPPLLI GQAGNIFLNT NTINISNGGL
INARNEGVND AGNIRISANT ININSQGEVN ATTTIGEGGN IILNSRNLFL NNSRITTTAG
GMGNGGNIRI NTGILVGSNN NQIVANAFEG RGGNIQINGQ GVFLSPNTQV NSKSQRGIDG
TVDINANVFL AQTPVKSQGF QESPQIVSTC QGRSNTKGNN EFIMTGTGGL PASSEQLPDV
ESTWQANSTE NISYIEPTVD NEIIEVQGWV RNSDGSITLT AQANRVSANA NQSASSCNYQ
PKPKV
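
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of how one might do that, compute a GC fraction, and translate a fragment of the gene sequence. All names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of the record. The codon assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs only in which codons may act as start codons, and that is not modelled here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 up to the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the first and last lines of the gene sequence.
print(translate("ATGAATAGCA AATTTACT"))  # MNSKFT
print(translate("CCAAAACCCA AGGTCTGA"))  # PKPKV
```

The two translated fragments match the first six and last five residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.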