Gene Ava_1390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1390 
Symbol 
ID3682684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1714143 
End bp1717244 
Gene Length3102 bp 
Protein Length1033 aa 
Translation table11 
GC content41% 
IMG OID637716727 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_321908 
Protein GI75907612 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.379495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTC ATTGCATCAA AATAGGTATC TCTATCATTT TAGGCGTAGG TGCAATAACT 
GGCATTGAGC ATCCTAGCAA CGCCGAAATT ATTCAAGATA GTACACTACC TGAAAATACT
AATGTGACTG TTAATTTCGG TAAAGAGGGT GATTCTACCT TCATTATTAA TGGCGGAACA
GCAATAGGCA ACAATCTATT TCATAGTTTT AGCAAATTTT CCTTACTCAA AGGTCAAAAT
GCTTACTTTA ATAGTACTGA CAATATTCAA AATATCTTTA GTCGGGTTAC TGGTGGAGAA
GTATCAAATA TTGATGGCTT AATCACAGCT AGAAGCAGGG TTAATTTATT TCTCATTAAT
CCCAATGGAA TTATTTTTGG GCCTAATGCT CAGATAAATA TTCAGGGTTC TTTCGTTGCT
AGTACTGCTC ATAGTGTCAG ATTTGCTGAG GGTGGAGAGT TTATAGCTAA AACTGCAAGT
GTTACGCCCT TACTCAATGT CACAACTCCT ATTGGTTTAC AATATGGGGC TAATTCTGGC
AGCATTCAGT TACAAGGAGA ATCAATACAT CAGCCTAGTA GGCTGCAAGT TGAAGGCAAT
CAAACTTTAG CCTTAATTGG TGGAGATGTT ATTTTAGGAA ACGCTAATCT TAGCACTATA
AAGTCTGAAG GCAGGATAGA ACTAGGCAGT GTTGCATCAG CAAGTTTAGT GGGAATTAAA
TCAGTCAGTT CAGGCTTCTT GTTTAAATTT GACGGGGTAT CTGAATTTGG TGATATTCAA
CTATCTTCAG GCACTAATAT TAATAGTTTA GGTGACCTCA ATCTCACGGG AAAAAATATT
GTACTTCAAG ATGCGTCTTT GAATGTGTTC AATAATCTCA CAGTCAATGC CCAAGAGAGC
ATTCAGTTAA TAGATAGAAG TGCAATTGAT ACTAGAAGTT CTAATAATTC GCCAGGAAAT
CACACTATTA ATACTCGCAG ATTATTACTG AGTAATCAAT CTTTTATCGG TATAAACGGC
GGTAATCTGG AGGTGAATGC CTCAGATGTG GTGGAACTGA CAAGTGATCT CAAAGGTACT
CCCAGCCGTA TCTATGCCAC AACTTCAGAT GGTGACGCAG CCAATGGAAA TTTAACAATT
AATACTGGGG ATTTATTAGT CGAGAATGGC TCACAAATTA TCACTAATTC TTCTACTGCA
TTTTCCGGCA TTATTCCCCT TAATGTGCAG ACAAAAGCTA GTTTAACTAT TAATGCCGCA
AATTCAGTGA CTTTGCGGGG TAGTTCATTG GACGAAATAT ATTCTAGCGG TGTGTTTAGT
CAAACTGATG GTGATGGCAA GGCTGGAGAC CTGACGATTA ATACTCGTGT GTTACGAATT
GAAGACGGGG CGCAGGTCAT TGCCAGAAAT TTGAGTGTCG GTAAAGGCGG AAACTTGACT
GTAAATGCTT CTGATAAGGT GCTAATAATT GGTACTTCTC CCAAAGGTTC AATTCCTGGG
AACTGGCCTA ACGATTCAGA GCTAGATGAG CAAGTACCTA GAGCTGTCGG CCCCTTAAAA
GAGTTGTTGA TAACAGGGGT GTTACGCAGA GATAGTTTAC CTAGTGGTAT ATTTACTGAC
TCAGTTTCTT CCGGTGATGC TGGCATGATC ACGATTAACA CTGGTGAGTT AACAGCACAA
AATGGAGGGC GAATTAGTGC CGATGCTTTT TCAGCAGGTA AAGGGGGAGA TTTAATGATC
AGTGCTACTG ATAAGGTGGA ATTGATTGGT ACTGCTGTCA ATGGGGTTGC TAGTGGTTTG
TTCACGAGAA CAGGTTCTTC AGCTACGGGA AATGCCGGAT CTTTGACAAT TGTCACTGGT
AATTTATTAG TAAAAGATGG GGCGCAAGTC AGTGTTAGTA CTTTTGGTAC GGCTAAGGGT
GGCAACTTGT TGGTGCAGGC GGCTGAGGGG ATAAAACTCA TCGGTGTTTC TCAGAGGAAT
ATTGCCAGTG GTTTGTTTGC CCAAGCTAAT CGTTACGCAA CAGGAGATGC GGGTAGTTTA
AAGATTGATA CCTCGACCTT ATTGGTACGT GATGGCGCAC AAGTTAGTGC TAGCACTTTC
GGGGCAGGTA AAGGTGGAGA TTTGTTTGTC CAAGCTTCTG ATATCAAACT AATCGGTACT
GCTGCTGATG CTTCGTTCTC TAGTGGCTTG TTTACTGTAG CAACGGCAAA CTCTACTGGT
AGCGCTGGTA AACTCACGGT GAACGCGGAT GTATTGGATA TTGAGCAGGG AGCGGGAGTA
GGAGTGCAGA GTAGTGGTAA GGGGAGTGCA GGCAACTTAA ATATTAATGC TCATAGAATC
AGATTGGATG ATCAAGCTTT TATCAGTGCC GATACTCGTG ATAATGGGAG TGACCCTAAC
CGATCGCAAG CAAATATCAA TCTGCGATCG CGCAATCTCA TCCTATCTCG TGGCAGTAGC
ATCACCACTA ACGCCACAGG CAGTAATGTC ATTGGCGGCA ATATAGACAT TGATACTAAC
ACTCTAGTTG CTATTCAAAA TAGTGATATT AGTGCTAATT CTGCTGACTT TCGTGGTGGT
CGAATCAATA TCAATGCCCA AAATATTTTT GGAACCAAAT TTCGCAATCA ACGTACCCCC
AATAGCGATA TTACTGCTAC TGGTGCTAGT CCTGAGTTAA GCGGTGCAGT GGAAATTACT
ACCCCTGATG TAGACCCTAG TCAAAGTTTA AGCCAACTCC CATCAGAGGC TGTTGATGTG
TCTAATCAAA TCTCTCAAGA ATGCCGGATT GATGAAGCGA CGGCACAGAG ACAAAATCAG
TTTATTATTA CTGGACGCGG TGGTGTACCA ACAAACCCCT ATGAAACCTT AGATAACACG
GCGATAATTA CGGATTGGGT AACTGTTAAT GATGTCAATA CTGTTGCTCA CAAAGAAAAT
AATTTCGCCC AAGAGGAAAA CACTGTTGCC AATAGTATTG TTGAGGCTCA AGGCTGGGTT
TATGATACCC AAGGAAATTT GGTTCTGACT GCCGAAGCAA CTAAAATAAC AGGTCATGGT
TCAGGATTGA CAACCGATTT CTGCCAGGTA AATAAGGGAT AA
 
Protein sequence
MSRHCIKIGI SIILGVGAIT GIEHPSNAEI IQDSTLPENT NVTVNFGKEG DSTFIINGGT 
AIGNNLFHSF SKFSLLKGQN AYFNSTDNIQ NIFSRVTGGE VSNIDGLITA RSRVNLFLIN
PNGIIFGPNA QINIQGSFVA STAHSVRFAE GGEFIAKTAS VTPLLNVTTP IGLQYGANSG
SIQLQGESIH QPSRLQVEGN QTLALIGGDV ILGNANLSTI KSEGRIELGS VASASLVGIK
SVSSGFLFKF DGVSEFGDIQ LSSGTNINSL GDLNLTGKNI VLQDASLNVF NNLTVNAQES
IQLIDRSAID TRSSNNSPGN HTINTRRLLL SNQSFIGING GNLEVNASDV VELTSDLKGT
PSRIYATTSD GDAANGNLTI NTGDLLVENG SQIITNSSTA FSGIIPLNVQ TKASLTINAA
NSVTLRGSSL DEIYSSGVFS QTDGDGKAGD LTINTRVLRI EDGAQVIARN LSVGKGGNLT
VNASDKVLII GTSPKGSIPG NWPNDSELDE QVPRAVGPLK ELLITGVLRR DSLPSGIFTD
SVSSGDAGMI TINTGELTAQ NGGRISADAF SAGKGGDLMI SATDKVELIG TAVNGVASGL
FTRTGSSATG NAGSLTIVTG NLLVKDGAQV SVSTFGTAKG GNLLVQAAEG IKLIGVSQRN
IASGLFAQAN RYATGDAGSL KIDTSTLLVR DGAQVSASTF GAGKGGDLFV QASDIKLIGT
AADASFSSGL FTVATANSTG SAGKLTVNAD VLDIEQGAGV GVQSSGKGSA GNLNINAHRI
RLDDQAFISA DTRDNGSDPN RSQANINLRS RNLILSRGSS ITTNATGSNV IGGNIDIDTN
TLVAIQNSDI SANSADFRGG RININAQNIF GTKFRNQRTP NSDITATGAS PELSGAVEIT
TPDVDPSQSL SQLPSEAVDV SNQISQECRI DEATAQRQNQ FIITGRGGVP TNPYETLDNT
AIITDWVTVN DVNTVAHKEN NFAQEENTVA NSIVEAQGWV YDTQGNLVLT AEATKITGHG
SGLTTDFCQV NKG