Gene Ava_2352 details

Gene Information

Locus tag: Ava_2352
Symbol:
ID: 3683467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 2921564
End bp: 2924026
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 42%
IMG OID: 637717697
Product: filamentous haemagglutinin-like protein
Protein accession: YP_322865
Protein GI: 75908569
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
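
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2921564, 2924026)
print(length_bp, protein_length(length_bp))  # 2463 820
```

Both results agree with the Gene Length and Protein Length values listed in the record.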


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAA CCTTGGTTGG GTTTGGCTTT CTGGGTATAT TCTGCCTATC TGCTGTTGAT 
AATAATTGCG TTTACGCACA AGTAATTCCC GATCACACCC TCAAAACTTC TGTAAGTGGC
AGTAATCCTT ACATCATCAC TGATGGTACT CGTATCGGCA ATAATTTATT TCATAGCTTC
AGCCAATTCT CTATTCTTAA AGGTGATTCT GCATCATTTG ACAATGCTAT AGATATTCAA
AATATTTTTA GTCGTGTGAC TGGCGGTAAT ATTTCCAACA TTGATGGTTC CATCAGTGCT
AATGGTAACG CCAACTTATT CTTAATCAAC CCGGCTGGGA TTATATTTGG GCAAAATGCC
AGTTTAAATA TTGGTGGATC ATTTGTCGGG ACAACAGCTG AGACTATCAC ATTTGCTGAT
GGGAAGAGAT TTAGTAGCAC AGACACTGAT ACATCACCCT TGTTAACCAT GAATGTACCC
ACTGGGCTGC AATTTGGTAG TAATTCAGGA GGGATAACAG TTCAGGGTAC AGGACATAAT
GCCCAATTAA GTGACTCACT ACAAGTTTCT GGACTAAATC TCAGTACCAG AGGATTACAA
CTACAACCAG GAAAAACACT AGGGTTATTA GGCAGCAATA TTGCCTTGGA TGGTGGACTG
CTCTCTGCAC CAGGAGGACT AATAGAATTG GGTGGTATCA CCAGTGGCAA CGTCATTCTC
AATTCTACTC CGCAAGGATT TGCCTTCCAG TATTCCCAAG TTCCGAGTTT TGGCAATCTT
CAAATCACTC AACGAGCTTT AGCATCAACA CGGGATTTCA ATGGGGAAAG TGGAGGAGCC
ATCCATATCC AGGGAAAACA AGTCAGTATC CGCGATGGTT CGTTGATATT AGTACAAAAT
CGCAGCCAAA AGACAGCTGG TGATATTGCT GTTGATGCTT CAGAGTCTTT AGAACTTATT
GGCAAGTCTC CTGATTTTCA AAGTTCCAGC AGTTTAGTGA ATGAAAGCAC TTCCTCTGGT
AAGGCTGGGA ATATTATCAT TAACACTCCA CGATTGAACA TTGATCAGGG TGGGTATATT
TATAACCGGA CGTTCAGCAC AGCACCAGGT GGCAATATTA TCGTCAATAC CGATGAAACG
CGGGTTAATG GTTTTGCTTA TGGTGATCCT TCCGCTTTTC GAGCAGTTAG TCAAATATTA
GCTGCTTCCT ATGCAGGCGG GAAAGGCGGT AATATTTCTG TCTCTACTCA AAAGCTATCT
GTTTTAGCTG GGGGAAATAT AGCAGCCAGA CCCTATGCTG GAGGTAATGG GGGTGATGTA
ACAGTAAAAG CAGATACGAT AGAAGTAGAT AATACAGGCT CTCCGACTGG TTTTTATTTT
AGCTTAATTT CGACTGCCAC CTTTGGGTCA GGTAACGCGG GAAATTTGGC ACTCGATACT
CGTAAATTAT CCGTGCAAGC TGGCGGCAGA GTGTCTGCTT CTAGCATAAT TTTAGGTAAT
GCTGGTGTAC TCACTATTAA TGCTTCGGAA TCGATTGATG TAAGTGGGGT TAAGGATGCA
GACAATCCCA GCTATATCGG CACTGCTGTG CGTCCTTTTG GTGCATTTGC CCGAACTTCT
CGTGCTAATT CAGGTAACAC CACCATAAAC ACTCCAGTTT TGAATGTTAC TGATAGGGCG
ACAGTTTTTG TGGAAAATTC TGGTTTGGGT ACGGCTGGTA CGCTGCAAAT TAATGCCAAC
ACCCTTAAAC TTGACAATCA TGCAAGTATT CTAGCATCTA CAAAAGCTGG AGAGGGAGGC
AACATCAATC TTCAACTACG GGATGTATTA TTGATGCGTC ATGGTAGCTT TATCAATGCC
GAAGCAGGTA GTAATGGCAA TGGTGGCAAT ATAAGTATTA ACTCTCCTAA TGTTGTTGGT
CTAGAAAATA GTGACATCAT TGCCAATGCC GTACAAGGAA AGGGTGGCAA TATTCAAATC
ACAACTCAGG GAATTATTGG TTTGGAGTAC CGTAATCTTC TCAATCCTAG AGAAATTTTG
AGTAATGACA TTACCGCTAG TTCCCAATTC AGTATTAGTG GTACAGTCCA AATTAATAAT
GTTGGTGTTG ATCCCAATTC TGGTTTAATA GATTTACCGA CCAATCTCTC AGACCCATCA
CAACAAGTTG CCACGGGTTG TTCTAATACT AATAGTAGTA GTTTTGTGGC TACGGGACGG
GGTGGAATAC CACAAAATCC TACACAACAA ATGAGGAGCG ATCGCACTTG GTCTGATATC
CGTGATATCT CTGCATTCCA CACCAAAGAA CTAGCACAAG CCCAAATACC ACAATCCCCA
GAAAAACTTG TCCAAGCTAC TTCCTGGCAT CGTAACGCCC AAGGCAAAAT CGAGCTGGTT
GCCGCTAAAT CCTCTTCGCA GATGTCACCA ACCTTAACCT GTGCTGCTGC ACCTCAAAAT
TAA
 
Protein sequence
MKLTLVGFGF LGIFCLSAVD NNCVYAQVIP DHTLKTSVSG SNPYIITDGT RIGNNLFHSF 
SQFSILKGDS ASFDNAIDIQ NIFSRVTGGN ISNIDGSISA NGNANLFLIN PAGIIFGQNA
SLNIGGSFVG TTAETITFAD GKRFSSTDTD TSPLLTMNVP TGLQFGSNSG GITVQGTGHN
AQLSDSLQVS GLNLSTRGLQ LQPGKTLGLL GSNIALDGGL LSAPGGLIEL GGITSGNVIL
NSTPQGFAFQ YSQVPSFGNL QITQRALAST RDFNGESGGA IHIQGKQVSI RDGSLILVQN
RSQKTAGDIA VDASESLELI GKSPDFQSSS SLVNESTSSG KAGNIIINTP RLNIDQGGYI
YNRTFSTAPG GNIIVNTDET RVNGFAYGDP SAFRAVSQIL AASYAGGKGG NISVSTQKLS
VLAGGNIAAR PYAGGNGGDV TVKADTIEVD NTGSPTGFYF SLISTATFGS GNAGNLALDT
RKLSVQAGGR VSASSIILGN AGVLTINASE SIDVSGVKDA DNPSYIGTAV RPFGAFARTS
RANSGNTTIN TPVLNVTDRA TVFVENSGLG TAGTLQINAN TLKLDNHASI LASTKAGEGG
NINLQLRDVL LMRHGSFINA EAGSNGNGGN ISINSPNVVG LENSDIIANA VQGKGGNIQI
TTQGIIGLEY RNLLNPREIL SNDITASSQF SISGTVQINN VGVDPNSGLI DLPTNLSDPS
QQVATGCSNT NSSSFVATGR GGIPQNPTQQ MRSDRTWSDI RDISAFHTKE LAQAQIPQSP
EKLVQATSWH RNAQGKIELV AAKSSSQMSP TLTCAAAPQN
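
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that step, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of table 11 match the standard code (the two differ in permitted start codons, which this sketch does not handle), and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 60 bases of the gene sequence are used as input.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino acids
# assigned by the standard code (assumed shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Ambiguous bases such as N are not supported and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_block = "ATGAAATTAA CCTTGGTTGG GTTTGGCTTT CTGGGTATAT TCTGCCTATC TGCTGTTGAT"
print(translate(first_block))  # MKLTLVGFGFLGIFCLSAVD
```

The output matches the first 20 residues of the protein sequence listed above.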