Gene Ava_2810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2810 
Symbol 
ID3681646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3473739 
End bp3477197 
Gene Length3459 bp 
Protein Length1152 aa 
Translation table11 
GC content39% 
IMG OID637718156 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_323318 
Protein GI75909022 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00106289 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.309212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTAAAC GAAATTATTT AAATCGGTGT TTGAGCTTAA GCTTATTAGT CAGCGTGACT 
TTAACAACAA ATGGGAAAAT TTTGGCACAA GTCACAGCCG ACCAAACCTT AGAAACACAA
GTTATAGATA TTGGTTTAAA CTCCTTTATT TTAGGTGGTA CAACAGTTGG TAACACTAAT
TTATTCCACA GTTTTGCTAG TTTTAATGTT CCTAGTAATG GTGCAGCAAT TTTTATCAAT
GACCCTAGTT TAACTAATAT ATTTGCACGA GTAACAGGTG GCACAGCTTC CGATGTTCAA
GGCAGAATAG GTACTCAAGG GACTGCCAAT TTATATTTAA TCAATCCCAA CGGCATTATT
TTTGGCACAA ATGCTAGCTT AAATATAGGT GGTTCTTTCG TCGCTACTAC AGCTAATGTT
ATTCAATTTC CTGGTGGTGC TGAATTTTCT TTAAATTCAT CAGTTTCACC AGATAATCAT
CTACTTAGGG TCAACCCCAC AGCATTCTTA TTTAACCAAA TCGCTAATCA AGGTACAAAC
TCAATTGAAA ATCGTGCCTA TTTAGGAGTT CCTAATAATA GGAGTTTAAT TCTACTTGGT
GGTAATATTG CACCCACATC TCATACCACC GGCAGAATTA TAATCGATGG CGGAGTAGTT
CAGGCATTAG GTGGTAGGGT AGAAATTGGT GGATTAGTTG AACCTGGCTC CATAGGAATT
AATATTGATG GCAATAAATT CAGCCTGAAT TTTCCTGATA GTGTAGCAAA AACAGATATT
TCATTAATCA ATAATAGCGC CCTTTTTATT TCTGGTGCTG GTGGTGGTGA TATTATAGTT
AATGCTGATA ATTTGAGCCT AAATTCTAGT TATTTTTTTG CAGGTATACT TAATAATCAA
GGCAACCCTG AGACTAAAGC AGGCGATATC TCAATTAATG CTACAGGCAT TGTCACTGTT
GCTCAAAATA GTATCATTGG AAATTCCTCT GTGGGAATTG GAGATAGTGG AAACATCAAT
ATTTTTGGTC AATCACTCCA AATAATTGGT AATAGTGCAG TTCAGTCGTC TGCCTTACAA
GGTAATTCCG GAACAATTAA TATTAAAGTC GATGATACGA TTGCTTTATT AGGCAGTCAA
ATTGGCAGTA TTGTTCGTCC GCGAGATACC TCTGTGTCTT CGTTGCAGTC ACGGTTATTA
GGAACACCAA CTAGGGGAAA AAGTGGTGGT ATCAATATCC AGGCACGCAA TCTTTTAGCA
GTAGATAGTA GTTCAATAAG TGCAAGCAAC TTTTTGGCGG ATGATTCTGG AGATATTAAA
ATTCAAGCTG CTGATACGGT GATTTTAAAT AATAGGGGTT CTATATCTTC AACCGCCTTT
GGTCAAGGAA AATCTGGTAA TTTATCTATT AGTACTAATC GGTTGAATGT AATTAATTAT
TCTCAAATAG CAGCTAATAC ATTGGGGACT GGAAATGCTG GGGATATTAA TATTAATGCT
GAGGATATTA ATATAGAGGA TAAGTCTTTT ATTACTAGCA GTACATCCTC TTTTTTTAAC
ACTATCAGCA ATGTAGGCAA TGCAGGAAAT ATCAATATTC AAACTTCACG AATTAGTGTA
AAAAATAGTG GCTTTATTAT ATCTATTTCT GGTGACAGAG AACAGGCATT AACCAATGGA
TTGGGTGGTA ACATCAATAT TACAGCTACT GAACTAATAG AAATAGATGC AAAAGGTGCA
ACTGATATAA TCACAGGTTT TAGTGCCAAA ACTTTTAGTG ATAGCCGTGC TGGTGACATA
ACACTAAATA CCAAAAATTT GATTGTGAGA AATGGCGGTG CTATTACGGC AGAAGCTGCT
AATAGTTTAG GAGGAAACGC GGGAAATATT AATATCAATG CCTCCGATAC TGTTAAGCTC
ATCAGCTCCC CAGGAAATTC CTATAGTAGA ATTTTCACGG GAGTTGTGTC TTCTGATAAT
AATGTGACGA ATGCGGGTAA TGGTGGTGAG TTAAAGATTA CCACTGGAAA ATTACAGCTA
ACAAATGGCA TTATATCAGC AGCAACATTC GGTCAAGGTC ATGCAGGCAA TATTACTATA
TTTAGTAATG ATGAGATAGT TCTTGATAGT GGGTTAATAT TTAGTTTAGT TGGTGCTGGT
GCAGTCGGGA ATGGGGGAGA TATTAATATT CAAACACCAA GACTAACTTT AACTAATGGT
AGTCAAGTTG ATGCTAGTAT TCGGGGTGGA GGAATAGGCA AAGGTGGCAC TATTCGCATT
GATGCGGCAG ATTATGTAAT TATTTCTGGA CGGGATGCAG ACGGGTTTGA GAGTTTCTTA
GCAGCTGAGA CTAGCAGAGG GGGTATTGGT CAACCAGGAG ACATCATCGT TAATACAGAT
TATTTCCTCT TAGAAAAAAA TGGTTTTGTG ACTACCGGAA CTTTTAACTC CAGTAACGGA
GGTAGTATTA CCATTAATAC TCGTATCTTT GAGGCCTTGA CTGGTGGACA AATTTTTTCC
AGCACCGGTA GTAGTGGCAA AGCAGGCGAC ATTATTATTA ATGCTACAGA TAGTATAATC
GTCTCTGGGG TAAATGCAGA AACTGGAAAT AATGCTGCGA TCGTTGCAGG AACACTTGCA
GACTCAACTG GTAATGGAGG AACAGTATCC TTATCTACTA ATAATTTTCA TTTAAGTAAC
GAGGCTGGAG TTCTCACACG CAGTCAGGGA CGAGGTATTG CAGGAGATAT CAACATTACT
GCTAGGGGCA ATTTTTATGT AAATCATGCT TTTGTCAGCG CACAAGCTGA ACAGGCTGGT
GGTGGAAATA TCGATATCAC TGCGAAAAAT ATTAATCTGC GTAATAACAG TGACATCCGC
ACTGATTTAT CTAATGGTAG CGGTAGGGGT GGGGGAATTT CCCTCACTGC AAATACCATC
ATTGCCTTAG AAGATAGTGA TATTCTCGCC TTCGCACCAG AGGGACAAGG CGGAGATATT
AAATTTAACA CCCGCGCTGT GTTTAGTGAT TCTCTCTACA CCGACAGACA AACTATACCT
GATAGAAATA GTCTCCAGTC ACTAGTGAGT AATGGTAGCT CTGACATTAA CGCCACTGGG
ACAATCTCTG GCAATATTAT TGGTGTGCCT GATATTAGCT CCATCCAAAA CGGACTCACA
GATTTACAAG CTAATCCCAT TGATACTACC GTACTGATTG CCAATAGTTG CATTGCGCGT
AGTCTCAGGC AAGAAGGTAG TTTTGTGATT ACTGGGACTG GTGGTTTACC AACTCGTCCT
GGTGAAGTTA TGGCTTCTAG TTATGCCACA GGTGATGTGC AGAGTGTCAG TGATGAAAGT
AAGGCTAGTT TGTGGAAAAA AGGCGACCCG ATTATTGAAC CGCAAGGAGT ATATCGATTA
GCAAATGGGG TTGTGGTGAT GAGTCGTGAG TGTCATTGA
 
Protein sequence
MIKRNYLNRC LSLSLLVSVT LTTNGKILAQ VTADQTLETQ VIDIGLNSFI LGGTTVGNTN 
LFHSFASFNV PSNGAAIFIN DPSLTNIFAR VTGGTASDVQ GRIGTQGTAN LYLINPNGII
FGTNASLNIG GSFVATTANV IQFPGGAEFS LNSSVSPDNH LLRVNPTAFL FNQIANQGTN
SIENRAYLGV PNNRSLILLG GNIAPTSHTT GRIIIDGGVV QALGGRVEIG GLVEPGSIGI
NIDGNKFSLN FPDSVAKTDI SLINNSALFI SGAGGGDIIV NADNLSLNSS YFFAGILNNQ
GNPETKAGDI SINATGIVTV AQNSIIGNSS VGIGDSGNIN IFGQSLQIIG NSAVQSSALQ
GNSGTINIKV DDTIALLGSQ IGSIVRPRDT SVSSLQSRLL GTPTRGKSGG INIQARNLLA
VDSSSISASN FLADDSGDIK IQAADTVILN NRGSISSTAF GQGKSGNLSI STNRLNVINY
SQIAANTLGT GNAGDININA EDINIEDKSF ITSSTSSFFN TISNVGNAGN INIQTSRISV
KNSGFIISIS GDREQALTNG LGGNINITAT ELIEIDAKGA TDIITGFSAK TFSDSRAGDI
TLNTKNLIVR NGGAITAEAA NSLGGNAGNI NINASDTVKL ISSPGNSYSR IFTGVVSSDN
NVTNAGNGGE LKITTGKLQL TNGIISAATF GQGHAGNITI FSNDEIVLDS GLIFSLVGAG
AVGNGGDINI QTPRLTLTNG SQVDASIRGG GIGKGGTIRI DAADYVIISG RDADGFESFL
AAETSRGGIG QPGDIIVNTD YFLLEKNGFV TTGTFNSSNG GSITINTRIF EALTGGQIFS
STGSSGKAGD IIINATDSII VSGVNAETGN NAAIVAGTLA DSTGNGGTVS LSTNNFHLSN
EAGVLTRSQG RGIAGDINIT ARGNFYVNHA FVSAQAEQAG GGNIDITAKN INLRNNSDIR
TDLSNGSGRG GGISLTANTI IALEDSDILA FAPEGQGGDI KFNTRAVFSD SLYTDRQTIP
DRNSLQSLVS NGSSDINATG TISGNIIGVP DISSIQNGLT DLQANPIDTT VLIANSCIAR
SLRQEGSFVI TGTGGLPTRP GEVMASSYAT GDVQSVSDES KASLWKKGDP IIEPQGVYRL
ANGVVVMSRE CH