Gene Ava_2530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2530 
Symbol 
ID3682415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3129325 
End bp3131655 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content39% 
IMG OID637717875 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_323040 
Protein GI75908744 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.957928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATT TTTGGGTCTG GATTCCCAAG CTAGGAGTTG CGATCGCCGG CGTATTAACC 
TGCATGACAG ATTATGCTGT CGCGCAAGTT GTTCATGATA ATACACTAAT TAATAATTCC
CAAATTAATA CACACAATAA TATTATTTCT ATTGAAGGTG GAACTAGAGC CGGAGATAAT
CTTTTCCACA GTTTTGAGCA GTTTTCTGTA GCTCAAGGAA TTACAGCCGA GTTTAAAAAT
GCTGTGGATG TAAAAAATAT TATTAGCCGA GTGACGGGTA ATTCTATTTC TCAGATTGAT
GGCATTCTGA AAAGCCAATA TGAAGCCAAT CTTTTTCTGA TTAATCCTAA CGGCATTATT
TTTGGTTCTA ATGCTTCTTT GAATATTGGC GGTTCATTCG TAGCGACTAC GGCAGATACT
TTGATTTTCG CTGATAGAAG AAATTATAGT ACAAGAGAGT TATCCCTTGT ACCATCTCCT
TTATCAATTA GTGTTCCGAT TGGTTTACAA TTTGGTGCAA ATCCAGGTTC TATTCATAAT
CAATCTCAAA CAAGCCCTAA TGGAGCTATG AATAGTCTTT TTGAGCCAGT TGGTCTGCAA
GTGAAACCAG GTAAAACCCT AGCTTTAGTT GGTGCTGGCT TGATCATGGA AGGTGGGAAT
CTGACTGCTC CTTTGGGTCG CATTGAACTA GGAAGTGTAG GAAGTAATAG TTACGTCAGT
CTAAGTCTAG TAAATCAAGG TTGGATATTG GGATATCAAG GTATTCAAGA TTTTCAAGAC
ATACGATTAA TTCAAAGAAC TGTTGATGGT TCTGCATTTC CATCTCAAAT AGATGTGGGC
GGTGATGGAG GCGGTAATAT TCATATCCAA GGTAAAGCTG TAGAGATGAA TGGCCGTGAT
GTCCGCTTAT CGACTCAAAC ATTGGGTTTC AATAAAGCTG GAGATATAAC GATTGACGCA
AACAAATTAG TAATTCGTAA TGGGGCGCAA ATATTAACAT TCACTTTCGC TCAGGGAGCA
GGTGGAAATT TGGTTATCAA CGCGGCTGAA TCTGTAGAAG TGATTGGTGG CTTTCTTGTA
CCGAATACAA ATCGTGTTGA ACCCAGTGCA TTGACAAGTT CAACTGCGTC TGATGGTAAA
GCGGGTGATA TCACAATTAA TACAAAAAGA TTAATTGTCC AAGTTGGGGG AAACATATCT
ACCAGTTCCC TTGGGTTACC AGATTTTAAT AACGAAAGAA TAATATTAGC TACAGGAGCA
GGAGGAGATT TAATAGTCAA TGCTGCTGAT TCTGTAGAAA TAAGTGGCAA AGCCAGAAAT
GTTCCTAGTT CTTTGTCCGC TTCTACAAAT AGTTATGGTA ATGCTGGCAA GTTAATGATA
ATTACAGGAA AATTGACTGT AAAAGATTCA TCTGAAGTAG CTGTTGGCAG CGTCGTTTTA
AAATTTGCTG GACTGAGAGT TGATAATAGC AATTTAGGCT CTGCTGGGGA ACTAAACATA
ACTGCTCGTT CTATAGTTCT GGATAACCAG GGAAAACTTT TATCAAATAG TGAATCAGGT
CAAGGTGGTA ATATTGAGAT CCAGGTGCAA AACACTCTAC TGATGCGCCG TAACAGCCAA
ATCTCCACAA ATGCAGGTAT CACATCTGGG CGAGGCGATG GGGGTAATAT TATCATCAAT
GCACCTAATG GTTTTATTAT TGGCACTCCT AATGAAAATA GTGATATCAC CGCCAACGCA
TTCTCTGGTA GTGGTGGGAA AATCACTATA AATACTAATA GTGTTTTGGG GTTTGTACCT
CGTAATCGTG CCGAATTAGT AAAGTTGCTA GCAACTGAAA ATCCTAAAAA ACTAGACCCA
AATAAGCTTC CTAGTAATGA TATTACAGCT TTTTCTCAGG AAAACCCTTC ATTCAATGGC
ATCGTACAAA TTAATACACC TGATGCTGAT CCTAGTAAAG GGTTAATAGA ATTGCCGAGA
AATATCATTG ATACTTCTGA GAAAATTGTT GCTAGTTGTC ATCCTAGTCG AATTTCGCAA
AGAAGTTCAT TTGTTTCTAC AGGAAGAGGA GGAATTGCAT CTAGCCCTAC GGATACGCTG
ATTGATGATG CTGTGTTGGT AGGTTGGGCT GCATTACCGA CACAGATAGA AAGTAGTGCT
GATAGTATTC CTGAACAACA CGCACAGCAG GATAATTTGA ACGTCTCTCC CCAAATTTTA
GAAGCACAAG GCTGGCAAAT AGATCGCGAC GGTAATATAG TACTGGTGGC GCAAGCACCT
ACTGTAAATC CTCATACTCC AACACTACAT TCGTCAGCTT GTACAGAATA G
 
Protein sequence
MNNFWVWIPK LGVAIAGVLT CMTDYAVAQV VHDNTLINNS QINTHNNIIS IEGGTRAGDN 
LFHSFEQFSV AQGITAEFKN AVDVKNIISR VTGNSISQID GILKSQYEAN LFLINPNGII
FGSNASLNIG GSFVATTADT LIFADRRNYS TRELSLVPSP LSISVPIGLQ FGANPGSIHN
QSQTSPNGAM NSLFEPVGLQ VKPGKTLALV GAGLIMEGGN LTAPLGRIEL GSVGSNSYVS
LSLVNQGWIL GYQGIQDFQD IRLIQRTVDG SAFPSQIDVG GDGGGNIHIQ GKAVEMNGRD
VRLSTQTLGF NKAGDITIDA NKLVIRNGAQ ILTFTFAQGA GGNLVINAAE SVEVIGGFLV
PNTNRVEPSA LTSSTASDGK AGDITINTKR LIVQVGGNIS TSSLGLPDFN NERIILATGA
GGDLIVNAAD SVEISGKARN VPSSLSASTN SYGNAGKLMI ITGKLTVKDS SEVAVGSVVL
KFAGLRVDNS NLGSAGELNI TARSIVLDNQ GKLLSNSESG QGGNIEIQVQ NTLLMRRNSQ
ISTNAGITSG RGDGGNIIIN APNGFIIGTP NENSDITANA FSGSGGKITI NTNSVLGFVP
RNRAELVKLL ATENPKKLDP NKLPSNDITA FSQENPSFNG IVQINTPDAD PSKGLIELPR
NIIDTSEKIV ASCHPSRISQ RSSFVSTGRG GIASSPTDTL IDDAVLVGWA ALPTQIESSA
DSIPEQHAQQ DNLNVSPQIL EAQGWQIDRD GNIVLVAQAP TVNPHTPTLH SSACTE