Gene Ava_3737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3737 
Symbol 
ID3678941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4650372 
End bp4652789 
Gene Length2418 bp 
Protein Length805 aa 
Translation table11 
GC content42% 
IMG OID637719087 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_324237 
Protein GI75909941 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.219183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAA ACGGCAAAGG TAGTTCATGG CAATTGAACT TAGTAATTTC CTTAACATCT 
GCTGGTGTTA TGAGCATCAG TATGCCTGCT TTGATTGTGC AATCTGCCTT TGCTCAGAGT
GTGATTACGC CCGATCAAAC CTTGGGTAAT GAAAGGTCTG AGGTAAGTGA AAACTACGAC
AATACCCCTA CAGAGTTAGT TAGAGGAGGG GCGATTCGGG GTAATAACCT CTTCCATAGT
TTCCTAGAAT TTAACGTTAA TGAAGGACGT TCAACACTTT TTGTAGCTCC CAACAGTAGC
ATCCAGAACA TTTTTGTCAG AGTTACAGGT AATAATCGCT CTGATATTTT CGGGAAGTTA
GAGACCTCTG GTGTTAATGC CAATTTGTTT GTGATTAATC CCAATGGGAT TTTTTTTGGG
CCGAATGCCA GACTGAATAT AGGTGGTTCT TTTGTGGCGA GTACAGCTAG TGGAATTCAG
TTTGGCGATC GCACCATTTT TAGCGCTACT TCTCCCCAAC CTTTACTGAC AATGAGTGTA
CCTACGGGAT TGCAATTTGG CAGAACAGGG GGAGAGATCA ATTTACAAGG GGAATTAGTA
GTTCCCACAG GCAAAACCCT GGCATTCGTA GGTGGCAATG TAACTCTAGA TGGTGGTAAT
GTTAGCTTTC TCCGGCGTAG GACTCTCAAG GCGGAAAGTG GTCAAATTGC TATAGGTGGA
ATACTAGAAG TAGGTACAGT AGGTCTAAAT TTGGCTGGTA ATAACGACAT TAACAACCAA
ATTTTAAGTT TTTCTGATGA CGCAGTCTTA GGTAACGTAT TCATCCAAAA TCAAGCCAGG
GTAGATGTCA GTGGTCAGGG TGCTGGTTAT ATCGAAATTA AAGGTAAGCA GATTGGACTC
ACTCGTGCAT CGCAAGTGCT AGCAGAAACA GAAGGTAGTC AAAGCAGTCG CGGAATTTTT
ATCCAGGCAG AAAAATTAAC CCTTGACGAT GGCTCACAAG TAACAGCATC TGTGAATAAT
CCTCAATCTA CCGTTTCCGG AGGCAATGTT ACAGTCAAGG CAAGTGATTC GGTACGAGTA
ACGGGAATAG TACCGAATAA TCTAGAACGT TCTGGGAATC CCAGTGGTCT GTTTACTAGA
ACTGCCGGTA AAAGACCTGG GGGAAATTTG ACTATTACAA CAGGTAAATT AATTGTTGAA
GATGGGGGTA ACATATCCGC TCGTACTAGT GGAAATGATA GTCAGAGTAT TGGCGGAACG
ATCAAAATAA CTGCATCAGA ATTGGTCAAA CTGATCGGCA ATACTAAGGA TTCTAGAGAA
TTTCCCAGTA GTGTGTTTGC TCAGACTTTG GGTGCTGGGA ATGCAGGTTC TGTGACGATT
GATACTCCAG CGTTATTTGT CCAAGATGGT GCAGTCATAT CAGCTGGAAC TCAGACAAAT
AGCCAGGGTA ATGGGGGAAA TATTACAATA AAAGCCTCTG ATTTTATAGA AATAAGCGGA
AGTTCGCCAA TTGAGAAATT TCCTAGTGGC TTGTTTGCTC GCAGTCGAGG TAGTGGCAAC
GCAGGTTCTA TATTAATTAC TACAGGTCAA CTTAATGTGC GCGATCTCGC TACAGTCACA
GTAGAAACAT TAGGCACAAG TAATGCTGGC GAAATAAAAA TTAACGCCAC AAGAATCAAC
CTTTATGGTA AAGCAAATTT GAACGCTACA ACTCCATTAG GTAATGGTGG TAATATCAAT
TTACAAATAG ATGACCAACT ACTTTTACGC CGTGGTAGTT TCATCTCTAC TAGAGCAGGC
ACTACAAGCG GTAGGGGTAA TGGCGGTAAT ATAAATATTA ATATTCCCAA TGGCTTTATC
GTTGCCGTTC CTGGTGAAAA TAGTGACATT ACAGCTAATG CTTTTCAAGG TCAAGGTGGT
AACGTTAGCA TCAATGCCTT TAGCGTTTTT GGCATAGAGT TTCGAGAAAA AGATAGTCCA
CTCACCAACG ACATCACAGC TAGTTCTGAA TTTGGGCTAA ACGGTACAGT CGAAATCAAT
ACCCCAGAAG TTCAACCGAA CCAAGGACTA ATTAATTTAC CAACACAACC TGTTGAACCC
CAGCTAGCCC AAGTTTGTCA AGCAGCCGCA GGACGAAATC AAGACAGTTT TACTATCACC
GGGCGTGGTG GTTTGCCAAA CAACCCTAAT GAACTCCTTT ATTCTGATGC TGTTCTTACA
GATTGGGTAG CTGTGAGTAA TGTAGAAAAC ATCCCCAATA CTCCTATCAG CAAAAGCATC
TCTACTCCAA CCCAAACTAA TATCGTGGAA GCTACGGGAT GGGTAATCAG CCCTCAAGGT
GAAGTTGTTC TCACAGTGAA TACACCTAAT ACAAAATCCC CTAATTCTTG GCAAAAAACT
AGTGCTTGTA ATTCGTAA
 
Protein sequence
MTQNGKGSSW QLNLVISLTS AGVMSISMPA LIVQSAFAQS VITPDQTLGN ERSEVSENYD 
NTPTELVRGG AIRGNNLFHS FLEFNVNEGR STLFVAPNSS IQNIFVRVTG NNRSDIFGKL
ETSGVNANLF VINPNGIFFG PNARLNIGGS FVASTASGIQ FGDRTIFSAT SPQPLLTMSV
PTGLQFGRTG GEINLQGELV VPTGKTLAFV GGNVTLDGGN VSFLRRRTLK AESGQIAIGG
ILEVGTVGLN LAGNNDINNQ ILSFSDDAVL GNVFIQNQAR VDVSGQGAGY IEIKGKQIGL
TRASQVLAET EGSQSSRGIF IQAEKLTLDD GSQVTASVNN PQSTVSGGNV TVKASDSVRV
TGIVPNNLER SGNPSGLFTR TAGKRPGGNL TITTGKLIVE DGGNISARTS GNDSQSIGGT
IKITASELVK LIGNTKDSRE FPSSVFAQTL GAGNAGSVTI DTPALFVQDG AVISAGTQTN
SQGNGGNITI KASDFIEISG SSPIEKFPSG LFARSRGSGN AGSILITTGQ LNVRDLATVT
VETLGTSNAG EIKINATRIN LYGKANLNAT TPLGNGGNIN LQIDDQLLLR RGSFISTRAG
TTSGRGNGGN ININIPNGFI VAVPGENSDI TANAFQGQGG NVSINAFSVF GIEFREKDSP
LTNDITASSE FGLNGTVEIN TPEVQPNQGL INLPTQPVEP QLAQVCQAAA GRNQDSFTIT
GRGGLPNNPN ELLYSDAVLT DWVAVSNVEN IPNTPISKSI STPTQTNIVE ATGWVISPQG
EVVLTVNTPN TKSPNSWQKT SACNS