Gene Ava_4131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4131 
Symbol 
ID3681207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5145463 
End bp5148009 
Gene Length2547 bp 
Protein Length848 aa 
Translation table11 
GC content41% 
IMG OID637719477 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_324625 
Protein GI75910329 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.343736 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0350194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAACT TACAAACAAC AAAAATTTTA TTTTTTTTTA CATTGTGTCT AGTGCTAGAA 
ACACCTTTAA AAGGATTAGC ACAAACTCAA CTGAACCCTG ATCATACTCT ACCCACTAAT
GTCAATAGTA TTGGTGGTGT GTACGATATT ACTGGCGGCA ATAGACCGAA TAATGGTGCT
AATCTTTTTC ACAGTTTCCA AGATTTTTCA ATTCAATCAG GAGATACTGC TAGATTTATT
TACGATACAG GAATTAGCAA TATCATCACA CGGATTACGG GAGGGTCACC TTCCCAAATT
AACGGGACTA TTCAAACACT TCTCAATGGT ACTAATAATA TAGGTAATGC CAATTTATTT
CTGATTAATC CACATGGAAT TATCTTTGGT GCAAATGCCA AATTAGATAT AGGCGGCTCA
TTTATTGGGA CTACAGCAGA TAGTATCAAA TTCAATGATG GGAAAGAATT TAGTGCTATC
AACCCTACAG TTAACCCGAT TTTGACTGTT AATGTACCTA TCGGTTTACA ATTTGGTTCT
CACCCCACCA GCACCATTCA AGTACAAGGT TCAGGCAACA ATTTCCAACT CAATCCTGAT
TTATCTGTTG ATAACAGTAA CCGTCCATCA GGATTGAGCT ATCAAACCCC AAATGCTCAG
ACTTTAGCAC TAGTTGCAGG CAAGGTGGAA TTAGCTGGAG GAAATATTAC TGTACCCCAG
GGAAGAATTG AATTATGGTC TGTGAATAGA GGTGAGGTGA CAATCACCAA TCCTAGTGGA
CATCTGCAAC TACAACCCAC ACCAGGAATT AGTTACGGTA ATGTTGACCT TGTAAATGCT
GCTTCTGTAG ATGCTAGTGG TAATAGTGGT GGTTCTATCG AGGTGCGAGG GCAAAATGTG
ACTCTCGACA ACGGTTCAGT CATAGTTACA GATACCACAG GTAGCGGTTC TGGAGGGATA
TTGAATATAT CTGCATCAGA GGTATTGACT GTTAAAGGCT TTGTTTTGAA CCCTAATAAC
CAGATATCTA GCGGTATATC AGCTGATGTT GCTTCAGGTG CAAGTGGAGA AGGAGGTAAA
GTCACAGTTA CTACAAAAAC TTTGCAAGTG AGCAATGGGG GTCAAATTTC CAGTGGTACT
TTTGGCACTG GAAATGCTGG AGAATTGAAC GTTACAGCTC AGGATGTGCA GATACGTGGT
ATTTCTCTCT TTGGGCCTAG TGGTTTATTT GCTCCTGTTG CTCCTGGCGC AAGAGGAAAC
GGGGGAAACT TAACAGTTGA AACTAATAAA TTACAAGTTA CTGATGGTGG ACAGATATTT
ACTAATACCT TGGGCTTTGG TAAAGCTGGT GACTTGAAAA TTCTCGCTCA AGATGTAGAG
GTCAGTGGTG GGACAGAATT TGGGCCTAGT ACCATTGCAG CCACAGTCCA AAAGATATTG
AGTATTCCAG AGCCAGCTGC AACTTTTTTA GGCGCTGGTT TTGGTAATGC TGGTAATTTA
ATCATTGAAA CCAGCAATTT ACGAGTTACT GACGGGGGTC AGATTGCTGT TAGCACCTCT
GGGAATGGCT CGGCTGGTAA CATGACAATT AATGCTAACT CAGTAGAATT AGCAGGTACT
AATCAATTTG GTCGTAGTGG TTTATTCGCT AATGCTATTG TTGGTAAAGG TCAAGGTGGC
GATGTTAATA TCAGTAGCGA TCGCTTAGTT GTTCGTGATG GTGCAACTAT TAATGTCAGT
AGTTTCCTTA GTAGAGACCC AGGAAATCTG CGGGGTTTAG CTGGAAAAGG GGCGGCGGGA
AATATAAATC TCAATTCTGC TGATATTTTA CTAGCAAATC AAGGGATCAT TACTGCTGAT
ACTAATGCCG GGGATAAAGG CAATATTACG ATTCAATCGG ACACCCTGCA AATACTACGT
GGTAGTCAAA TTAGCACCAA TGCGCGCAAT AGTGCAGTTG GGGGAAATAT TAATATTACT
ACCAATACTT TAGTTGCTTA CGAAAATAGT GATATTAGTG CTAATGCTCA AAAAGGTTTT
GGTGGTAGGG TAGTTGTCAA TGCCAAAGCA GTTTTTGGGA TTCAATTCCG TCCCCAACCG
ACTCCAGACA GTGACCTGAC GGCTTCTTCT GATTTGGGTG CGGAGTTTAA TGGTACTGTA
GAACTGAATA CACTAGATGT TGATCCTACT AGCGGATTAG TGAAGCTACC GACTAACTTT
AGCGATCGCT CACAGCAGAT AGCTAGTGGT TGTAGTGTGA CGCAAAAGAA TCGTTTCGTT
GTTAGTAACC GTGGTGGCTT ACCCACCAAC CCTACCGATA CCCTCAGAGG TGAGATAGTT
TGGTATGATG TCCGTGATTT ATCCAATGAA GTAGCTAACT CAACAGCAGG CAGTAACTAT
CAAACTGTTA ATAATCAAGA ACCAATTGTT GAGGCTCAAG GATTAATTGT TGGTGCAGAT
GGTACAATGC AGCTACTAGC ATCCATCCCA CAGGTAACGC CTCTCACTCC GTGGCAAGTA
TCACCTTCAT GTGATGTTAA ACCCTAA
 
Protein sequence
MFNLQTTKIL FFFTLCLVLE TPLKGLAQTQ LNPDHTLPTN VNSIGGVYDI TGGNRPNNGA 
NLFHSFQDFS IQSGDTARFI YDTGISNIIT RITGGSPSQI NGTIQTLLNG TNNIGNANLF
LINPHGIIFG ANAKLDIGGS FIGTTADSIK FNDGKEFSAI NPTVNPILTV NVPIGLQFGS
HPTSTIQVQG SGNNFQLNPD LSVDNSNRPS GLSYQTPNAQ TLALVAGKVE LAGGNITVPQ
GRIELWSVNR GEVTITNPSG HLQLQPTPGI SYGNVDLVNA ASVDASGNSG GSIEVRGQNV
TLDNGSVIVT DTTGSGSGGI LNISASEVLT VKGFVLNPNN QISSGISADV ASGASGEGGK
VTVTTKTLQV SNGGQISSGT FGTGNAGELN VTAQDVQIRG ISLFGPSGLF APVAPGARGN
GGNLTVETNK LQVTDGGQIF TNTLGFGKAG DLKILAQDVE VSGGTEFGPS TIAATVQKIL
SIPEPAATFL GAGFGNAGNL IIETSNLRVT DGGQIAVSTS GNGSAGNMTI NANSVELAGT
NQFGRSGLFA NAIVGKGQGG DVNISSDRLV VRDGATINVS SFLSRDPGNL RGLAGKGAAG
NINLNSADIL LANQGIITAD TNAGDKGNIT IQSDTLQILR GSQISTNARN SAVGGNINIT
TNTLVAYENS DISANAQKGF GGRVVVNAKA VFGIQFRPQP TPDSDLTASS DLGAEFNGTV
ELNTLDVDPT SGLVKLPTNF SDRSQQIASG CSVTQKNRFV VSNRGGLPTN PTDTLRGEIV
WYDVRDLSNE VANSTAGSNY QTVNNQEPIV EAQGLIVGAD GTMQLLASIP QVTPLTPWQV
SPSCDVKP