Gene Ava_2351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2351 
Symbol 
ID3683466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2918993 
End bp2921485 
Gene Length2493 bp 
Protein Length830 aa 
Translation table11 
GC content44% 
IMG OID637717696 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_322864 
Protein GI75908568 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAA CTTCTGTGTG CTTGGGTCTA ACTTGTGTGT TCTTAGCTGC TGGAATGTCT 
TTACCAGCCA CTGCTCAAGT GACTTCTGAT GATACCACAA ACACTACTGT CAACCCAAGC
GGTAATAATT TTCAGATTAT TAACGGTACT GCCAGAGGAA ATAATTTATT TCACAGCTTT
AGTAATTTCT CTGTACCCAA AGACTCTTCA GCAACATTTG ACTTAACTAA TACACCAAAT
ATTACAAATA TATTTAGCCG AGTTACTGGT GGTAATGTTT CTAATATTGA TGGAATGATT
GAAACTGTTA AGAGTAATAA TCCAGTAAGT TTGTTTTTAA TGAATCCGGC AGGGATTGTG
TTTGCTCCAA ATGCCTCCTT GAATATTGGC GGGTCGTTTG TAGGAACAAC GGCAAATAGT
ATTAAGTTTG CTGATGGGAC AGAGTTTAGT GCCAATAATT CCAGTGTGAC ACCATTACTG
ACGATGAGTG TTCCAATTGG CTTGCAAATG GGAAGTAATC CAAACCCCAT CACAGTTCAA
GGTACAGGTT ATGCCCTAAC ATCTGTCAGT ACCGTTGCCC CCATCACTCA AAGTCCCAGT
ACCTCAGAAC TAAGGGTAAA GCCAGTAAAG ACCTTGGCAC TGGTAGGGGG CGACTTAAAT
CTTACTGGGG CAACCTTGAA CGCGCCCGAA GGACGTGTCG AATTGGGAAG TTTAAGCGGA
GTGGGAATGG TCAGTTTGAA CCCCATTTCT CAGGGTTATC AGTTAAGTTA CGAGGGCGGG
CAAAGCTTTG CAGATATCCA ACTTACGCAA AAATCCTTGC TGACTGTAGG AGCATTATTA
AGTGCAGGGG CATTAAATGC TGGTTCGGTA CAATTACAGG GAAGGCACAT TCAAATCAGT
GATGGTTCAA TTATATTTTC CAAAAATCTG GGAAATGTCG CTGGAGGTGA AATTATTTTA
CAAGCCTCGG ACAGTATCGA TATTATAGGT ACAACAGCCA ACGCCCAAAT TCGCAGTGGC
ATCCGCTCTG AAGGTTTGAA TACTGGCACA GGTTCACCCA TCCATATCAT CACTCCCCAC
TTAACCCTCT CACAAGGAGC AGGAGTTAAC AACAATGCTT TTGGATTTGC TGCCAGTGGT
GGGATTCAGA TTGATGCACA GACAGTTGAG TTATCTGGTT TTTCTCCCAT CAATCCTACA
GGAGTAACGT CCCTGACCAT CTCTTCCCGA ACTCCCAAGT CAGCAGGCAA TCTCTCCATT
AACACTAATA GTCTACTGGT TTCTCAGGGA GCAGCCATCT CCTCAGTAGC ATTTGGGACT
GGTTCTACAG GGCAAGTCAC CATTCGCAGT CAGAATACTA CTGTCACGGG AGACAATCCG
GCAGGACTCT ATAGCAATAT TAGTGCAATC ACATATGGCA CAGGCGATGC CCAAACCTTG
ACGTTAGATA CGAAACGATT ACAACTACTC GATGGGGGAG TAGTCGCAAC CACATCCTTT
TTGATTGGTA AAGCTGGAAA TTTAAACATT AACGCGACTG AATCGATTCT GATTGACGGG
CGCAGCCAGG CTAACAATAG CAGTATTAAC TCAGCCGTCC TGATCCCGCC AGCATTAATC
AAACAGTTAT TCAGGTTGCC TAATATTTTG TCGGCGAATG GTGGAACTGT GAATGTAACC
ACCCCTACCT TGACGTTAAG CAATGGCGGC GCAGTTAGTG TGACTAATCA GGGTGCAGGC
GATGGGGGCC ATATAAACAT TACTGCCAAT ACGGTATTTT TAGATCGTCA AGGTAGCGTT
CAAGCTCAAA CGTTGTCGGG CGAGGGCGGT AATATTACCT CACAAGTCAG TGAGCTGCTG
TTACTACGTC ACAATAGCCT GATTAGTGCG ACATCCGGGA ATACAGGTAA TGGTGGCAAT
ATTAGCATTA ATGCTCCTGT TATCGCTGGG CTAGAAAACA GTGACATTAT TGCTAATGCA
GTGAAAGGAA GAGGTGGCAA TATTGATATC AGGACTCAAG GTATTATTGG TCTAGCATTC
CGTAATACTC TCACCCCAAG AGTTGAGCAG ACCAACGACA TCACAGCCAG TTCCGAATTT
AACATTAATG GCACAGTAAA AATTAATAAC GTTGGTGTTG ATCCCAATTC GGGTTTAGTT
GCACTACCCG CAAATATTAT CGATCCATCC CAGCAAATAG CTAGTGGTTG TTCTGCCGAT
ACTGGCAGTA GTTTCGTCGC CACAGGGCGG GGTGGAATAC CGCAAAATCC TAGTCAAGAA
ATGAGGAGCG ATCGCACTTG GTCTGATACC CGTGATATAT CTGCATTCCA AAACAGACAG
CAGACACAAG CACAAGCACC CACACAGCCA CGAATCCCCG TCCAAGCAAA TTCCTGGCAT
CGTAACAACC AAGGCAAAAT TGAATTAATT GCGGATCAAT CCCTAACACA GGGACAAACA
GCATTAACCT GCATGGCTAT ACCCCAGAAT TAA
 
Protein sequence
MKVTSVCLGL TCVFLAAGMS LPATAQVTSD DTTNTTVNPS GNNFQIINGT ARGNNLFHSF 
SNFSVPKDSS ATFDLTNTPN ITNIFSRVTG GNVSNIDGMI ETVKSNNPVS LFLMNPAGIV
FAPNASLNIG GSFVGTTANS IKFADGTEFS ANNSSVTPLL TMSVPIGLQM GSNPNPITVQ
GTGYALTSVS TVAPITQSPS TSELRVKPVK TLALVGGDLN LTGATLNAPE GRVELGSLSG
VGMVSLNPIS QGYQLSYEGG QSFADIQLTQ KSLLTVGALL SAGALNAGSV QLQGRHIQIS
DGSIIFSKNL GNVAGGEIIL QASDSIDIIG TTANAQIRSG IRSEGLNTGT GSPIHIITPH
LTLSQGAGVN NNAFGFAASG GIQIDAQTVE LSGFSPINPT GVTSLTISSR TPKSAGNLSI
NTNSLLVSQG AAISSVAFGT GSTGQVTIRS QNTTVTGDNP AGLYSNISAI TYGTGDAQTL
TLDTKRLQLL DGGVVATTSF LIGKAGNLNI NATESILIDG RSQANNSSIN SAVLIPPALI
KQLFRLPNIL SANGGTVNVT TPTLTLSNGG AVSVTNQGAG DGGHINITAN TVFLDRQGSV
QAQTLSGEGG NITSQVSELL LLRHNSLISA TSGNTGNGGN ISINAPVIAG LENSDIIANA
VKGRGGNIDI RTQGIIGLAF RNTLTPRVEQ TNDITASSEF NINGTVKINN VGVDPNSGLV
ALPANIIDPS QQIASGCSAD TGSSFVATGR GGIPQNPSQE MRSDRTWSDT RDISAFQNRQ
QTQAQAPTQP RIPVQANSWH RNNQGKIELI ADQSLTQGQT ALTCMAIPQN