Gene Ava_2349 details


Gene Information

Locus tag: Ava_2349
Symbol:
ID: 3683464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 2913995
End bp: 2916409
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 45%
IMG OID: 637717694
Product: filamentous haemagglutinin-like protein
Protein accession: YP_322862
Protein GI: 75908566
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
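
The listed gene and protein lengths can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the 2415 bp given in the record) and that the final codon is a stop codon. The variable names are not part of the record.

```python
# Values copied from the record above.
start_bp = 2913995
end_bp = 2916409

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2415 804
```

Both results agree with the Gene Length and Protein Length fields.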


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATGC TACCGTGCTG TGATCCTGTC AGTGGGCAGG TAATTCCCGA TGGTAGTGTG 
AATACCAGAG TCTCTCAAAG CGGCGATCTC TTCACCATCA CTGACGGTAA TCGAGTAGGA
GATAACTTAT TTCATAGTTT CAGTCAATTC TCAGTTCCCA GCAACGGCAC TGCTTTCTTC
AATAACGCCT CAGATGTTCA AAATATTTTT AGTCGTGTCA CAGGAGCCAG TGTTTCCCTA
ATTGACGGTT TAATACAAGC TAATGGTAAC GCTAATCTAT TTTTGCTCAA TCCCAAGGGA
ATCATTTTCG GAGCAAATGC TCAGTTAAAT ATTGGCGGAT CATTCATCAG TACAACTGCC
AACAGTATCG AATTTGCCGA TGGCATTGAG TTTAGTAGCA TCAATTCTCA ACCTGCTCCA
TTGTTGAGTA TCAATGTTCC CGTTGGTCTA CAACTAGGAA ACAATCCTGC ACCAATTAAC
CTTCAAGGTA CAGGACACAC TTTAACCAAT ACCAGTGGAC TGACTCTAGC TCCCCTAATT
CGGACTCCTC ATCCCACAAA ACTACAAGTA CCATCGGGAA AAACCATAGC CCTAGTCGGC
GGTGACATCA ACCTCAATGG GGCAACCTTA ATCGCGGAAA CGGGAGGAAT AGAATTAGGT
AGTGTGATTA GTCCGGGATT AGTCAACCTC ATACCTAGCG CCCAGGGCTA CACCTTGGGA
TATAGTAATA TACAACACTT TGGCAATATT CAACTAGCCG AGCGATCGCT ATTAGATATA
AGCGGTGTAA ATGCTGGTTC CGTGCAGATT CGAGGTGGGC AACTGCGATT CACCGACGGT
TCTCTAATCC TGTCACAAAA TTTCGGCGCT CGCCCTGGTG GTGAAATTCG CCTCCAGACT
ACAGAATCCA TTGAATTAAT AGGCAGAACA TCTGATGTTG GAATCCGGGG TGGGATACGT
AGTGAAGCCT TAGGTATAGG AAACAGTAGC AATATCAATA TTATTACCCC TATTCTCACC
GTCAGTCAGG GAGCAGGTGT GACTAACTCA ACTCTAGGAT CTGCTTCTAG TGGCAATATC
AACATTGAAG CAACTGCAAC TTACCTCTCC GGCTTTTCAC CCACAAATCC TGTTGAAGTT
ACCACAATTA ATACTTCCAC ACTAGGCAGT GGAAACGCGG GTGATCTTTT CTTCAATGGC
AGTAGTCTAT TAATATCAGA TGGTGCTTCA CTGTCTTCGA CCACCTTTGG CAACGGTTCA
AGTGGTAAAG TCACAATTCG CAACACTAAT ACCACCGTCA TCGGAGAAAG TCCCTCTGAA
CTGTACAGCA ACATTAGCTC AACGACATTT GCAACTGGAA ATGCTCAAAC CCTGACCCTA
GATACTAAGA ACTTACAAAT CTTGGATGGG GGAGCAGTGG CCACCACAGC ATTTTTTGTA
GGTAGTGGCG GAGACTTAAG CATCAACGCC AGCGAATCCA TCACCATTAG TGGTCAAGGT
CGAACCATTA GCAGTAGCAT TAACGCCTCC ACTTTGCGAC CCGATCCTTT TTTACGACAA
AGATTTGGTC TGCCGGACAT ACTAACAGCA AATCCCGGCT CTGTCAGCAT CACCACACCT
AAGCTAACAT TAACCAACGG TGGAACCGTC GGTGTCACAA ATCAAGGTAG CGGCAACGGT
GGTAATATGA GCATCGCTAC TAATACCATC CAATTAAAAA ATCAAGGTTT TATTCAAGCA
CAGACAAAAT TTGGCAACGG TGGGAATATT CAATTATCTG CAACAGACCT ATTGCTATTG
CGGCAAAATA GCCTAATTAC CTCCACATCT GGTGGCTCAG GAAATGGAGG TAACATCAAC
ATCAATGCAC CCATCATCAC GGGACTAGAA AACAGCGACA TTATTGCTAA TGCCATTAGA
GGTCAGGGAG GCAATATTCA AATTACCACT CAAGGTATGT TTGGACTAAA ATTTCGTGAC
CAAATCACCC CGGAAAGTGA CATTACAGCC AGTTCCCAAT TTGGATTAAG TGGCACAGTC
CAAGTTAACA CTGTGGGAGT TGACCCAAAC TCAGGTTTAG TCGAATTACC AGCCAATGTT
ACCGATCCAT CCCAGAAAAT AGCTACAGGT TGCTCTAATA ATCAAGGTAG TAGTTTTATC
GCCATAGGAA GGGGTGGAGT ACCGCAAAAC CCGACACAAG AAATCAGGAG CGATCGCTCC
TGGTCAGACA CCCGCGACAT CTCTGCATAC CGGGAAACAC AGACAGTCCA AGCTCAAATC
CCGCAACCTC CAGAAACGCT TGTCCAAGCA ACGTCCTGGC AACGTAACCC TCAAGGTAAA
ATTGAATTAG TGGTGGCTAA ATCACCTCCG CAGATGCCAC CAGCCCTAAC CTGTGCTGCT
GTAGTTGAAA ATTAA
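
The gene sequence is laid out in blocks of 10 bases, so it has to be cleaned before any calculation such as the GC content field. A minimal sketch, assuming the sequence has been pasted into a string; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "ATGACAATGC TACCGTGCTG TGATCCTGTC AGTGGGCAGG TAATTCCCGA TGGTAGTGTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running `gc_percent` on the full cleaned sequence gives a value that can be compared with the GC content field.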
 
Protein sequence
MTMLPCCDPV SGQVIPDGSV NTRVSQSGDL FTITDGNRVG DNLFHSFSQF SVPSNGTAFF 
NNASDVQNIF SRVTGASVSL IDGLIQANGN ANLFLLNPKG IIFGANAQLN IGGSFISTTA
NSIEFADGIE FSSINSQPAP LLSINVPVGL QLGNNPAPIN LQGTGHTLTN TSGLTLAPLI
RTPHPTKLQV PSGKTIALVG GDINLNGATL IAETGGIELG SVISPGLVNL IPSAQGYTLG
YSNIQHFGNI QLAERSLLDI SGVNAGSVQI RGGQLRFTDG SLILSQNFGA RPGGEIRLQT
TESIELIGRT SDVGIRGGIR SEALGIGNSS NINIITPILT VSQGAGVTNS TLGSASSGNI
NIEATATYLS GFSPTNPVEV TTINTSTLGS GNAGDLFFNG SSLLISDGAS LSSTTFGNGS
SGKVTIRNTN TTVIGESPSE LYSNISSTTF ATGNAQTLTL DTKNLQILDG GAVATTAFFV
GSGGDLSINA SESITISGQG RTISSSINAS TLRPDPFLRQ RFGLPDILTA NPGSVSITTP
KLTLTNGGTV GVTNQGSGNG GNMSIATNTI QLKNQGFIQA QTKFGNGGNI QLSATDLLLL
RQNSLITSTS GGSGNGGNIN INAPIITGLE NSDIIANAIR GQGGNIQITT QGMFGLKFRD
QITPESDITA SSQFGLSGTV QVNTVGVDPN SGLVELPANV TDPSQKIATG CSNNQGSSFI
AIGRGGVPQN PTQEIRSDRS WSDTRDISAY RETQTVQAQI PQPPETLVQA TSWQRNPQGK
IELVVAKSPP QMPPALTCAA VVEN
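
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method: it builds the codon table from the NCBI amino acid string in TCAG order, whose assignments are the same for translation table 11 and the standard code. `translate` is a hypothetical helper; it stops at the first stop codon and would raise `KeyError` on ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACAATGC TACCGTGCTG TGATCCTGTC"))  # prints: MTMLPCCDPV
```

The output matches the first 10 residues of the protein sequence, and translating the final 15 bases (`GTAGTTGAAA ATTAA`) gives `VVEN`, the last four residues.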