Gene Ava_2353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2353 
Symbol 
ID3683410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2924091 
End bp2926565 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content41% 
IMG OID637717698 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_322866 
Protein GI75908570 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAA CTTTTGTTGG ACTTGGTGTG CTGAGTGTAA TCTGCTTATC TGCTGTTGAC 
AACAATAGTG TTCACGCCCA AGTAATTCAA GATAATACCT TCAATACTTC TGTTACTCCC
ACTACTTCCA TGAACGGAAG TAATGCTTAC ACCATCAACA ACGGCACTCG TGTCGATAAT
AATTTATTTC ATAGCTTTAG CCAATTCTCT ATTCCCACGG GCGATTCTGC ATTTTTCGGC
AATGATCTTA GTATAGAAAA TATTTTTAGC CGGGTGACTG GTGGTAATAT TTCCAAGATT
GATGGTTCTA TCAATGCCAA TGGTAAAGCC AATTTATTCT TACTCAACCC TGCGGGCATT
ATTTTTGGAA AAAATGCCAG CTTAAATATT GGTGGTTCAT TTGTCGCAAC TACAGCTAAT
AGTATTAAGT TTGAGGATGG GACAGAATTT AGTGCTGTAA ATCCTGCCGC TAAAGCGTTA
TTAACAATGA GTGTACCTGT GGGTTTGCAA ATGGGAAGTA ACCCTGGTCA AATTACCGTC
CAGAACACAG GGCATCAGTT GGCATTTCCG ATTCACCCTT TGGTTTCTTC TCCAAATCGC
AGCAACAATC CTGTGGGATT AAATGTTAAT AATGGTAATT TGGCATTAAT CGGGGGAAAA
ATCACCTTAG ATGGAGGAGT TTTAAATGCA CCATCAGGAC ATATTGAACT TGGTAGTGCT
AGTAACGGAA TAGTTAATTT AAACACTGCC TCTCAGAGTT GGAATTTTGA CTATAGCAAT
ATTCAGCAAT TTGGCGATAT TCAGTTGTCC CGTCAATCAC TAGCAGATGC TAGCGGTACT
CCTGCGGGTT CTATTCAATT CGTAGGTCAA AATATTAGTT TAAATGATGC TTCTGCTGCC
CTATTGGTTA ATGAAGGAAA TGGCAATTCA GGGAATATTA ACATTGATGC GAGTGGATCT
TTAACACTAC GAGGAACTGG CACTATAGGA TTTCCTCAAA GTTTATTGCG TGCTGATAAT
TTCAGCGATG GTAGTGGAGG TAATATCATT GCTTCGGCTT CCCAAGTATT CCTTCAAGAT
GGCGGATCTC TTCATGGCAT TAACTTTGCT GAAGGTACAG GAAGTAATAT TTTTGTAGAA
ACTCAGGATT TAATTCAAAT AACGGGAATA TCACCAGTTA GCGGATTTGC CAGTTCATTG
AATACTATCA CCAGAGGTTC TGGAAAAAGT GGCGATATTC AAGTAGCTAC CAACAAATTG
CAAGTTTTAG ATGGTGCTGT TATTGGTAAT TCGCCTTGGT CAAATGGTGC AGGAGGTAAC
ATCACCATTA ACGCCTCTGA TTTTATCGAA GTGGCCGGAG AAAACACAAA AAACTTGGCT
GACAGTGTCA TCGTAGTAGG AACATTCGAC GAAGGAAATT CAGGTTTATT AACTATTAAT
ACTGCCCAAT TGAGAGTACG AGATGGTGGA GGTATAGTTG TCTCTACCGT CAATCAAGGA
AATGCTGGTG ATTTAGTGAT TAATGCTTCA GACACAGTTG AGGTTAGTGG AGTGGGTAGT
ATTTCTGGTC TTCCTAGCCG CATTGGCGTG AGGGCAGAAT TACTTGCACC ACCAATTCGA
CAATTATTTG GATTACCGGA TGTGATTACA GGTAATATAG GTAAGCTTAC CCTCAACGCC
CAACGTTTAC AAGTCACAAA TCAGGCAATT GTAGGGGTTG ATCATCAAGG CATCGGAGAT
GCTGGACAAC TGGAAGTTAA TGCAGATTCC ATCCGACTAG ATAATGGTGG TAGCATTACA
GCCGCTACAG TTCAAGGTGA AGGCGGTAAT ATTTTGATTA ATTCCAATGA CCTGCTATTG
CGTCATGGTA GTTCAATTAT GACTAATGCA GACGGTATTG GTAACGGGGG GAATATGACC
ATTAACTCTG CTGTGATTCT TGCTTTGGAA AACAGTGATA TTGTCGCCAA TGCCGTTCAA
GGTAACGGAG GTAACATTAA CATTACTTCT CAAAGTATTT TTGGGTTGAA ATACCGTAAC
GAACTCACTA CAGAAAGTGA CATCACTGCT AGTTCGCAAT TTGGTTTAAA CGGAACAGTC
AATATTTATC ACTTTGGTGT TGATCCCAAA ACTGCTTTAG TAGAATTGCC AAAAAATACT
ATAGATCCGT CAAAACAAAT TGCTAGTGGT TGTAATGCTA ATACTGGTAG TAGTTTTGTC
GCCACAGGAC GGGGTGGAAT ACCACAAAAT CCCACACAGG AAATTAAGAG CGATCGCACT
TGGTCTGATA TTCGTGACAT ATCTGCATTC CACACCAAAC AACAAGCACA AGCACCAAAA
AATCCCGTAA CACTTGTCCA AGCTACCTCT TGGCGACGTA ACGCCAACGG CAAAATTGAG
CTTTTTGCCG CTAAATCTCC TACAGGTGTG CAAATGTCAT TAACCTGTGC ATCTCTTGCC
AAAAGTCAAC CTTAG
 
Protein sequence
MKLTFVGLGV LSVICLSAVD NNSVHAQVIQ DNTFNTSVTP TTSMNGSNAY TINNGTRVDN 
NLFHSFSQFS IPTGDSAFFG NDLSIENIFS RVTGGNISKI DGSINANGKA NLFLLNPAGI
IFGKNASLNI GGSFVATTAN SIKFEDGTEF SAVNPAAKAL LTMSVPVGLQ MGSNPGQITV
QNTGHQLAFP IHPLVSSPNR SNNPVGLNVN NGNLALIGGK ITLDGGVLNA PSGHIELGSA
SNGIVNLNTA SQSWNFDYSN IQQFGDIQLS RQSLADASGT PAGSIQFVGQ NISLNDASAA
LLVNEGNGNS GNINIDASGS LTLRGTGTIG FPQSLLRADN FSDGSGGNII ASASQVFLQD
GGSLHGINFA EGTGSNIFVE TQDLIQITGI SPVSGFASSL NTITRGSGKS GDIQVATNKL
QVLDGAVIGN SPWSNGAGGN ITINASDFIE VAGENTKNLA DSVIVVGTFD EGNSGLLTIN
TAQLRVRDGG GIVVSTVNQG NAGDLVINAS DTVEVSGVGS ISGLPSRIGV RAELLAPPIR
QLFGLPDVIT GNIGKLTLNA QRLQVTNQAI VGVDHQGIGD AGQLEVNADS IRLDNGGSIT
AATVQGEGGN ILINSNDLLL RHGSSIMTNA DGIGNGGNMT INSAVILALE NSDIVANAVQ
GNGGNINITS QSIFGLKYRN ELTTESDITA SSQFGLNGTV NIYHFGVDPK TALVELPKNT
IDPSKQIASG CNANTGSSFV ATGRGGIPQN PTQEIKSDRT WSDIRDISAF HTKQQAQAPK
NPVTLVQATS WRRNANGKIE LFAAKSPTGV QMSLTCASLA KSQP