Gene Ava_4181 details

Gene Information

Locus tag: Ava_4181
Symbol:
ID: 3681042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5231460
End bp: 5233739
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 40%
IMG OID: 637719528
Product: filamentous haemagglutinin-like protein
Protein accession: YP_324675
Protein GI: 75910379
COG category:
COG ID:
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
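
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5231460, 5233739)
print(length_bp)                  # 2280
print(protein_length(length_bp))  # 759
```

Both values agree with the Gene Length and Protein Length fields of this record.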


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000153742
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCCGTAG TATCAATATT GACCAGTGAT ATTAGCTCGG CACAAATTAC ACCGGATACG 
ACTTTGGGTA ATCAAAGCTC CCGTGTTACA ACTGGTGTCA ACATCAAAGG ACATGAGACT
GACTTAATTG AAGGTGGTGT CCAACGAGGA AGCAGCCTGT TTCACAGTTT TACAGAATTT
AATGTCAACA ACGGACAAAG AGTATATTTT GCCAACCCTA CAGGCATTGC AGATATTTTT
AGTCGAGTAA CTGGGAGTAA CGCTTCTCAT ATTTTGGGTA CTTTGGGCGT ACATGGTGCA
GCTAATCTAT ATCTGCTCAA TCCCAATGGT ATTATTTTTG GGACAAATGC CCAGTTAGAT
ATTCAAGGTT CTTTGTTTGC TACCACAGCC GACAGTTTTG AATTTCCTGA TGGTAGTGAA
TTTAGTGCCA CTAATCCCCA AGCACCACTG TTAACAATGA GTGTACCTGT AGGTGTACAG
TATGGTTTAC AGCAAACAGC AAGTATTACT AATCAAGGGA ATTTGGTAAC AGGAAAAAAT
TTGACGCTAA ATGCTGGAAG TTTAGATTTA CAGGGCAAAC TCGAAGCTGG TGGTGATATT
TCCCTAAACA GTCAGCAAGG AACAATTGAT ATAAATGGGG AGATTATATC TACAACTCCC
AATGGAATTG CTGGAGATGT TACGATTCAA GCTGTAGGCG ATATTCATAT ATTTAAGAAC
ATAAAAGCCT CTAGCAACAG TGATATTATC ACTAATAATG ATCAGAAAAG ATATAATAAT
ATCGCAATTA ATTCTACCGG AGGTTCTATT TATTTAAATG GTTCTCAATT AAATGTTACT
AACTCTGGTT CCGGCTTTGC CGGAGATATC ATGATTAACG CTAATGACCA AATATCCATT
CTCAATGGTA GTAAGATATC TAGTAATGGA AATATGGGAC GGATATTTAT TGGCGTTGGC
GATAGTAATA AAATTACGCC TAAAAGAGTT AATATTGATA ATTCTGGTCT GAATGTAGAT
AATTATTCAT CTGATGGTAA TACAGGAAGT ATAACTATTT ACAGCTTTGA AGAAATTTCG
ATTAAAAATA GTTATATCTT AAGTGGCAGA GGCAGCATAG CTAGTAGCCG TAATGATGAC
GGCAGTGGGG CAAATGGTGG CACTGTAAGT ATTGAATCAC AAGGCTTAGT AAGTTTGGAC
AGTTTAGTAA TATTAGCTGA TGTTTTTTCT TCAGACCAAA ATGGTACTGG AGGTGACATT
AATATTTCCG CAGGCTCACT AATTATTAAG AATGGGTCTG AATTAGATAC TTCCACATCT
GGTAAAGGGA ATGCGGGTGA TGTGACTATT ACGGCTTCAG ATTCTATTTT GTTAGACAAT
AGTAAAATTT CCAGTAGTCT TAGGAACGCA GCAACAGGTA AGGGTGGAAA TATTGCAATT
ACAACTAATT TTTTGACTGC TGACAAAGGA ACAATAGAAG CTCAGAGTCC AGGTAAGGGA
GATGGTGGCA ATATCAACAT CACAACACTA AATTTGTGGC TTCGCCGCCA GAGTAAAATT
TCCACTACAG CAGGTGCAGA AGGATCGGGT GGTAATGGAG GTAGCATCAG GATTAATGCT
AAGGGTGGAT TTATCGTAGC AGTACCATCA GAAGACAGTA ATATCGTTGC TAATGCTTTT
GGTGGTAATG GTGGCAAGAT TAACATTAGT GCCAACCGTA CTCTAGGATT TCAAAATCGG
GGAAAGTTGA GTCCAAATGA ACTAAACGCG ATTATTACCA ATGGCACTAG CGAAATTAGC
GCCAGTTCTG ATTATGGTGA AGACGGGGAG GTAGCGATTG AAACCCTGAG CATTGACCCT
ACTCAGGGGC TAGTTGAATT GCCGACACAG TTAGTAGACC CTTCTCGATT AATTGCCCAA
GGCTGTGGCT CTCCTAACAA TAGGGTTGCT AAAGGGCAGA GTGAATTTGT GATTACTGGA
CGTGGGGGAC TACCGCCTAG TCCTGATGAT ATGCTCAAGC CTGGGATTAG GTCGCCGGAA
TGGGTGGTAA ATAATACTAG AGATTACAGT AATAATTCTG AAAGCATAAT TGAAAAAGAG
ATGAATTTAA AGCAATTATC GTCAAATACT TCTACTCCGT TAGTAGAAGC AGTGGGGATG
GTTCACAATG CCAATGGAGA TGTTGTTTTG ACTACTCAAC CAGCAGTCGC CACTCCACTA
CATTATTCGG GGTTATCTAG CCAAGTCTGT GGTCTTATTC AAGGGAATGT CAGAGAATGA
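
The gene sequence above is printed in space-separated groups of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation of the kind summarised in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "GTGTCCGTAG TATCAATATT GACCAGTGAT ATTAGCTCGG CACAAATTAC ACCGGATACG"
seq = clean_sequence(first_line)
print(len(seq))          # 60 (six groups of 10 bases)
print(gc_fraction(seq))  # GC fraction of this first line only
```

Passing the full sequence text to `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick way to detect a truncated copy.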
 
Protein sequence
MSVVSILTSD ISSAQITPDT TLGNQSSRVT TGVNIKGHET DLIEGGVQRG SSLFHSFTEF 
NVNNGQRVYF ANPTGIADIF SRVTGSNASH ILGTLGVHGA ANLYLLNPNG IIFGTNAQLD
IQGSLFATTA DSFEFPDGSE FSATNPQAPL LTMSVPVGVQ YGLQQTASIT NQGNLVTGKN
LTLNAGSLDL QGKLEAGGDI SLNSQQGTID INGEIISTTP NGIAGDVTIQ AVGDIHIFKN
IKASSNSDII TNNDQKRYNN IAINSTGGSI YLNGSQLNVT NSGSGFAGDI MINANDQISI
LNGSKISSNG NMGRIFIGVG DSNKITPKRV NIDNSGLNVD NYSSDGNTGS ITIYSFEEIS
IKNSYILSGR GSIASSRNDD GSGANGGTVS IESQGLVSLD SLVILADVFS SDQNGTGGDI
NISAGSLIIK NGSELDTSTS GKGNAGDVTI TASDSILLDN SKISSSLRNA ATGKGGNIAI
TTNFLTADKG TIEAQSPGKG DGGNINITTL NLWLRRQSKI STTAGAEGSG GNGGSIRINA
KGGFIVAVPS EDSNIVANAF GGNGGKINIS ANRTLGFQNR GKLSPNELNA IITNGTSEIS
ASSDYGEDGE VAIETLSIDP TQGLVELPTQ LVDPSRLIAQ GCGSPNNRVA KGQSEFVITG
RGGLPPSPDD MLKPGIRSPE WVVNNTRDYS NNSESIIEKE MNLKQLSSNT STPLVEAVGM
VHNANGDVVL TTQPAVATPL HYSGLSSQVC GLIQGNVRE
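
The protein sequence begins with M even though the gene begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons, and a start codon is translated as methionine. The sketch below illustrates this with a small translator. It is an illustration under stated assumptions, not the pipeline that produced this record: the function name `translate_cds` is hypothetical, and only ATG, GTG and TTG are treated as start codons for simplicity.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; elongation codons in
# table 11 are the same as in the standard genetic code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplifying assumption: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading a leading start codon as M and stopping at a stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCCGTAG TATCAATATT GACCAGTGAT"))  # MSVVSILTSD
```

The output matches the first ten residues of the protein sequence listed above.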