Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_37890 |
Symbol | |
ID | 7762681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3834234 |
End bp | 3838493 |
Gene Length | 4260 bp |
Protein Length | 1419 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806655 |
Product | Filamentous hemagglutinin domain-containing protein |
Protein accession | YP_002800908 |
Protein GI | 226945835 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.687016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGGA TATTCGACGT CATCTGGAGT CATGCGCAAA ACGCCTGGGT CGTCACCAGC GAACGCACCG TCAGGCGCGG CAAACCCGGC AGGCTGCGCC TGCCGATCCT GGCCCTGTGC CTTTCGCCAC TCCCCCTGCA GGCCGCCGAC CTGCCCGAAG GCGGACAAAT CGTCCTGGGC GACGGCCTCA TCGGCACACC CGCGAGCAAC AACCTGCAGA TCCTGCAGAA CAGCCGGAAG CTGGCCATCG ACTGGCAGCG GTTCGACATC GGCGCGGACA AGAGCGTGAC CTTCCGGCAG CCGGACAGCG GCGCTATCGC CCTGAACCGG GTGATCGGCA GCGATGGCAG CGCCATCCTC GGCAAGCTGG ATGCGAACGG TCAGGTATTT CTGATCAACC CCAACGGCAT CCTCTTCGGC CAGGGCGCCA GCGTGAACGT CGGCGGCCTG GTCGCCTCGA CCCTGGACCT CGGCAACGAG GACTTCGAGG CCGGCAGCTA CCGCTTCAAG GCCAACGGCC GCCCGGCCGG CATCAATAAC CTGGGCAGCA TCGCCGCCAG CGACGGCGGC TCGGTGGCCC TGCTCGGCGG CCAGGTCGGC AATGACGGAG TCATCCAGGC GAAGCTGGGC ACCGTGGTCC TGGCCGCCGG CGACCAGATA ACCCTCGACT TCGCCGGCGA CGGCCTGCTC AATGTCCAGG TCGACCGGGC CGCTGCCGAT GCCCTGGCCC ACAACGGCAA GCTGATCGCG GCCGATGGCG GCAGCGTGGT CATGACCGCC GGGAGCGGCG ATGCGCTGCT GAAGACGGTG GTCAACAACG AAGGCGTCAT CGAGGCGCGG ACCCTGGGCA ACAGGAACGG CAGGATCGCC CTGCTCGGCG ACACGGGCCA GGGCATCGTC CAGGTCGGTG GCAGCCTGGA CGCCTCGGCC CCGGACACGG GCGACGGCGG CTTCATCGAA ACCAGCGGCG CCACGGTACG GGTCGCCGAC ACGGCCAGGG TGACCACCAG GGCCGGCACG GGCAAGACCG GCACCTGGCG CATCGAGCCG AGCGACCTCA GCATCGGCGC CGGCGGCTCG CCGGCCGGCA GCAGTATCGG CGCCGATACG CTATCGGCCA ACCTGGCGGC CACCAACGTC GAAGTGGCGA GCGGCAACGA CGGCACCGGC CCCGGCGATA TCGAGGTCAA TGCCGCCGTC GGCTGGAGTG CCGTCACCAC GCTGACCCTC ACCGCGCACA ACGACATCGA TATCCGTGCC GATATCGACG CCACCGGCAA CGGCGCGGGC CTGGCGCTGA ATCCCGGCGG CAACGCCGGC TACCGCCTGT TCAACGGCGC CAGCATCACC CTTTCAGGCA ACGGCGCCCG CTTCTCCGTG AACGGCGACC TCTACACCCT GATCCAGGAT CTCGCGGCCC TGCAGGACAT CGCTGGCGGC GATCCGGGCG GCCGCTACGC ATTGGGCAAC GACATCGCCG CCAGCGATAC CGGTACCTGG AACGGCGGCG CCGGCTTCGA GCCGATCGGC AACGGCGAAA CCCCCTTCAC CGGCATCTTC GACGGACTGG GACACAGGAT TTCCGGACTG ACCATCGACC GGACGACGAG CGACAACGTC GGCTTGTTCG GCACGACACA GGGCGCAACC ATACGCCGGT TGGGACTGAC CGATACCAGC ATTCTGGGCC TGAGCCGCGT TGGCGGCCTG GTCGGCAAGG CCTCGTCCAG CACACTCGAC TCGGTCTATT CCATCGGCAA CGTCAACGGT TACCAGTACA TCGGCGGACT GGTGGGGGAA AACAGCGGCC TGATCGCCAA CAGCCACGGC ATCGGCGCTG TCGGCGGCGA AAGTTTCGTG GGCGGGCTGG TCGGCAGCAA CAGCGGCCAG CTCATCGGCA CCTTCGCCCG CGCCAGCGTG ACCGGCACGG CATCCAGCAT CGGCGGACTG GCAGGCTCCA ACTCGGGCGT GCTTATGGGC AACTTCGCCA TCGGCGACAC CCTGGGCGAC ACCCATGTCG GTGGCCTGGT GGGGAGCAAC GACGGCGGCA TGGTCGCGGG CAACTACAGC CTCGGACAAG TCGGCGGGAA TACCGGCGTC GGCGGTCTGG CAGGCGCCAA TCTCGCCGGT CTCATCGCCG CCAACTACAG CAACGCCGTG GTCGGCGGAA ACGTCGATAC CGGAGGCCTG GCAGGCCTCA ATCTCGGTGG CCTGGTGGCG GGCGGCGGCT TCGGCATCAG CATGCCCGGG GAACTGGCCA GCCTGCTCGG CCTCGACGGT CACGCTTCGT TCGCCGGCGG TTACAGCATC GGCGGCCTAG CGGAAACGGC CACCGGCCTG GGACTGCCGG GATTCGACGA CGGCAGGCTT TTCGCCGGCA GCTACAGTCT GGGCAGCCTC GCCGGCATCG ACAGCCTGGA CGGCCTGCGG ACACTCGTCG ACAGCAGGCT GGCCACGACC GGCCCCGGCA TCGTCGTTCC CGGCGGGCTC GCGTCGCTCG ACAATATCCG CGCCGCTATT GACGGCATTC CGGGCGATTA CGCGAGTCAT CCCGGCAGCG TCAGCGGAAA CGTCAACGTC GGCGGTCTGG TGGGATTCAA CTCGAACGGC ATCGTCGTCG GCGGCCACAA CTTCGCCAAC GTCTATGGGA ATGCCAATGT CGGCGGATTG GTGGGCACCA GCAGCGGACT GATCGTCGGC AACAGCGCCT CGGGCGATGT CGCCGGCCGG GCGGAGAACA CCGGCGGACT GGTGGGCTAC AACGCCGGCA CGCTCAGGAC CAACTACGCC ACGGGCAGCG TCGCCGGAGT CGACAGGGTC GGCGGTCTGG TGGGCCATAA CGTCGGGCAT GTCGATACCA GCCACGCCAC CGGCGATGTC GCCTCCGGCG GCGACGGCGG CGGTCTGGTG GGCGCCAATG CAGGCCACAT CGAAAACAGC TTCGCGCTCG GCAGCGTTAT CGGCGCCGCC CCCCGCAGCG GCGGACTGGC CGGCTCCAAT TCAGGCTCGA TCGCCAACAC CTACGCCTCC GGCAGCGTCT CCGGCCCCAG CGAGGCCGGC GGCTTGGTCG GCTACAACAG CGGCAGCATC GACACCAGCT ACGCGATCGG CGCTGTCATC GCCACTGGCC GCGACGTCGG CGCCTTGCTG GGCAACAACG GCGGCAGCCT TTCCAGGAGC TACTGGAACG CCACAACGGC TGGCGGCCTG CCCGGTATCG GAACTGGCGA CACCGGCGGA GCCGGCGGCC TGAGCGACGG ACACATGATG CGCGAGGACA GCTTCGCCGG CTGGAGCATC TCCGCCAGCG GCGGCAGCAC GGCAGTCTGG CGCATCCACG AAGGCCATAC CGCACCCCTG CTGCGCGCCT TCATGACACC GCTGGCGGTC GTCGCCGACG ATGTGACCAT CATCTACGAC GGTAGCGTCT GGGCGGGAGG CAGCGGCTAC ACGGCCAGCT CGATAACCCC GAACTTCTGG CACCGCTTTC CGAAAGTCGA CCACAACCTG ATCCACGGCA CCATCAACAC CAGCCAGCCG GCGCGCAACG CCGGCAGCCA TGCGATCGAC TCGGGCCTCT ACTCCAGCCA GCTCGGTTAC GACATCGGCT ACCTCCCCGG CACCCTGACC ATCGACAGGG CTCGGCTGAT CCTGAGCGCC AGTCCGGACA GCAAGACCTA TGATGGCACC GTCGCTTCCA GCGGCACGGT CGGAGTCGCC GGCCTCGCCA CCGGCGATAC GCTCACGGCG ACTCAGCGAT ACGATTCGGC CGAAGCCGGC GAACGCACCC TGCGGGTCGA CGAGGTGGCC ATCGACGATG GCAACGGCGG AAACAATTAC GAAGTGACCC ACCGAACCGC CACCGGCTCC ATCCTTGCAG CTACGGACAA TGGCGACGGC GGTGTGGGGA GCGACGGGGG CATCGGCGCT GACGGCGGCT CCGACGATAG CGGAAACACG GGCGGCGGCT CCGACGGCAG CGAAAACTCC GGCGGCAATG GAGGCGACGA CGACGGTGGA GATGACGGTA ACGGTAGCAA TCGCAACCGG CGTGACAAGA GCAACTTTGT CCGGCTATCG CCTGCCTATC TGGCTGCCTT GGCAACCAGG GAACCTTGCA GAACGGCGGA CCAGCGGCTG GATATTCGGC AACGCTATCG CTGCCCCGTG GCAACTACCA CGCAGGATCA GGCGGCAAAC GAATCTGCCG TTCCCTACGC GATAGAGGAC GGAGGCCTGC GCCTGCCGGA AGGACTCTGA
|
Protein sequence | MNRIFDVIWS HAQNAWVVTS ERTVRRGKPG RLRLPILALC LSPLPLQAAD LPEGGQIVLG DGLIGTPASN NLQILQNSRK LAIDWQRFDI GADKSVTFRQ PDSGAIALNR VIGSDGSAIL GKLDANGQVF LINPNGILFG QGASVNVGGL VASTLDLGNE DFEAGSYRFK ANGRPAGINN LGSIAASDGG SVALLGGQVG NDGVIQAKLG TVVLAAGDQI TLDFAGDGLL NVQVDRAAAD ALAHNGKLIA ADGGSVVMTA GSGDALLKTV VNNEGVIEAR TLGNRNGRIA LLGDTGQGIV QVGGSLDASA PDTGDGGFIE TSGATVRVAD TARVTTRAGT GKTGTWRIEP SDLSIGAGGS PAGSSIGADT LSANLAATNV EVASGNDGTG PGDIEVNAAV GWSAVTTLTL TAHNDIDIRA DIDATGNGAG LALNPGGNAG YRLFNGASIT LSGNGARFSV NGDLYTLIQD LAALQDIAGG DPGGRYALGN DIAASDTGTW NGGAGFEPIG NGETPFTGIF DGLGHRISGL TIDRTTSDNV GLFGTTQGAT IRRLGLTDTS ILGLSRVGGL VGKASSSTLD SVYSIGNVNG YQYIGGLVGE NSGLIANSHG IGAVGGESFV GGLVGSNSGQ LIGTFARASV TGTASSIGGL AGSNSGVLMG NFAIGDTLGD THVGGLVGSN DGGMVAGNYS LGQVGGNTGV GGLAGANLAG LIAANYSNAV VGGNVDTGGL AGLNLGGLVA GGGFGISMPG ELASLLGLDG HASFAGGYSI GGLAETATGL GLPGFDDGRL FAGSYSLGSL AGIDSLDGLR TLVDSRLATT GPGIVVPGGL ASLDNIRAAI DGIPGDYASH PGSVSGNVNV GGLVGFNSNG IVVGGHNFAN VYGNANVGGL VGTSSGLIVG NSASGDVAGR AENTGGLVGY NAGTLRTNYA TGSVAGVDRV GGLVGHNVGH VDTSHATGDV ASGGDGGGLV GANAGHIENS FALGSVIGAA PRSGGLAGSN SGSIANTYAS GSVSGPSEAG GLVGYNSGSI DTSYAIGAVI ATGRDVGALL GNNGGSLSRS YWNATTAGGL PGIGTGDTGG AGGLSDGHMM REDSFAGWSI SASGGSTAVW RIHEGHTAPL LRAFMTPLAV VADDVTIIYD GSVWAGGSGY TASSITPNFW HRFPKVDHNL IHGTINTSQP ARNAGSHAID SGLYSSQLGY DIGYLPGTLT IDRARLILSA SPDSKTYDGT VASSGTVGVA GLATGDTLTA TQRYDSAEAG ERTLRVDEVA IDDGNGGNNY EVTHRTATGS ILAATDNGDG GVGSDGGIGA DGGSDDSGNT GGGSDGSENS GGNGGDDDGG DDGNGSNRNR RDKSNFVRLS PAYLAALATR EPCRTADQRL DIRQRYRCPV ATTTQDQAAN ESAVPYAIED GGLRLPEGL
|
| |