Gene Avin_37890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_37890 
Symbol 
ID7762681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3834234 
End bp3838493 
Gene Length4260 bp 
Protein Length1419 aa 
Translation table11 
GC content68% 
IMG OID643806655 
ProductFilamentous hemagglutinin domain-containing protein 
Protein accessionYP_002800908 
Protein GI226945835 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.687016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGGA TATTCGACGT CATCTGGAGT CATGCGCAAA ACGCCTGGGT CGTCACCAGC 
GAACGCACCG TCAGGCGCGG CAAACCCGGC AGGCTGCGCC TGCCGATCCT GGCCCTGTGC
CTTTCGCCAC TCCCCCTGCA GGCCGCCGAC CTGCCCGAAG GCGGACAAAT CGTCCTGGGC
GACGGCCTCA TCGGCACACC CGCGAGCAAC AACCTGCAGA TCCTGCAGAA CAGCCGGAAG
CTGGCCATCG ACTGGCAGCG GTTCGACATC GGCGCGGACA AGAGCGTGAC CTTCCGGCAG
CCGGACAGCG GCGCTATCGC CCTGAACCGG GTGATCGGCA GCGATGGCAG CGCCATCCTC
GGCAAGCTGG ATGCGAACGG TCAGGTATTT CTGATCAACC CCAACGGCAT CCTCTTCGGC
CAGGGCGCCA GCGTGAACGT CGGCGGCCTG GTCGCCTCGA CCCTGGACCT CGGCAACGAG
GACTTCGAGG CCGGCAGCTA CCGCTTCAAG GCCAACGGCC GCCCGGCCGG CATCAATAAC
CTGGGCAGCA TCGCCGCCAG CGACGGCGGC TCGGTGGCCC TGCTCGGCGG CCAGGTCGGC
AATGACGGAG TCATCCAGGC GAAGCTGGGC ACCGTGGTCC TGGCCGCCGG CGACCAGATA
ACCCTCGACT TCGCCGGCGA CGGCCTGCTC AATGTCCAGG TCGACCGGGC CGCTGCCGAT
GCCCTGGCCC ACAACGGCAA GCTGATCGCG GCCGATGGCG GCAGCGTGGT CATGACCGCC
GGGAGCGGCG ATGCGCTGCT GAAGACGGTG GTCAACAACG AAGGCGTCAT CGAGGCGCGG
ACCCTGGGCA ACAGGAACGG CAGGATCGCC CTGCTCGGCG ACACGGGCCA GGGCATCGTC
CAGGTCGGTG GCAGCCTGGA CGCCTCGGCC CCGGACACGG GCGACGGCGG CTTCATCGAA
ACCAGCGGCG CCACGGTACG GGTCGCCGAC ACGGCCAGGG TGACCACCAG GGCCGGCACG
GGCAAGACCG GCACCTGGCG CATCGAGCCG AGCGACCTCA GCATCGGCGC CGGCGGCTCG
CCGGCCGGCA GCAGTATCGG CGCCGATACG CTATCGGCCA ACCTGGCGGC CACCAACGTC
GAAGTGGCGA GCGGCAACGA CGGCACCGGC CCCGGCGATA TCGAGGTCAA TGCCGCCGTC
GGCTGGAGTG CCGTCACCAC GCTGACCCTC ACCGCGCACA ACGACATCGA TATCCGTGCC
GATATCGACG CCACCGGCAA CGGCGCGGGC CTGGCGCTGA ATCCCGGCGG CAACGCCGGC
TACCGCCTGT TCAACGGCGC CAGCATCACC CTTTCAGGCA ACGGCGCCCG CTTCTCCGTG
AACGGCGACC TCTACACCCT GATCCAGGAT CTCGCGGCCC TGCAGGACAT CGCTGGCGGC
GATCCGGGCG GCCGCTACGC ATTGGGCAAC GACATCGCCG CCAGCGATAC CGGTACCTGG
AACGGCGGCG CCGGCTTCGA GCCGATCGGC AACGGCGAAA CCCCCTTCAC CGGCATCTTC
GACGGACTGG GACACAGGAT TTCCGGACTG ACCATCGACC GGACGACGAG CGACAACGTC
GGCTTGTTCG GCACGACACA GGGCGCAACC ATACGCCGGT TGGGACTGAC CGATACCAGC
ATTCTGGGCC TGAGCCGCGT TGGCGGCCTG GTCGGCAAGG CCTCGTCCAG CACACTCGAC
TCGGTCTATT CCATCGGCAA CGTCAACGGT TACCAGTACA TCGGCGGACT GGTGGGGGAA
AACAGCGGCC TGATCGCCAA CAGCCACGGC ATCGGCGCTG TCGGCGGCGA AAGTTTCGTG
GGCGGGCTGG TCGGCAGCAA CAGCGGCCAG CTCATCGGCA CCTTCGCCCG CGCCAGCGTG
ACCGGCACGG CATCCAGCAT CGGCGGACTG GCAGGCTCCA ACTCGGGCGT GCTTATGGGC
AACTTCGCCA TCGGCGACAC CCTGGGCGAC ACCCATGTCG GTGGCCTGGT GGGGAGCAAC
GACGGCGGCA TGGTCGCGGG CAACTACAGC CTCGGACAAG TCGGCGGGAA TACCGGCGTC
GGCGGTCTGG CAGGCGCCAA TCTCGCCGGT CTCATCGCCG CCAACTACAG CAACGCCGTG
GTCGGCGGAA ACGTCGATAC CGGAGGCCTG GCAGGCCTCA ATCTCGGTGG CCTGGTGGCG
GGCGGCGGCT TCGGCATCAG CATGCCCGGG GAACTGGCCA GCCTGCTCGG CCTCGACGGT
CACGCTTCGT TCGCCGGCGG TTACAGCATC GGCGGCCTAG CGGAAACGGC CACCGGCCTG
GGACTGCCGG GATTCGACGA CGGCAGGCTT TTCGCCGGCA GCTACAGTCT GGGCAGCCTC
GCCGGCATCG ACAGCCTGGA CGGCCTGCGG ACACTCGTCG ACAGCAGGCT GGCCACGACC
GGCCCCGGCA TCGTCGTTCC CGGCGGGCTC GCGTCGCTCG ACAATATCCG CGCCGCTATT
GACGGCATTC CGGGCGATTA CGCGAGTCAT CCCGGCAGCG TCAGCGGAAA CGTCAACGTC
GGCGGTCTGG TGGGATTCAA CTCGAACGGC ATCGTCGTCG GCGGCCACAA CTTCGCCAAC
GTCTATGGGA ATGCCAATGT CGGCGGATTG GTGGGCACCA GCAGCGGACT GATCGTCGGC
AACAGCGCCT CGGGCGATGT CGCCGGCCGG GCGGAGAACA CCGGCGGACT GGTGGGCTAC
AACGCCGGCA CGCTCAGGAC CAACTACGCC ACGGGCAGCG TCGCCGGAGT CGACAGGGTC
GGCGGTCTGG TGGGCCATAA CGTCGGGCAT GTCGATACCA GCCACGCCAC CGGCGATGTC
GCCTCCGGCG GCGACGGCGG CGGTCTGGTG GGCGCCAATG CAGGCCACAT CGAAAACAGC
TTCGCGCTCG GCAGCGTTAT CGGCGCCGCC CCCCGCAGCG GCGGACTGGC CGGCTCCAAT
TCAGGCTCGA TCGCCAACAC CTACGCCTCC GGCAGCGTCT CCGGCCCCAG CGAGGCCGGC
GGCTTGGTCG GCTACAACAG CGGCAGCATC GACACCAGCT ACGCGATCGG CGCTGTCATC
GCCACTGGCC GCGACGTCGG CGCCTTGCTG GGCAACAACG GCGGCAGCCT TTCCAGGAGC
TACTGGAACG CCACAACGGC TGGCGGCCTG CCCGGTATCG GAACTGGCGA CACCGGCGGA
GCCGGCGGCC TGAGCGACGG ACACATGATG CGCGAGGACA GCTTCGCCGG CTGGAGCATC
TCCGCCAGCG GCGGCAGCAC GGCAGTCTGG CGCATCCACG AAGGCCATAC CGCACCCCTG
CTGCGCGCCT TCATGACACC GCTGGCGGTC GTCGCCGACG ATGTGACCAT CATCTACGAC
GGTAGCGTCT GGGCGGGAGG CAGCGGCTAC ACGGCCAGCT CGATAACCCC GAACTTCTGG
CACCGCTTTC CGAAAGTCGA CCACAACCTG ATCCACGGCA CCATCAACAC CAGCCAGCCG
GCGCGCAACG CCGGCAGCCA TGCGATCGAC TCGGGCCTCT ACTCCAGCCA GCTCGGTTAC
GACATCGGCT ACCTCCCCGG CACCCTGACC ATCGACAGGG CTCGGCTGAT CCTGAGCGCC
AGTCCGGACA GCAAGACCTA TGATGGCACC GTCGCTTCCA GCGGCACGGT CGGAGTCGCC
GGCCTCGCCA CCGGCGATAC GCTCACGGCG ACTCAGCGAT ACGATTCGGC CGAAGCCGGC
GAACGCACCC TGCGGGTCGA CGAGGTGGCC ATCGACGATG GCAACGGCGG AAACAATTAC
GAAGTGACCC ACCGAACCGC CACCGGCTCC ATCCTTGCAG CTACGGACAA TGGCGACGGC
GGTGTGGGGA GCGACGGGGG CATCGGCGCT GACGGCGGCT CCGACGATAG CGGAAACACG
GGCGGCGGCT CCGACGGCAG CGAAAACTCC GGCGGCAATG GAGGCGACGA CGACGGTGGA
GATGACGGTA ACGGTAGCAA TCGCAACCGG CGTGACAAGA GCAACTTTGT CCGGCTATCG
CCTGCCTATC TGGCTGCCTT GGCAACCAGG GAACCTTGCA GAACGGCGGA CCAGCGGCTG
GATATTCGGC AACGCTATCG CTGCCCCGTG GCAACTACCA CGCAGGATCA GGCGGCAAAC
GAATCTGCCG TTCCCTACGC GATAGAGGAC GGAGGCCTGC GCCTGCCGGA AGGACTCTGA
 
Protein sequence
MNRIFDVIWS HAQNAWVVTS ERTVRRGKPG RLRLPILALC LSPLPLQAAD LPEGGQIVLG 
DGLIGTPASN NLQILQNSRK LAIDWQRFDI GADKSVTFRQ PDSGAIALNR VIGSDGSAIL
GKLDANGQVF LINPNGILFG QGASVNVGGL VASTLDLGNE DFEAGSYRFK ANGRPAGINN
LGSIAASDGG SVALLGGQVG NDGVIQAKLG TVVLAAGDQI TLDFAGDGLL NVQVDRAAAD
ALAHNGKLIA ADGGSVVMTA GSGDALLKTV VNNEGVIEAR TLGNRNGRIA LLGDTGQGIV
QVGGSLDASA PDTGDGGFIE TSGATVRVAD TARVTTRAGT GKTGTWRIEP SDLSIGAGGS
PAGSSIGADT LSANLAATNV EVASGNDGTG PGDIEVNAAV GWSAVTTLTL TAHNDIDIRA
DIDATGNGAG LALNPGGNAG YRLFNGASIT LSGNGARFSV NGDLYTLIQD LAALQDIAGG
DPGGRYALGN DIAASDTGTW NGGAGFEPIG NGETPFTGIF DGLGHRISGL TIDRTTSDNV
GLFGTTQGAT IRRLGLTDTS ILGLSRVGGL VGKASSSTLD SVYSIGNVNG YQYIGGLVGE
NSGLIANSHG IGAVGGESFV GGLVGSNSGQ LIGTFARASV TGTASSIGGL AGSNSGVLMG
NFAIGDTLGD THVGGLVGSN DGGMVAGNYS LGQVGGNTGV GGLAGANLAG LIAANYSNAV
VGGNVDTGGL AGLNLGGLVA GGGFGISMPG ELASLLGLDG HASFAGGYSI GGLAETATGL
GLPGFDDGRL FAGSYSLGSL AGIDSLDGLR TLVDSRLATT GPGIVVPGGL ASLDNIRAAI
DGIPGDYASH PGSVSGNVNV GGLVGFNSNG IVVGGHNFAN VYGNANVGGL VGTSSGLIVG
NSASGDVAGR AENTGGLVGY NAGTLRTNYA TGSVAGVDRV GGLVGHNVGH VDTSHATGDV
ASGGDGGGLV GANAGHIENS FALGSVIGAA PRSGGLAGSN SGSIANTYAS GSVSGPSEAG
GLVGYNSGSI DTSYAIGAVI ATGRDVGALL GNNGGSLSRS YWNATTAGGL PGIGTGDTGG
AGGLSDGHMM REDSFAGWSI SASGGSTAVW RIHEGHTAPL LRAFMTPLAV VADDVTIIYD
GSVWAGGSGY TASSITPNFW HRFPKVDHNL IHGTINTSQP ARNAGSHAID SGLYSSQLGY
DIGYLPGTLT IDRARLILSA SPDSKTYDGT VASSGTVGVA GLATGDTLTA TQRYDSAEAG
ERTLRVDEVA IDDGNGGNNY EVTHRTATGS ILAATDNGDG GVGSDGGIGA DGGSDDSGNT
GGGSDGSENS GGNGGDDDGG DDGNGSNRNR RDKSNFVRLS PAYLAALATR EPCRTADQRL
DIRQRYRCPV ATTTQDQAAN ESAVPYAIED GGLRLPEGL