Gene Information |
Locus tag | Avin_03280 |
Symbol | |
ID | 7759288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 303965 |
End bp | 306838 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643803252 |
Product | hybrid sensory histidine protein kinase |
Protein accession | YP_002797563 |
Protein GI | 226942490 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
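The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 303965
end_bp = 306838

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the remainder should be zero.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (2874 bp) and Protein Length (957 aa) fields.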
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.558043 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCCC CCTCCCATAT CGGACTTCGC CAATGGATTT GGCGAGCATT TGCACGGAGC GCACTGATAC CGCTGTTGCT GGTCGAGACG GTGCTGATCG CCGTCTATCT GATCAGCAAC GGTTCGATCC GCGATGCGCA GATCGATCAC CTGCGGAAAG TCGCCCTGTT CGATCTGCAG TCTTCCGCCC GCCAGGAGTC GCGCAGCATC GACCAGCAAC TGGCGCAGGT CGGCGCTCTG ACCGAAATGT TCCGCAACCT GACCGCCCGG ACCCTGCTGG AGACGCAGGC GAAGCCGCTC GTTCCGCCAG CGCTGGCGAG CACGCCGGAC GGGGTGCGCT ACAGTCCCGA GGACAAGGGC GGCAGCGCGG TCTTCTACGC CCGCTCCACG CCGCTGGAGC GCCAGGATCT GGACAAGGTC GCACGGATGG CCACTCTCGA TCCGCTGATG AAGGAACTGC AGCAGCGCAA CCCGCTGGTC GCCAGCCTCT ATTTCAACGG TTGGGACAAC TACAACCACA TTTACCCGTG GTTTCTCACT CATGAGCAGT ACCCTCATGA CATGGTCATC CCCAACTACA GCTTCTACTA CCTGGCCGAC GCCCGGCACA ATCCGGGGCG CGGCATGGTC TGGACCGATG TCTACCTCGA TCCGGCCGGC CACGGCTGGA TGATGTCGGC CATCGCGCCC GTCTATCGGG GCGACTTTCT CGAAGGCGTG GTCGGCCTGG ATGTCACGGT CAGCAACCTG CTGGAGCAGA TCGGCCGCCT GGAGGTGCCG TGGAGCGGCT ATGCCGTGCT GGTCAGCGAC GACCTGGACA TCATGGCCCT GCCGGAGCCG GGCGAAGCCG ATTTCGGCCT CGACGAGCTG ACCTCGCACT CCTACGACGA GGCCATCCAC AGCGAACTGT TCAAGCCGGA GGATTTCAAC CTGCGCCGGC GCCCCGCGAC CGCCGGACTG GCCGCCGCCA TCGTCGAGCG CAACGAGGGG ATGATGACGG TGGAATTCCG CGGCCGGCCC AGCCTGGTGG CCTGGACCAC CATTCCGCAG ACCGGCTGGC ATTTGCTGAC CGTGGTCGAC GAGGCCAGGG TGTTCGCCGA AACCAACAGC CTCGCCAGCC AGTACGAGCG CATCGGCTAC CTGATGATCG CCGGCCTGCT GGTGTTCTAC CTGGTCTTCT TCGGCTGCAT GTGGCTGCGT GCCCGCCACC TCAGCGAGCG CCTGTTGCGC CCCATCGACG GTATTTCGCG GATGATGGAG GAGATCGGCG GGGGCAACTG GCGTCCGGCC CGGGTCAGTT CGCGGATCGT CGAGCTGGAT GCCATGGCCG GCCACGCCGA GCGCATGGGC GAACAGCTCG AGCAGAGCGA ACGACAGAGC CAGCGGACCC AGGAGCGCCT CGAACTGGTG CTCGACAGCG CCACGGAAAG CCTCTGGGAG CAGGACCTGC ACAGCCGGCG GATCAGCCTG CGCGGGCGCT TCTGCAAGCG CTTCGGGCTG CCGGCCGGTC CGATCGATCA TGAAGCCTTC ATGGCGCATG TTCACCCCGA GGACGTGCCG CGGGTCGAGG CGAGCTTCCG GCTCGCCGAC AGCCGCAGCG GGTTGTGCGA GGCGGATTTC CGCTTTCGCG ACAGCCACGG TTCCTACCAC TGGCTGCTCG GCCGCGGCCG GGTGGTCGAG CGCGACCCGG CGACGGGGCA GGCCGTCCTG CTGGCCGGCA CCCATGTGGA CATCGACACG CTCAAGCGCA CCGAGGCCGA CCTGCGCCTG GCCATCGGCG AGGCCCAGGC CGCCAGCCAG 
GCGAAATCGC GCTTCATCTC CAGCATCAGC CACGAGCTGC GCACGCCGCT CAACGCCATC TATGGCTTCG CCCAGTTGTT GCGCCTGGGC TATGAGGCAC GGGGCGAGAC GCAGGAAGTG GCCTACCTCG ACGAACTTCT GAGGGCCAGT CGGCACCTCA ACCAACTGGT CGACGACGTG CTCGACTGGT CCAATCTCCA GACGGCGAAG TTCAGCCTGG AGATGCAGTC CGTCGAGGTG GCGGGACTGA TGGCCGAGTG CGCCGAGATG GTCCGCCTGG ACGTGGAGAC CCGGGGCCTG CGCCTCGACC TGGAGCCGCC CGACAAGGGG CTGCTGGTGC GGGCCGACCC GCGCCGTCTG CGCCAGGTGC TGCTCAACCT GCTGTCCAAT GCGATCAAGT ACAACAGCCC GGCCGGGCGG ATCGCCCTGA CCCATCAGGC GGAGTCGGGG CGCATCCGGT TGATCGTCGA GGACACCGGT CCCGGCATCG ACGAGTCGCG CCAGGTCCAG TTGTTCGAGC CCTTCCAGCG CCTCGGGCGG GAAAACTCGA CGATCCAGGG CACCGGCATC GGCCTGTCGC TGTGCCGCCA GTTCGCCGTC CTGATGGGCG GCCGCATGGG CATGAGCAGC GAGCCGGGCA CCGGCAGCCG CTTCTGGATC GATCTGCCGC TCGTGCAGGA CGAGACGGGG GCGGGGAGCG TGGCGCGCAT CTGCCATGTC GGGGACGACT CCTTCAGCCG GTGCCAGGTG CGCAAGGCGC TGGTCGACCT GGGCGAGGTC AGCGCCATCG ACAACGGCCG GGCCGTCCTG GAGCGCGTCC TGGCCAGCCC GCCGGACCTG CTGCTGCTCG ACCTGGACCT GCCGGACATC GGCGGGGAGC GCGTGCTGGA GAGGCTTCGC CAGCATCCGC AAACCCGCTC GTTGCCGGTG ATCGTGCTCG GCGCGGCCGC CGATGCCGGG CGTCTGGTCG GGCTCGACTG CCAGGGCAGG CTGGGCAAGC CACTGGACCC GGACGAACTG CGCGAGCTGG TCGCCGCCCT GCTTCCCCAG GCGAAGCCCG AACATGCCCC CTGA
|
Protein sequence | MPSPSHIGLR QWIWRAFARS ALIPLLLVET VLIAVYLISN GSIRDAQIDH LRKVALFDLQ SSARQESRSI DQQLAQVGAL TEMFRNLTAR TLLETQAKPL VPPALASTPD GVRYSPEDKG GSAVFYARST PLERQDLDKV ARMATLDPLM KELQQRNPLV ASLYFNGWDN YNHIYPWFLT HEQYPHDMVI PNYSFYYLAD ARHNPGRGMV WTDVYLDPAG HGWMMSAIAP VYRGDFLEGV VGLDVTVSNL LEQIGRLEVP WSGYAVLVSD DLDIMALPEP GEADFGLDEL TSHSYDEAIH SELFKPEDFN LRRRPATAGL AAAIVERNEG MMTVEFRGRP SLVAWTTIPQ TGWHLLTVVD EARVFAETNS LASQYERIGY LMIAGLLVFY LVFFGCMWLR ARHLSERLLR PIDGISRMME EIGGGNWRPA RVSSRIVELD AMAGHAERMG EQLEQSERQS QRTQERLELV LDSATESLWE QDLHSRRISL RGRFCKRFGL PAGPIDHEAF MAHVHPEDVP RVEASFRLAD SRSGLCEADF RFRDSHGSYH WLLGRGRVVE RDPATGQAVL LAGTHVDIDT LKRTEADLRL AIGEAQAASQ AKSRFISSIS HELRTPLNAI YGFAQLLRLG YEARGETQEV AYLDELLRAS RHLNQLVDDV LDWSNLQTAK FSLEMQSVEV AGLMAECAEM VRLDVETRGL RLDLEPPDKG LLVRADPRRL RQVLLNLLSN AIKYNSPAGR IALTHQAESG RIRLIVEDTG PGIDESRQVQ LFEPFQRLGR ENSTIQGTGI GLSLCRQFAV LMGGRMGMSS EPGTGSRFWI DLPLVQDETG AGSVARICHV GDDSFSRCQV RKALVDLGEV SAIDNGRAVL ERVLASPPDL LLLDLDLPDI GGERVLERLR QHPQTRSLPV IVLGAAADAG RLVGLDCQGR LGKPLDPDEL RELVAALLPQ AKPEHAP
|
| |
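The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the spacing, estimate GC content, and translate the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used here for brevity.

```python
# Codon assignments in NCBI "TCAG" order (standard code; the same mapping is
# used by translation table 11 for elongation).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
prefix = "ATGCCTTCCC CCTCCCATAT CGGACTTCGC"
print(translate(prefix))  # MPSPSHIGLR, the first block of the protein sequence
print(gc_fraction(prefix))
```

Pasting the full gene sequence into `prefix` would let the same functions be compared against the GC content and Protein sequence fields of the record.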