Gene Avin_03280 details


Gene Information

Locus tag: Avin_03280
Symbol:
ID: 7759288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 303965
End bp: 306838
Gene Length: 2874 bp
Protein Length: 957 aa
Translation table: 11
GC content: 68%
IMG OID: 643803252
Product: hybrid sensory histidine protein kinase
Protein accession: YP_002797563
Protein GI: 226942490
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
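
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Neither assumption is stated in the record, and the variable names are made up for the example.

```python
# Values copied from the record above.
start_bp = 303965
end_bp = 306838
gene_length_bp = 2874
protein_length_aa = 957

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
coding_codons = codons - 1

print(span_bp, codons, coding_codons)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.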


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.558043
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTCCC CCTCCCATAT CGGACTTCGC CAATGGATTT GGCGAGCATT TGCACGGAGC 
GCACTGATAC CGCTGTTGCT GGTCGAGACG GTGCTGATCG CCGTCTATCT GATCAGCAAC
GGTTCGATCC GCGATGCGCA GATCGATCAC CTGCGGAAAG TCGCCCTGTT CGATCTGCAG
TCTTCCGCCC GCCAGGAGTC GCGCAGCATC GACCAGCAAC TGGCGCAGGT CGGCGCTCTG
ACCGAAATGT TCCGCAACCT GACCGCCCGG ACCCTGCTGG AGACGCAGGC GAAGCCGCTC
GTTCCGCCAG CGCTGGCGAG CACGCCGGAC GGGGTGCGCT ACAGTCCCGA GGACAAGGGC
GGCAGCGCGG TCTTCTACGC CCGCTCCACG CCGCTGGAGC GCCAGGATCT GGACAAGGTC
GCACGGATGG CCACTCTCGA TCCGCTGATG AAGGAACTGC AGCAGCGCAA CCCGCTGGTC
GCCAGCCTCT ATTTCAACGG TTGGGACAAC TACAACCACA TTTACCCGTG GTTTCTCACT
CATGAGCAGT ACCCTCATGA CATGGTCATC CCCAACTACA GCTTCTACTA CCTGGCCGAC
GCCCGGCACA ATCCGGGGCG CGGCATGGTC TGGACCGATG TCTACCTCGA TCCGGCCGGC
CACGGCTGGA TGATGTCGGC CATCGCGCCC GTCTATCGGG GCGACTTTCT CGAAGGCGTG
GTCGGCCTGG ATGTCACGGT CAGCAACCTG CTGGAGCAGA TCGGCCGCCT GGAGGTGCCG
TGGAGCGGCT ATGCCGTGCT GGTCAGCGAC GACCTGGACA TCATGGCCCT GCCGGAGCCG
GGCGAAGCCG ATTTCGGCCT CGACGAGCTG ACCTCGCACT CCTACGACGA GGCCATCCAC
AGCGAACTGT TCAAGCCGGA GGATTTCAAC CTGCGCCGGC GCCCCGCGAC CGCCGGACTG
GCCGCCGCCA TCGTCGAGCG CAACGAGGGG ATGATGACGG TGGAATTCCG CGGCCGGCCC
AGCCTGGTGG CCTGGACCAC CATTCCGCAG ACCGGCTGGC ATTTGCTGAC CGTGGTCGAC
GAGGCCAGGG TGTTCGCCGA AACCAACAGC CTCGCCAGCC AGTACGAGCG CATCGGCTAC
CTGATGATCG CCGGCCTGCT GGTGTTCTAC CTGGTCTTCT TCGGCTGCAT GTGGCTGCGT
GCCCGCCACC TCAGCGAGCG CCTGTTGCGC CCCATCGACG GTATTTCGCG GATGATGGAG
GAGATCGGCG GGGGCAACTG GCGTCCGGCC CGGGTCAGTT CGCGGATCGT CGAGCTGGAT
GCCATGGCCG GCCACGCCGA GCGCATGGGC GAACAGCTCG AGCAGAGCGA ACGACAGAGC
CAGCGGACCC AGGAGCGCCT CGAACTGGTG CTCGACAGCG CCACGGAAAG CCTCTGGGAG
CAGGACCTGC ACAGCCGGCG GATCAGCCTG CGCGGGCGCT TCTGCAAGCG CTTCGGGCTG
CCGGCCGGTC CGATCGATCA TGAAGCCTTC ATGGCGCATG TTCACCCCGA GGACGTGCCG
CGGGTCGAGG CGAGCTTCCG GCTCGCCGAC AGCCGCAGCG GGTTGTGCGA GGCGGATTTC
CGCTTTCGCG ACAGCCACGG TTCCTACCAC TGGCTGCTCG GCCGCGGCCG GGTGGTCGAG
CGCGACCCGG CGACGGGGCA GGCCGTCCTG CTGGCCGGCA CCCATGTGGA CATCGACACG
CTCAAGCGCA CCGAGGCCGA CCTGCGCCTG GCCATCGGCG AGGCCCAGGC CGCCAGCCAG
GCGAAATCGC GCTTCATCTC CAGCATCAGC CACGAGCTGC GCACGCCGCT CAACGCCATC
TATGGCTTCG CCCAGTTGTT GCGCCTGGGC TATGAGGCAC GGGGCGAGAC GCAGGAAGTG
GCCTACCTCG ACGAACTTCT GAGGGCCAGT CGGCACCTCA ACCAACTGGT CGACGACGTG
CTCGACTGGT CCAATCTCCA GACGGCGAAG TTCAGCCTGG AGATGCAGTC CGTCGAGGTG
GCGGGACTGA TGGCCGAGTG CGCCGAGATG GTCCGCCTGG ACGTGGAGAC CCGGGGCCTG
CGCCTCGACC TGGAGCCGCC CGACAAGGGG CTGCTGGTGC GGGCCGACCC GCGCCGTCTG
CGCCAGGTGC TGCTCAACCT GCTGTCCAAT GCGATCAAGT ACAACAGCCC GGCCGGGCGG
ATCGCCCTGA CCCATCAGGC GGAGTCGGGG CGCATCCGGT TGATCGTCGA GGACACCGGT
CCCGGCATCG ACGAGTCGCG CCAGGTCCAG TTGTTCGAGC CCTTCCAGCG CCTCGGGCGG
GAAAACTCGA CGATCCAGGG CACCGGCATC GGCCTGTCGC TGTGCCGCCA GTTCGCCGTC
CTGATGGGCG GCCGCATGGG CATGAGCAGC GAGCCGGGCA CCGGCAGCCG CTTCTGGATC
GATCTGCCGC TCGTGCAGGA CGAGACGGGG GCGGGGAGCG TGGCGCGCAT CTGCCATGTC
GGGGACGACT CCTTCAGCCG GTGCCAGGTG CGCAAGGCGC TGGTCGACCT GGGCGAGGTC
AGCGCCATCG ACAACGGCCG GGCCGTCCTG GAGCGCGTCC TGGCCAGCCC GCCGGACCTG
CTGCTGCTCG ACCTGGACCT GCCGGACATC GGCGGGGAGC GCGTGCTGGA GAGGCTTCGC
CAGCATCCGC AAACCCGCTC GTTGCCGGTG ATCGTGCTCG GCGCGGCCGC CGATGCCGGG
CGTCTGGTCG GGCTCGACTG CCAGGGCAGG CTGGGCAAGC CACTGGACCC GGACGAACTG
CGCGAGCTGG TCGCCGCCCT GCTTCCCCAG GCGAAGCCCG AACATGCCCC CTGA
 
Protein sequence
MPSPSHIGLR QWIWRAFARS ALIPLLLVET VLIAVYLISN GSIRDAQIDH LRKVALFDLQ 
SSARQESRSI DQQLAQVGAL TEMFRNLTAR TLLETQAKPL VPPALASTPD GVRYSPEDKG
GSAVFYARST PLERQDLDKV ARMATLDPLM KELQQRNPLV ASLYFNGWDN YNHIYPWFLT
HEQYPHDMVI PNYSFYYLAD ARHNPGRGMV WTDVYLDPAG HGWMMSAIAP VYRGDFLEGV
VGLDVTVSNL LEQIGRLEVP WSGYAVLVSD DLDIMALPEP GEADFGLDEL TSHSYDEAIH
SELFKPEDFN LRRRPATAGL AAAIVERNEG MMTVEFRGRP SLVAWTTIPQ TGWHLLTVVD
EARVFAETNS LASQYERIGY LMIAGLLVFY LVFFGCMWLR ARHLSERLLR PIDGISRMME
EIGGGNWRPA RVSSRIVELD AMAGHAERMG EQLEQSERQS QRTQERLELV LDSATESLWE
QDLHSRRISL RGRFCKRFGL PAGPIDHEAF MAHVHPEDVP RVEASFRLAD SRSGLCEADF
RFRDSHGSYH WLLGRGRVVE RDPATGQAVL LAGTHVDIDT LKRTEADLRL AIGEAQAASQ
AKSRFISSIS HELRTPLNAI YGFAQLLRLG YEARGETQEV AYLDELLRAS RHLNQLVDDV
LDWSNLQTAK FSLEMQSVEV AGLMAECAEM VRLDVETRGL RLDLEPPDKG LLVRADPRRL
RQVLLNLLSN AIKYNSPAGR IALTHQAESG RIRLIVEDTG PGIDESRQVQ LFEPFQRLGR
ENSTIQGTGI GLSLCRQFAV LMGGRMGMSS EPGTGSRFWI DLPLVQDETG AGSVARICHV
GDDSFSRCQV RKALVDLGEV SAIDNGRAVL ERVLASPPDL LLLDLDLPDI GGERVLERLR
QHPQTRSLPV IVLGAAADAG RLVGLDCQGR LGKPLDPDEL RELVAALLPQ AKPEHAP
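
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper of the kind that could be used to check the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tool. The demo uses only the first line of the gene sequence, so its result does not represent the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, exactly as printed in the record.
first_line = "ATGCCTTCCC CCTCCCATAT CGGACTTCGC CAATGGATTT GGCGAGCATT TGCACGGAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` would give a value to compare against the GC content listed in the record.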