Gene Avin_50630 details

Gene Information

Locus tag: Avin_50630
Symbol:
ID: 7763912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 5129310
End bp: 5132402
Gene Length: 3093 bp
Protein Length: 1030 aa
Translation table: 11
GC content: 70%
IMG OID: 643807892
Product: Periplasmic hybrid histidine protein kinase, two-component
Protein accession: YP_002802126
Protein GI: 226947053
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
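
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5129310
END_BP = 5132402
GENE_LENGTH_BP = 3093
PROTEIN_LENGTH_AA = 1030


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```

Under those assumptions the fields agree: 5132402 - 5129310 + 1 = 3093 bp, and 3093 / 3 - 1 = 1030 aa.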


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.366718
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCTGC CTGGCAAAAC GACTGGAACG GATGCCAAGA TTCCCGGAGC GAACGTGAAA 
CTCGGAAAGC CCAGCACCCT CGCCTCCCTG CTCGCCGTAC TTCTGCTGCT GTTCTCCTCG
CTGAACGCCG TGCGTACCGT CCTGCTGGCC CAGGAGCGCC GCGAAGTCCT GCAGGCGCTG
GCCCACGGCG TCGGAAGCCC GGCCACGACC GGGGACGGGC GCGATCCGGA GGCCTTCCAG
GCCGGGGAGG ACATAGCCCG CGGAAAGGCC CTGGAGCGCC TCGCGGCCCG TCTGGAACGG
CTCGACGGCC AGGTCGACTC CGCCATGCGC ATCGTCGCGG CCACCCTGCT GCCCGGCCTG
CTGCTGGCGG CCTTCGCCCT GCACGGCCGC CACCGGCGGC GTTCGCCGCG CGACCAGCCG
CGCACGCCCC CGGAGCAGGC CGGCAGCGAC GAGTCGGAAC TCCAGCGGGT GCTGTTCGAC
AGCATCCCCT TCCCGCTCTT CGTCAAGGAC CGGGAGGCCC GTTACATCCG CTTCAACCGG
GCCTTCCTGA ACAACTTCCG GGTCCGCGCC GAGGAACTGA CCGGCAAGAC CATCCTGGAT
TTCCTGGAAC TGCCCGCCGA AGAACGCTCC CGCTACCAGG CCGCCAGCGA GCGGCTGCTG
CGCGAAGGCG GCCTGTTCAG CACCGAACGG CGAATCCGCT ATGCCGACGG GCGCGAGCAC
CCATCCATCT ACACCCTCGC CGGCTACCCC GGCGGCGACG GCCGTCCGGC CGGGCTGCTC
GGCATGCTGA TCGATATCTC GGCACAGAAG GCGTCCGAAC GGGCCCTGGC CGAAGCCAAG
GAGCTGGCCG AGGAGGCGAC GCGCATGAAG TCCGACTTCC TCGCCAACAT GAGCCACGAG
ATTCGCACGC CGATGAACGT GATCATGGGC ATGACCCAGT TGGCGCTGGA CAGCGGGCTG
GACGAGCGCC AGCGCAACTT CGTCGAAAAG GCCCACGACG CGGCCCGGAG CCTGCTCGGC
ATCATCAACG ACATCCTCGA CTTCTCGAAG ATCGAGGCCG GCAAGATGCG TTTCGAGCAG
GTGGACTTCC AGCTCGAGAG CGTGACTGAT CACCTGGCCG ACCTGTCCGT CCTCAAGGCC
CAGGAAAAGG GCCTGGAGCT GCTCTTCGAC ATCGCCACCG ACGTGCCCAC CGCGCTGATC
GGCGACCCGC TGCGCCTCGG CCAGGTGCTG AGCAACCTGC TCGATAACGC GATCAAGTTC
ACCAGCCGGG GCGAGATCAC TCTGCGCATC CGCAAGGAAT GCGAGGACGA ACGGGGCGTC
CGGCTGCGCT TCGAGGTGCG CGACACCGGC ATCGGCCTCA ACGAGACGCA GCGCCGGAAG
CTGTTCCGGG CCTTCACCCA GGCCGACTCC TCGACCTCCC GCCAGTACGG CGGAACCGGT
CTCGGCCTGG CCATCTGCAA GCACCTGGTG GCCATGATGG ACGGCGAGAT CGGCGTCGAC
TCCAGCCCCG GGGTCGGCAG CACCTTCCAT TTCGATGCCC GCTTCGAGCG GCAGGCGAAC
CAGCGCGAAC TGCTGGCGGA CAGCGCCGAT CTCCTCGGCA TGCGCGTACT GGTGGTGGAC
GACAACGCCA GCGCCCTGGA AATCTTCGCC GGCATGCTGA GCGCGCTGCG CTTCGCGGTG
ACCACGGCGG ACAACGCGCC GCAGGCGATC CGGCTGCTGG AGCGGGCCCA GCAGGAGGAC
AACCCTTACC GCCTGGTAAT CATGGACTGG CTGATGCCGG GCATGGACGG CGTCCAGGCG
GTCGGCGCGA TCCGCGCCGC CCCGCGGATC GCCGAGACTC CGCTGTTCGT GATGGTCACC
TCGTACGGCC GCGACGAGTT GCTCGAACGC CTCGGCAAAG TCCCCGTGGA GGGCCTGCTG
GTCAAGCCGG TCACTCCTTC CAGCCTGCTC GACGGCATCC TCGACGCCTG CAGTCGCAGG
ATGAGGACCG CGCCGAACAG CCGAAAAGCG CTGGAAAGCG AGGCCGAACA GGCCCGGGAG
GCCCTGCGCG GTGTCCATCT GCTGCTGGTC GAGGACAACC CGGTCAACCA GGAAATGGCC
ATGGAACTCC TCGCCACCGC GGGGATTCGC GTGGATGTCG CCAACGACGG CGCCGAGGCC
GTCGGCAAGG TCATGGCGCA TGCCTACGAC GGCGTGCTGA TGGACTGCCA GATGCCGGTG
ATGGATGGCT TCGAGGCCAC CCGGCGGATT CGCCGGGCAG GCCTCGTCGA CCTGCCGATC
CTGGCCATGA CCGCCAGCAC CATGGCTGGT GACCGGGAAC GCTGCCTGGC CGCCGGGATG
CACGCGTACA TCGCCAAGCC CATCGACGTC GCCCAGTTGT TCGTCACCCT GCGCCACTGG
GTCGGGAGCG GGCGGCCGCT GGCCCTACCG GCTCCTCTCG CCATCGGCGG GAGCGTGCCC
ACAGCGTCGG CGAACACCGC CCTGGAGCGG GCCGGAGCCT TGCGGCGCCT GGGCGGCAAC
CGCGCCCTGC TCGACCGGCT GCTGGCGCGT TTCGCGGCGA CCCAGGCCGA CGCCGCCGCA
CGGCTCGGCG CCGCCCTCGC GGCCGGCGAC CGGGAAGGCG CCCGGCGTAT CGCCCATACC
CTCAAGGGAC TGGCCGGCAA TATCGGCGCC ACGGAGCTGG CGACCCAGGC CGCCACCCTC
GAGCAATGCC TCGGGCAGAA CCGGCAGCCT CCCGCCGAAA CCCTGGACGA TCTGGAAAGG
ACGCTCGGCG CCCTGTGCGC ATCCATCGCC GCCCCGGCCG CCGGCAATCC GGCACCCATC
GTGGCGGAAA CGCCGCAGGA GCTGTCCGCA CTCGACGAAG GACTCCGAAC CCTGGAGGCC
TTGTTGCGCG ACGACGATGC CGACGCCGTG GCGCAACTGC GCGCCCTGGG CCCCCGGCTG
GCCGCCCGCG GCCTCGGCGA ACGGGCCGGC GAGCTACAGG CCCTGGTCGC CCGCTATGAC
TTCGAGGCCG CCCTGGCCAG CCTCGCGGCC ATCCGCCGGA ACCTCGCTCC GCGGGACGCC
GAGGCAGCCG GCCCGCAGAG ACCGCGCCGA TGA
 
Protein sequence
MDLPGKTTGT DAKIPGANVK LGKPSTLASL LAVLLLLFSS LNAVRTVLLA QERREVLQAL 
AHGVGSPATT GDGRDPEAFQ AGEDIARGKA LERLAARLER LDGQVDSAMR IVAATLLPGL
LLAAFALHGR HRRRSPRDQP RTPPEQAGSD ESELQRVLFD SIPFPLFVKD REARYIRFNR
AFLNNFRVRA EELTGKTILD FLELPAEERS RYQAASERLL REGGLFSTER RIRYADGREH
PSIYTLAGYP GGDGRPAGLL GMLIDISAQK ASERALAEAK ELAEEATRMK SDFLANMSHE
IRTPMNVIMG MTQLALDSGL DERQRNFVEK AHDAARSLLG IINDILDFSK IEAGKMRFEQ
VDFQLESVTD HLADLSVLKA QEKGLELLFD IATDVPTALI GDPLRLGQVL SNLLDNAIKF
TSRGEITLRI RKECEDERGV RLRFEVRDTG IGLNETQRRK LFRAFTQADS STSRQYGGTG
LGLAICKHLV AMMDGEIGVD SSPGVGSTFH FDARFERQAN QRELLADSAD LLGMRVLVVD
DNASALEIFA GMLSALRFAV TTADNAPQAI RLLERAQQED NPYRLVIMDW LMPGMDGVQA
VGAIRAAPRI AETPLFVMVT SYGRDELLER LGKVPVEGLL VKPVTPSSLL DGILDACSRR
MRTAPNSRKA LESEAEQARE ALRGVHLLLV EDNPVNQEMA MELLATAGIR VDVANDGAEA
VGKVMAHAYD GVLMDCQMPV MDGFEATRRI RRAGLVDLPI LAMTASTMAG DRERCLAAGM
HAYIAKPIDV AQLFVTLRHW VGSGRPLALP APLAIGGSVP TASANTALER AGALRRLGGN
RALLDRLLAR FAATQADAAA RLGAALAAGD REGARRIAHT LKGLAGNIGA TELATQAATL
EQCLGQNRQP PAETLDDLER TLGALCASIA APAAGNPAPI VAETPQELSA LDEGLRTLEA
LLRDDDADAV AQLRALGPRL AARGLGERAG ELQALVARYD FEAALASLAA IRRNLAPRDA
EAAGPQRPRR
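
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table in the conventional TCAG ordering; translation table 11 assigns the same amino acids to the 64 codons as the standard code and differs only in the set of permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    # First 30 nt of the gene sequence in this record.
    print(translate("ATGGATCTGC CTGGCAAAAC GACTGGAACG"))      # MDLPGKTTGT
    # Last 33 nt, including the TGA stop codon.
    print(translate("GAGGCAGCCG GCCCGCAGAG ACCGCGCCGA TGA"))   # EAAGPQRPRR*
```

Both fragments match the corresponding ends of the protein sequence listed above; the last line is in frame because the preceding 51 lines hold 60 nt each (3060 nt, a multiple of 3).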