Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50630 |
Symbol | |
ID | 7763912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5129310 |
End bp | 5132402 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643807892 |
Product | Periplasmic hybrid histidine protein kinase, two-component |
Protein accession | YP_002802126 |
Protein GI | 226947053 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.366718 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCTGC CTGGCAAAAC GACTGGAACG GATGCCAAGA TTCCCGGAGC GAACGTGAAA CTCGGAAAGC CCAGCACCCT CGCCTCCCTG CTCGCCGTAC TTCTGCTGCT GTTCTCCTCG CTGAACGCCG TGCGTACCGT CCTGCTGGCC CAGGAGCGCC GCGAAGTCCT GCAGGCGCTG GCCCACGGCG TCGGAAGCCC GGCCACGACC GGGGACGGGC GCGATCCGGA GGCCTTCCAG GCCGGGGAGG ACATAGCCCG CGGAAAGGCC CTGGAGCGCC TCGCGGCCCG TCTGGAACGG CTCGACGGCC AGGTCGACTC CGCCATGCGC ATCGTCGCGG CCACCCTGCT GCCCGGCCTG CTGCTGGCGG CCTTCGCCCT GCACGGCCGC CACCGGCGGC GTTCGCCGCG CGACCAGCCG CGCACGCCCC CGGAGCAGGC CGGCAGCGAC GAGTCGGAAC TCCAGCGGGT GCTGTTCGAC AGCATCCCCT TCCCGCTCTT CGTCAAGGAC CGGGAGGCCC GTTACATCCG CTTCAACCGG GCCTTCCTGA ACAACTTCCG GGTCCGCGCC GAGGAACTGA CCGGCAAGAC CATCCTGGAT TTCCTGGAAC TGCCCGCCGA AGAACGCTCC CGCTACCAGG CCGCCAGCGA GCGGCTGCTG CGCGAAGGCG GCCTGTTCAG CACCGAACGG CGAATCCGCT ATGCCGACGG GCGCGAGCAC CCATCCATCT ACACCCTCGC CGGCTACCCC GGCGGCGACG GCCGTCCGGC CGGGCTGCTC GGCATGCTGA TCGATATCTC GGCACAGAAG GCGTCCGAAC GGGCCCTGGC CGAAGCCAAG GAGCTGGCCG AGGAGGCGAC GCGCATGAAG TCCGACTTCC TCGCCAACAT GAGCCACGAG ATTCGCACGC CGATGAACGT GATCATGGGC ATGACCCAGT TGGCGCTGGA CAGCGGGCTG GACGAGCGCC AGCGCAACTT CGTCGAAAAG GCCCACGACG CGGCCCGGAG CCTGCTCGGC ATCATCAACG ACATCCTCGA CTTCTCGAAG ATCGAGGCCG GCAAGATGCG TTTCGAGCAG GTGGACTTCC AGCTCGAGAG CGTGACTGAT CACCTGGCCG ACCTGTCCGT CCTCAAGGCC CAGGAAAAGG GCCTGGAGCT GCTCTTCGAC ATCGCCACCG ACGTGCCCAC CGCGCTGATC GGCGACCCGC TGCGCCTCGG CCAGGTGCTG AGCAACCTGC TCGATAACGC GATCAAGTTC ACCAGCCGGG GCGAGATCAC TCTGCGCATC CGCAAGGAAT GCGAGGACGA ACGGGGCGTC CGGCTGCGCT TCGAGGTGCG CGACACCGGC ATCGGCCTCA ACGAGACGCA GCGCCGGAAG CTGTTCCGGG CCTTCACCCA GGCCGACTCC TCGACCTCCC GCCAGTACGG CGGAACCGGT CTCGGCCTGG CCATCTGCAA GCACCTGGTG GCCATGATGG ACGGCGAGAT CGGCGTCGAC TCCAGCCCCG GGGTCGGCAG CACCTTCCAT TTCGATGCCC GCTTCGAGCG GCAGGCGAAC CAGCGCGAAC TGCTGGCGGA CAGCGCCGAT CTCCTCGGCA TGCGCGTACT GGTGGTGGAC GACAACGCCA GCGCCCTGGA AATCTTCGCC GGCATGCTGA GCGCGCTGCG CTTCGCGGTG ACCACGGCGG ACAACGCGCC GCAGGCGATC CGGCTGCTGG AGCGGGCCCA GCAGGAGGAC AACCCTTACC GCCTGGTAAT CATGGACTGG CTGATGCCGG GCATGGACGG CGTCCAGGCG GTCGGCGCGA TCCGCGCCGC CCCGCGGATC GCCGAGACTC CGCTGTTCGT GATGGTCACC TCGTACGGCC GCGACGAGTT GCTCGAACGC CTCGGCAAAG TCCCCGTGGA GGGCCTGCTG GTCAAGCCGG TCACTCCTTC CAGCCTGCTC GACGGCATCC TCGACGCCTG CAGTCGCAGG ATGAGGACCG CGCCGAACAG CCGAAAAGCG CTGGAAAGCG AGGCCGAACA GGCCCGGGAG GCCCTGCGCG GTGTCCATCT GCTGCTGGTC GAGGACAACC CGGTCAACCA GGAAATGGCC ATGGAACTCC TCGCCACCGC GGGGATTCGC GTGGATGTCG CCAACGACGG CGCCGAGGCC GTCGGCAAGG TCATGGCGCA TGCCTACGAC GGCGTGCTGA TGGACTGCCA GATGCCGGTG ATGGATGGCT TCGAGGCCAC CCGGCGGATT CGCCGGGCAG GCCTCGTCGA CCTGCCGATC CTGGCCATGA CCGCCAGCAC CATGGCTGGT GACCGGGAAC GCTGCCTGGC CGCCGGGATG CACGCGTACA TCGCCAAGCC CATCGACGTC GCCCAGTTGT TCGTCACCCT GCGCCACTGG GTCGGGAGCG GGCGGCCGCT GGCCCTACCG GCTCCTCTCG CCATCGGCGG GAGCGTGCCC ACAGCGTCGG CGAACACCGC CCTGGAGCGG GCCGGAGCCT TGCGGCGCCT GGGCGGCAAC CGCGCCCTGC TCGACCGGCT GCTGGCGCGT TTCGCGGCGA CCCAGGCCGA CGCCGCCGCA CGGCTCGGCG CCGCCCTCGC GGCCGGCGAC CGGGAAGGCG CCCGGCGTAT CGCCCATACC CTCAAGGGAC TGGCCGGCAA TATCGGCGCC ACGGAGCTGG CGACCCAGGC CGCCACCCTC GAGCAATGCC TCGGGCAGAA CCGGCAGCCT CCCGCCGAAA CCCTGGACGA TCTGGAAAGG ACGCTCGGCG CCCTGTGCGC ATCCATCGCC GCCCCGGCCG CCGGCAATCC GGCACCCATC GTGGCGGAAA CGCCGCAGGA GCTGTCCGCA CTCGACGAAG GACTCCGAAC CCTGGAGGCC TTGTTGCGCG ACGACGATGC CGACGCCGTG GCGCAACTGC GCGCCCTGGG CCCCCGGCTG GCCGCCCGCG GCCTCGGCGA ACGGGCCGGC GAGCTACAGG CCCTGGTCGC CCGCTATGAC TTCGAGGCCG CCCTGGCCAG CCTCGCGGCC ATCCGCCGGA ACCTCGCTCC GCGGGACGCC GAGGCAGCCG GCCCGCAGAG ACCGCGCCGA TGA
|
Protein sequence | MDLPGKTTGT DAKIPGANVK LGKPSTLASL LAVLLLLFSS LNAVRTVLLA QERREVLQAL AHGVGSPATT GDGRDPEAFQ AGEDIARGKA LERLAARLER LDGQVDSAMR IVAATLLPGL LLAAFALHGR HRRRSPRDQP RTPPEQAGSD ESELQRVLFD SIPFPLFVKD REARYIRFNR AFLNNFRVRA EELTGKTILD FLELPAEERS RYQAASERLL REGGLFSTER RIRYADGREH PSIYTLAGYP GGDGRPAGLL GMLIDISAQK ASERALAEAK ELAEEATRMK SDFLANMSHE IRTPMNVIMG MTQLALDSGL DERQRNFVEK AHDAARSLLG IINDILDFSK IEAGKMRFEQ VDFQLESVTD HLADLSVLKA QEKGLELLFD IATDVPTALI GDPLRLGQVL SNLLDNAIKF TSRGEITLRI RKECEDERGV RLRFEVRDTG IGLNETQRRK LFRAFTQADS STSRQYGGTG LGLAICKHLV AMMDGEIGVD SSPGVGSTFH FDARFERQAN QRELLADSAD LLGMRVLVVD DNASALEIFA GMLSALRFAV TTADNAPQAI RLLERAQQED NPYRLVIMDW LMPGMDGVQA VGAIRAAPRI AETPLFVMVT SYGRDELLER LGKVPVEGLL VKPVTPSSLL DGILDACSRR MRTAPNSRKA LESEAEQARE ALRGVHLLLV EDNPVNQEMA MELLATAGIR VDVANDGAEA VGKVMAHAYD GVLMDCQMPV MDGFEATRRI RRAGLVDLPI LAMTASTMAG DRERCLAAGM HAYIAKPIDV AQLFVTLRHW VGSGRPLALP APLAIGGSVP TASANTALER AGALRRLGGN RALLDRLLAR FAATQADAAA RLGAALAAGD REGARRIAHT LKGLAGNIGA TELATQAATL EQCLGQNRQP PAETLDDLER TLGALCASIA APAAGNPAPI VAETPQELSA LDEGLRTLEA LLRDDDADAV AQLRALGPRL AARGLGERAG ELQALVARYD FEAALASLAA IRRNLAPRDA EAAGPQRPRR
|
| |