Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0521 |
Symbol | |
ID | 3682351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 654685 |
End bp | 657903 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637715849 |
Product | multi-sensor Signal transduction histidine kinase |
Protein accession | YP_321040 |
Protein GI | 75906744 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2203] FOG: GAF domain [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTAG GTGTTTCATT TTTCTCAATA TCTATGATAA GAAACTCTTA TGAGCCGCCC AACCCTTCAG ATTTGCCAGA TTGTCACAAC CAGCTACGCC TGACAGCACA GTATCAGAAA ATATTGGTGA GAATTCTGGC TAAAATTCGC TCATCGGTAA ACATAGAATC TTTGTGCCGG ACTTCTTGTC AAGATATTTG TGGGCAATTG CAGATTGAGC GATTAGCTAT TTACCGCTTC AACAGTGATT GGAGCGGTAG TTTTATCAAT CGTTTTGGCT TTGCAGAACC ACCTTGGGAT AAACTCACTG CTTTTGGACA GGACTTGGTG TGGCAAGATT CCCATTTGCA AGAAACTCAA GGGGGGCGAT ATCGCAAAAA TGAGCCGTTT GCTACGGCTG ATATTTACGA TGCAGGACAC TCTCGTTGTC ATATTGAGGT GTTAGAACAA TTTCAGATTC GAGCATATGC GATCGCTCCG ATTGTCGTTG GTGCAAAACT GTGGGGTCTA CTAGCGGCCT ATCAGCACTC TGCGCCTCGA CAATGGCATC AAAACGAGGT GGAATTTATA GCCCAAGCCG CTAGCTATTT GGGTGTAGCA ATGCAGCAGG AAGAAATCAT CAAGGAAGCC AAACAACGTA CAGTAGAATT GCAAGATGCT ATTGCCAGGC AACGAGCTTT GATGGAAGTT GTAGGCAATA TTCGCTCATC TATCAATACG GAAATTATTT TAAACACAGC TTGCCAAGAA TTGTGCAAAC TTCTGAAGCT AGAACGGGCA GCAGTTTATC GCTTTAATGA AGATTGGAGC GGTGAATTTG TCAGTCAATT TGGGATGGTA GAAACACAGT GGCACAGAAT CAGCCCCTTT GGCAAAAATC TGGTTTGGGA TGATACCTAC CTCCAAGAAA CCAAAGGTGG GCGTTACCGC CATAACGAAA CTTTTGCGGT TAATGATATC TATGAGGCTG GACATACACG CTGTCATATT GACATTCTTG AGCAATTCAA GATTTATGCC TATGCTTTAG CACCAATTTT CATCGGTAAA AAACTGTGGG GACTCATAGC AGCTTATCAA CACACTGGCC CACGAGAATG GGCAAATTAT GAAGTGGAGT TTCTCGGACA GGTGGGCGCG CAATTAGGGG TGGCAATTCA GCAAGCGGAG AATTTAGCCC AATCGAAACA ACAGGCTGAT GCTCTGCAAA ATGCGATCGC TCGGCAACGC GCCCTCACAG AAGTAGTGGG AAAAATCCGC TCTTCCCTAG ATATTAATTT AATCTTGAAA ACCACCTGCC AAGAAGTATG TAAAATGCTA CGAATTGAGC GGGTAGGAGT TTATCGCTTT AATCCTGACT GGAGTGGTGA ATTCGTCAGT AATTTCGGTA TGGTGGAGGC GCAATGGGAC AGCATTAACC CCTTTGGCCA AAATCTGGTT TGGGAGGACA CCCACCTCCA AGAAACCAAA GGGGGACGCT ATCGCAACAA CGAAAATTTT GCAGTCAACG ACATTTACCA AGTCGGACAC TCACGCTGTC ACTTAGATAT TCTGGAGCAG TTTAAGATTC GAGCTTATGC CTTAACCCCA ATTTTTGTCG GACGGAATCT GTGGGGACTG CTGGCCGCCT ATCAACATTC TGCACCGCGC CAGTGGGATA TTGTAGAAGT AGAATTTTTG GGACAAGTAG CTAGTCAACT GGGAGTGGCG TTGCAAAGTT CTCAAATGAT GAGCCAAATT CAAACCCGTG CTGATGAACT GCAAAAATCG GCTGAACAGC GCCGAATTTT ATTTGATTTG GTTGTCAAAA TCCGAGAATC TTTGGATTTA GAGGCAATTT TGAAAAATAC TGTGCAGGAA GTGAGGCGAT CGCTGCAAGC AGACCGAGTC GGCATCTTTC GCTTTGATTC TGATCAGGGT TTTTGTAGTG GTGAATTTAT TGCTGAAGAT GTGTTACCCA AGTTTGATTC TGCTCTAGCT GTGAAGGTGC AAGACTATTG CTTCGGTGAC CATTATGCAC CTCAATATCG CCAAGGGCAA GTACAAGTAA TTTCCGATGT CAACAGTGTT GGCTCTAAGG TACCCCATCT GGATGTGATT GAGCAATTCC AAGTCAAAGC TCAGATTATT GTGCCATTGA TGGAAGGTGA TAATCTGTGG GGATTATTAT GTATTCACCA ATGCACTCAT CCACGTCATT GGGAAGAAGA CGAATTGGAA TTTGTCACTC AAATAGCAGC TCAACTCAGT GTGGCACTAC ATCAAGCTAA CTTGTTCCAA CAATCGAGCT TGCTTGGTGA GACTCGTGCA GAAGCAAATC AACTTGCCCA AACACTGAAG GAACTCCGCA CTGCCCAAAT GCAAATTATC CACGCTGAAA AAATGGCCAG TTTGGGGCAG TTGGTAGCAG GAGTTGCCCA CGAGATTAAT AATCCAATTA ATTTTATTCA CGGCAATTTA GAACACGCCC ATCAATATAC CCAAGAGTTG CTGCGTTATG TGAAACTTTA TCAGCACTAT CACCCCAATG CAGCTCCGGA AATACAAGAG TTTTTCCAGC AAGCCGAAAT CGAGTTTTTA TTTGAGGACT TACCTAACTT ATTCCAGTCA ATGCAGGTAG GTACTCAGCG CATTCAGGAA ATTGTCACCT CTTTACGTAG CTTCTCACGC CTAGACGAAG CTGACTTCAA AACAGCAAAT ATTCATGAAT GTATCGACAG CACTTTGATG ATTTTACAAC ATCGCTTAAA GCCCTCCGCT GATAGCCACG CCATTCATGT GACCAAGGAC TATGATGATT TACCTCTCAT AGAGTGCTAT CCCGGTCAAT TAAACCAGGT ATTTATGAAT TTGCTGTCTA ATGCCATTGA TGCCTTAGAA GAACGGGAAG CCAAGCTATC TCCTGAAGTA ATTGCCGCAC ATCCCAGCGA AATCCGCATT TATACATCTC TACTCAATCA AGACTGGATG AGTATTCGGA TTACCGATAA TGGACTCGGC ATTGATGAGC AGATAATCCC CAGATTGTTC GATCCATTTT TTACTACTAA GATGGTCGGT AAAGGTACTG GACTGGGACT TTCGATCAGC TATCAGATTG TCACAGATAA ACACAAAGGC AAGATTTACT GCCAATCAGA ACTTGGTAAA GGTACAGAGT TTGTCGTCGA ATTACCAATT CTTCAGGCAA AAATTAATCC ATCAATAACG AAAGTCTGA
|
Protein sequence | MKLGVSFFSI SMIRNSYEPP NPSDLPDCHN QLRLTAQYQK ILVRILAKIR SSVNIESLCR TSCQDICGQL QIERLAIYRF NSDWSGSFIN RFGFAEPPWD KLTAFGQDLV WQDSHLQETQ GGRYRKNEPF ATADIYDAGH SRCHIEVLEQ FQIRAYAIAP IVVGAKLWGL LAAYQHSAPR QWHQNEVEFI AQAASYLGVA MQQEEIIKEA KQRTVELQDA IARQRALMEV VGNIRSSINT EIILNTACQE LCKLLKLERA AVYRFNEDWS GEFVSQFGMV ETQWHRISPF GKNLVWDDTY LQETKGGRYR HNETFAVNDI YEAGHTRCHI DILEQFKIYA YALAPIFIGK KLWGLIAAYQ HTGPREWANY EVEFLGQVGA QLGVAIQQAE NLAQSKQQAD ALQNAIARQR ALTEVVGKIR SSLDINLILK TTCQEVCKML RIERVGVYRF NPDWSGEFVS NFGMVEAQWD SINPFGQNLV WEDTHLQETK GGRYRNNENF AVNDIYQVGH SRCHLDILEQ FKIRAYALTP IFVGRNLWGL LAAYQHSAPR QWDIVEVEFL GQVASQLGVA LQSSQMMSQI QTRADELQKS AEQRRILFDL VVKIRESLDL EAILKNTVQE VRRSLQADRV GIFRFDSDQG FCSGEFIAED VLPKFDSALA VKVQDYCFGD HYAPQYRQGQ VQVISDVNSV GSKVPHLDVI EQFQVKAQII VPLMEGDNLW GLLCIHQCTH PRHWEEDELE FVTQIAAQLS VALHQANLFQ QSSLLGETRA EANQLAQTLK ELRTAQMQII HAEKMASLGQ LVAGVAHEIN NPINFIHGNL EHAHQYTQEL LRYVKLYQHY HPNAAPEIQE FFQQAEIEFL FEDLPNLFQS MQVGTQRIQE IVTSLRSFSR LDEADFKTAN IHECIDSTLM ILQHRLKPSA DSHAIHVTKD YDDLPLIECY PGQLNQVFMN LLSNAIDALE EREAKLSPEV IAAHPSEIRI YTSLLNQDWM SIRITDNGLG IDEQIIPRLF DPFFTTKMVG KGTGLGLSIS YQIVTDKHKG KIYCQSELGK GTEFVVELPI LQAKINPSIT KV
|
| |