Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4885 |
Symbol | |
ID | 3679209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6149364 |
End bp | 6152279 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637720243 |
Product | PAS/PAC sensor Signal transduction histidine kinase |
Protein accession | YP_325377 |
Protein GI | 75911081 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00047345 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00891826 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGACA ATCTCGTGAC TATAGATAAA AATACTTATG ATTCTCTTCA ACAGGAACTC ATAGGACTAC GCCAGGTAGT AGCAAAGACT TTACAACTAG GGTTATTGAT TGAGTATATA CCTGCGGCGA TCGCTATTTT CGACAGTGAG ATGCGCTACT TGCTGGCTAG TCGGCGCTGG CGAGAAGACT ATGACTTAGT TGATGAGAAT ATCGTCGGTC GTTCTCATCA TGATGTTTGT CCAGAAATCA CTTTGGGTTG GCAGAACAGA TACGAGTGTT GTTTGGCGGG AAATACGGAG AAATCTGAAG AGGAAACTCT GATCCGTCGG GATGGGACTA TTGATTGGGT GAAATGGGAA ATAAATCCTT GGTATGAACC ATCAGGTGAG GTGGGTGGTT TAGCTGTTGT CTCTGAGGTA ATTACCCCAC GCAAACAATC AGACATTGCT TTAGCCGATA GTGAAAGACG TTTAAGAGAT ACGGAAGCAC GGTTGCAGCG TCTAGCTGAT AATCTGCCAG GGATGCTTTA TGAATTTTGC CTGAAACCTG ATGGCACCAT GACTTTCCCT TATGTTTCTT CTGGGTGTCG AGAACTGATA GAACGCAGTC CCCAAGAGTT ACAGGATGAT GCCTCGCTCA TTTTTGCCAA TACTCATCCT GAGGATATGA TAGGGCTGCA AGAGGCGATC GCTAACTCTG CCAAAACTCT GCAAAACTTT GAATATGAGT GGCGTATTAT CACTTCTTCT GGTCAGATAA AATGGGTTCA AGGAGTCTCC CGACCGGAAC GTCAGCCAGG AGGCGAAATC ACTTGGTATG GTTATCTATT AGATATAAGC GAACAGCAAG CCGTACTTCG TGAACGCAAG CAAGCAGACC AACAATTGCA ACAGCAAGCA CAGTTGTTAC AAAGTATATG GGAAGGTGTA GATTACGGTA TCTGTGTCTT GGATGTTTTG GATGATGAGG CAGAGTTTCG TTACGTTAAA ATAAATCCTG CCATGCACCG CATTAGTCTC TTGCCAGTTG CATCTTTTAT TGGTAAAACA ACGGCAGAGT CACTGCCACC TGAAATTGCC GATTTATATC GCCAACGCTA TCAACAATGT ATCAAGTCTC GAAAGAGCCT AGTCTTCGAG GATAATTTCT TAGTTAACGA TAAAGAAACT TGGTGGTTCG TCAATATTAC ACCCCTTGGT GATAGCAATT CACAAATTTC GCAACTCGTA GTGACAGCGA CGGAAATTAC AGAACGCAAA CAAGCTGAAC AAGAACGGCA ATTGTTTGTC TCTCTGATTG AAAATAGCAG TGACTTTATT GGTTTTGCGA CTTTAGCAGG AAAACCATTA TTTCTGAATG AAGCTGGACT TAAGTTAGTT GGTCTTGATG GTTTAGACGC TCTGAAGAAT CTCCATATTA TGGATTTTTT CTTCCCAGAA GATCAAGAAT ATATGGATAA ACACATTATG CGGGTGGTAA ATGAGCGTGG TTTATGGCAG GGTGAATATC GTTTCCGCAA CTATCAGACT CATGAAGAAA TACCAGTTGA TTTTAATATA TTTACTGTTA AAAGCTCGGA GACTGGTAAG CCTTTGTGTT TAGCAACAGT TACTCGTGAT ATTCAGGAAC GTAGAAAAGT AGAAGCATTA CTGCAAGAAC AAGAGCAATT TTTACGTAAT ATTTATGAGG GTGTTGATCA AATTATCTTT GTGGTTGATG TTTTAGAAAA CTTAGATTTT TGTTATACCG GTTGGAACTC AACAGCAGAA AGATATACAG GAATTACTCG AAATGATGCT ATTGGTAAAG CACCTGAGGA TATTTTTGGC AGTGTTGAAG GTTCTTTAGT CCGTCAACGA TACAAAAACT GTGTGGAGGC TGGTGTTAGC ATTTCTTTTG AAGATTGTTT AACTTTCCAT AATCAAGAAA CTTGGTGGTT AACTAAAATT AATCCCCTGA AAAATAGTGC TGGTAGAGTT TATTGTCTAG TTGGGACAAC TTTAGATATT ACACAGCGCA AACAAAATGA AATTCAATTG CGACAGCAAG CCGAAAATTT AGAAAACACT CTGCGTGAAT TACAACTTAC CCAAACTCAA CTTATCCATA GTGAGAAAAT GTCTTCCATT GGGAATATGG TTGCAGGTGT AGCCCATGAA ATCAATAATC CAGTTAACTT TATTCACGGT AATTTGATTC CAGCCAGTGA ATATGCTCAA GACTTGTTAC AGCTAGTAGA ACTTTATAGA CTCCACTTTC CCTATCCGCC AGAAGAAATT CAGGAATTCA TTATAGATAT TGAGTTTGAT TTCCTCAAGG AAGATTTGGT TAAGCTGCTG CAATCTATGC GTATAGGAAC CCAACGCATT CGAGAAATTG TCTTATCACT CCGCAATTTT TCCCGCCTTG ATGAAGCTGA ATTTAAGCAA GTAGATATTC ATGAAGGTAT CGATAGTACG CTCATGATTT TACAAAATCG CCTAAAAGCT AAGTCAGACC ATCCAGAAAT TTTAGTCGTT AAAAGTTATG GTGATTTACC TTTGATTGAT TGTTATCCTG GTCAGTTAAA TCAGGTATTT ATGAACCTTA TCAGTAATGC TATTGATGCT TTAGAGGAGC CAGTAGTTGA TGGTCAATTG TCAGTTGCTA AGCCTACAAT TTATATTCGT AGCGAAATGT TTAATAATAA CTGGGTGCGA GTTACCATAT CAGATAACGG TGTAGGGATT CCTCAAGAAA TCTTATCAAA ACTATTTGAC CCATTTTTCA CGACTAAGTC TGTGGGTAAG GGAACTGGCT TAGGTTTATC TATTAGTTAT CAAATTGTTG TGGATAGACA CAATGGGAAA TTAACTTGCA ACTCGACACC TGGACAAGGA GCAGAATTTA TCATTGAGAT TCCCATTCAT CAGTGA
|
Protein sequence | MTDNLVTIDK NTYDSLQQEL IGLRQVVAKT LQLGLLIEYI PAAIAIFDSE MRYLLASRRW REDYDLVDEN IVGRSHHDVC PEITLGWQNR YECCLAGNTE KSEEETLIRR DGTIDWVKWE INPWYEPSGE VGGLAVVSEV ITPRKQSDIA LADSERRLRD TEARLQRLAD NLPGMLYEFC LKPDGTMTFP YVSSGCRELI ERSPQELQDD ASLIFANTHP EDMIGLQEAI ANSAKTLQNF EYEWRIITSS GQIKWVQGVS RPERQPGGEI TWYGYLLDIS EQQAVLRERK QADQQLQQQA QLLQSIWEGV DYGICVLDVL DDEAEFRYVK INPAMHRISL LPVASFIGKT TAESLPPEIA DLYRQRYQQC IKSRKSLVFE DNFLVNDKET WWFVNITPLG DSNSQISQLV VTATEITERK QAEQERQLFV SLIENSSDFI GFATLAGKPL FLNEAGLKLV GLDGLDALKN LHIMDFFFPE DQEYMDKHIM RVVNERGLWQ GEYRFRNYQT HEEIPVDFNI FTVKSSETGK PLCLATVTRD IQERRKVEAL LQEQEQFLRN IYEGVDQIIF VVDVLENLDF CYTGWNSTAE RYTGITRNDA IGKAPEDIFG SVEGSLVRQR YKNCVEAGVS ISFEDCLTFH NQETWWLTKI NPLKNSAGRV YCLVGTTLDI TQRKQNEIQL RQQAENLENT LRELQLTQTQ LIHSEKMSSI GNMVAGVAHE INNPVNFIHG NLIPASEYAQ DLLQLVELYR LHFPYPPEEI QEFIIDIEFD FLKEDLVKLL QSMRIGTQRI REIVLSLRNF SRLDEAEFKQ VDIHEGIDST LMILQNRLKA KSDHPEILVV KSYGDLPLID CYPGQLNQVF MNLISNAIDA LEEPVVDGQL SVAKPTIYIR SEMFNNNWVR VTISDNGVGI PQEILSKLFD PFFTTKSVGK GTGLGLSISY QIVVDRHNGK LTCNSTPGQG AEFIIEIPIH Q
|
| |