Gene Ava_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3043 
Symbol 
ID3681161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3767386 
End bp3770754 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content42% 
IMG OID637718389 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_323548 
Protein GI75909252 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000641159 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTG ACGAATTAAC TATGCCATTC AACGAGCTAC CGATTGACTT ACATAATTTA 
TATCATTTGA TCGATCGTCA TCCTTTAACG ATCGCTCCTG ATAGTTATGT TATTGATGCT
ATTAGATTGA TGAATCAACA AGGTAATAGT CCACCATTCA TCAGCTTAAA CTCATCGGGA
ACTTATAGCA ATAAGAATTC AAAACAAACT AGCTATGTTT TAGTAGTTGA GGCAGGAAAT
ATCTTAGGAA TCTTTACAGA GCGGGATTTA GTCAGGTTGG CTGCATCAAA ATTTGATTTA
TCAGATTTAA AAATATCTGA GGTGATGACA CAACCAGTGA TCACGATGAA AATGTCAAAC
TCCCCAGATA TTTTTACAGC TTTGTCCTTA TTAAATCACC ATCAGATTCG CCATTTGCCA
GTTTTGAACA GTAGAGAGCA ATTAATAGGG GTTGTGAGTG CAACTAGCTT ATTGCAAGGA
TTGCATCAAT TAGAAAGCTT TTACTGTCTG CAATCCTTAC AACAAGCACA ACAGATAGAA
TTTTTACGCT ACCCGTCTCG TAACCCTCTC AACGAAGTGG AACATCACGA CGAATTTCTG
GGATTGAGGA AGGTTTGCCA ACGAGGCATT GTTGAGGTTC AACGGGCAGA ACAAGCTTTA
CAGCAAAGTG AGGAACTGTA TCGCCAGTTA GTTGAGCTGC AAAATGAGGT AATTTTGCGT
GTTGATAATT TAGGGAGACT GACATTTGTT AATTCAGTAG CTTGCAAAAT TTTTGGCAAG
CCAATGGATG AGTTAATTGG TCAGCTAATA TTTGAGTGGA TGTTCCCTGA AGATGAAATC
CCTCAAGCAA AAGAATATTT CCAAGCCCTC AAATCACCAC CTTATCAAAT CAGTATTAGC
GAGCAGCCTG TATTAACACC TAGTGGTATA CGTTGGTTTC AGTGGAATAT TATCGGCATT
GAAAATACCA TTGGGGAGGT TGTTGAGTTT CAAGGAGTAG GCAGAGATAT TACCGAACGC
AGACAGGCAG AAGAATCGCT ACGACAGAGT GAGGCAAGAT TAAATTTAGC CTTGGAAGCC
GCCAACATGG GTATCTGGGA TTGGTATCTT TTGACCAATG AAACCATCTG GTCTGCCAAT
ATGGGATTAC TGTATGGTCT GCCAAGTACA ACTTTATGTC CCAGTCCTGA AGACTTCCTG
CAATTAGTCC ATCCTGAAGA CCGAGAGAAG TTCTCTCAGT CTGCCAAGAA CAGCATTGAG
CAAGGAATAC CATTTACTAT CGAATATCGG ACTGTCTGGA ATGATGGCAG CATTCATTGG
CTCAATAGTA AGGGTCAAGT CTACTATGAC AAAGCGGGTA AACCAATCAG GATGATTGGC
ACTACTAGAG ATATTAGCGA ACGCAAACAA GCAGAGGCAT CCCTACGGGA AAGTGAGGAA
CTTTATCGCT CAGTGGTGAC AGCTATGAGT GAGGGTATTG TTTTGCTACA TACTGATGGC
CAAATTATTG CTTGTAATGC AAGTGCCGAG AGAATTTTAG GGTTAAGCCG AGAGCAAATA
TTAGAACGTA CCTGTGTTGA TGAGCGTTGG CTGACTATTC ATGAAGATGG CTCTCCGTTT
CCTCGTGAAC AGCATCCAGC AATGGAGACG CTACGCACAG GCAAACCCTG CTCTAATGTG
GTCATGGGAG TTCACAAGCC AGACGGACAA ATAAGCTGGA TTTCGATTAA TTCTCAACCT
TTGTGTCGGG AAAATGAAAC GGTTCCTTAT GCAGTAGTTA CATCCTTTGC CGATATTAGT
GAACAGCAAG CTGCACTACG CTCACGCCAA CAAGCAGAGC AGAAGATTCG TGAGCAAGCA
GCTTTACTAG AAATAGCCAC TGATGCCATT TTTGTTCGAG ACTTACAAAG CAATATCTTA
TTTTGGAATC AAGGTGCAGA ACGCTTGTAT GGTTGGTCAC AACAGGAAGC TATCGGCCGT
AATACCCAAG AATTACTATA TTCTGAAACT TCATTTCACC TGCATGAAGC AGCTCTCAAT
GTTGTTATGG AGTTGGGATT GTGGCAGGGT GAGTTGCAAA AACTGACTAA ATCTGGCAAG
GAAGTTATTG TGGAAAGCCG TTGGACACTA ATGCGTGATG CAGCTGGGCA ACCCAAATCA
ATTTTAGTTG TTGATACCGA TATCACCCAA AAAAAACAAC TAGAAGAACA GTTCTTCCGC
GCTCAAAGAT TAGAGAGTCT GGGTACCCTT GCAGGTGGTA TTGCTCACGA CTTGAATAAT
ATCTTGACCC CCATTTTGGC AGCGTCCCAA CTCCTGAAAG TCAAATTTCC CGACGAGCAA
GGACGAACAC CTATGGTAGG CAGTACATCT TTTCAACACC TGTTAGAGAT TGTGGAAAGC
AACGCCAGAA GGGGAGCAGG TTTAGTCAAG CAGGTGTTAT CCTTTGCACG CGGTTTTAAA
GGAGAACGGA CAATAGTCCA ACTCAAACAC TTAATTACCG ATATTATTCT GATTGGTAAA
CAGACGTTTC CCAAGTCAAT TGAATTTATC AGTAGCTTTC CCGAAGCACT GTGGTCAGTC
TGTGGGGATG TCACTCAATT ACACCAAGTG TTGATGAATC TTGTCGTCAA TGCTCGTGAT
GCCATGCCCA ATGGTGGGAA GATCAATGTT ACAGCAGAAA ACATTTTTAT TGATGAAACC
TATGCCAGCA TGATTTTAGA GGCGCAAGTT GGCAACTACA TTCAGCTGAC AGTCACTGAC
ACTGGTATAG GAATGCCTCC AGAAATCTTA AATAGAATTT TTGAGCCATT TTTTACGACA
AAAGAGGTGG GAGCAGGTAC GGGTTTGGGA CTGTCAACTG TGTTGGGAAT TATCAAAAGT
CATGGAGGTT TTGTCAAGGT CTCCAGCAAA GTTGGTCAAG GTAGCCAATT TAAGCTGTTC
TTGCCAGCAA TTCCAGCAAC GCCAGATTTA ACGATAGAAG AAATAGAAAC ACCATCGGGT
AGCGGTGAAT TAATTTTAGT TGTCGATGAT GAAGCACCAA TTTGCGAAAT TGCCAAGGTG
ATTCTCGAAA AGTTTAACTA TAAAATTCTT ACCGCCTGTA ATGGTATTGA GGCGATCGCA
CTTTATGCCC AACACAAACA TCGTATTAGC GCTGTTCTCA TGGATATGAT GATGCCAGAA
ATGGACGGTA TCACAGCCAT TCGCACTTTG AAAAAAATGA ACTCAAAGGT ACAAGTTATT
GCTAGTAGTG GGATCAATTC TACAGAAACA GTGGCGCAAG CAGCCATGAT AGGTGTGCAG
CAAGTTTTAC CCAAACCTTT TACAGCCAAG GAATTATTAA ATAGCTTACA TCACGTACTT
AGATGTTGA
 
Protein sequence
MKIDELTMPF NELPIDLHNL YHLIDRHPLT IAPDSYVIDA IRLMNQQGNS PPFISLNSSG 
TYSNKNSKQT SYVLVVEAGN ILGIFTERDL VRLAASKFDL SDLKISEVMT QPVITMKMSN
SPDIFTALSL LNHHQIRHLP VLNSREQLIG VVSATSLLQG LHQLESFYCL QSLQQAQQIE
FLRYPSRNPL NEVEHHDEFL GLRKVCQRGI VEVQRAEQAL QQSEELYRQL VELQNEVILR
VDNLGRLTFV NSVACKIFGK PMDELIGQLI FEWMFPEDEI PQAKEYFQAL KSPPYQISIS
EQPVLTPSGI RWFQWNIIGI ENTIGEVVEF QGVGRDITER RQAEESLRQS EARLNLALEA
ANMGIWDWYL LTNETIWSAN MGLLYGLPST TLCPSPEDFL QLVHPEDREK FSQSAKNSIE
QGIPFTIEYR TVWNDGSIHW LNSKGQVYYD KAGKPIRMIG TTRDISERKQ AEASLRESEE
LYRSVVTAMS EGIVLLHTDG QIIACNASAE RILGLSREQI LERTCVDERW LTIHEDGSPF
PREQHPAMET LRTGKPCSNV VMGVHKPDGQ ISWISINSQP LCRENETVPY AVVTSFADIS
EQQAALRSRQ QAEQKIREQA ALLEIATDAI FVRDLQSNIL FWNQGAERLY GWSQQEAIGR
NTQELLYSET SFHLHEAALN VVMELGLWQG ELQKLTKSGK EVIVESRWTL MRDAAGQPKS
ILVVDTDITQ KKQLEEQFFR AQRLESLGTL AGGIAHDLNN ILTPILAASQ LLKVKFPDEQ
GRTPMVGSTS FQHLLEIVES NARRGAGLVK QVLSFARGFK GERTIVQLKH LITDIILIGK
QTFPKSIEFI SSFPEALWSV CGDVTQLHQV LMNLVVNARD AMPNGGKINV TAENIFIDET
YASMILEAQV GNYIQLTVTD TGIGMPPEIL NRIFEPFFTT KEVGAGTGLG LSTVLGIIKS
HGGFVKVSSK VGQGSQFKLF LPAIPATPDL TIEEIETPSG SGELILVVDD EAPICEIAKV
ILEKFNYKIL TACNGIEAIA LYAQHKHRIS AVLMDMMMPE MDGITAIRTL KKMNSKVQVI
ASSGINSTET VAQAAMIGVQ QVLPKPFTAK ELLNSLHHVL RC