Gene Ava_1021 details


Gene Information

Locus tag: Ava_1021
Symbol:
ID: 3678689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1230959
End bp: 1234288
Gene Length: 3330 bp
Protein Length: 1109 aa
Translation table: 11
GC content: 45%
IMG OID: 637716357
Product: histidine kinase
Protein accession: YP_321540
Protein GI: 75907244
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
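
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
print(span_length(1230959, 1234288))   # 3330, matches "Gene Length"
print(expected_protein_length(3330))   # 1109, matches "Protein Length"
```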


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.103758
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCCCC AAAGCCTTAT GTCAATGCTA CAGTATCCGC TTGATGAATT TCTAGCCAAT 
GTAACCAGTT GCTTAGAAAC AAGTACTCTG GCAATGGTGC TGGAGGTTTT TGAGCAACAG
CAATGCGATC GCCTACCGCC CGCCTTTGGC GATCGCTTGG TGATAGTAAA TGAGCAGCAA
TGCCCCGTTG GGTTGCTGCA CTCTGGCCAG CTAGCACGGA AGTTGTTAGC AGCCACAGGT
AATGAACTAT TTTTAAACTT ACAACAGCCA CTGTCAACCT TTAGTCAAGC CTTAATTGAA
CCAATACAGA TATTACCAGC CAGCCATAGT GTAGAGCAAT TAGGCGTATT TTGGCGGTAT
CACCAAGCCC ATAACTGGGA ATATGTACTG GTTGATGAAG AAGGTAAATT CTTGGGGATT
CTCGACAGTG TACGTTTATT GGCAGCATTG GCAAAAGAGC AAACAGCGAA TCATTCTCAT
GCCCCTAACT CTACATATGC CCCTAACTCT ACATATATAG ATGGTGTCGA GGCTGATTTG
AGGGGATCAC CTATGGAGAG TGGAACGCCG ACGCTACATT CATTGCAGGA TGTCGCCCGT
TCTCTGCGAG ATACTTCACC AGCGTCTAGA GGTGACCAAC TCCAACCATT GGTACAGTTA
TTAGAGCGTC TACCTTGGCC TCTCATGTTG CAAACCAGTG CAGGTGAAGT GTTAACACGA
AATTTAGCTT GGTGGCAACA ATTGGGAGCG TTGAAAGATC CAGAGGGTGT GCGTCAACAA
GTGGAAGCCA TTTTGGCAGC TACCAGGATT AAACAGCCAG AATATGTTGG TCAAAGAAGT
ACCACAGGTA CAAATCATAT TCAAGGCTAT GAGCAAGTAG GAGAGGAGTT AGCCCAAATA
CCAGCCACAG GTTTGCTACC CCTGTCACCC TACGAACCAA CAACCAGCGC CAAAACTAGC
TCTAGTCGCT GCTTTTTAGA TAGTCAAATG GGTACTTGTA CCTGTGTGGT AGAAGTGCAG
AACGGACAGG AGCGAGTTTG GCAGTTTGCC AAAATTCTGT TAGACAATAC CGATTTAAAA
TTGCCCAACA CCGATGATTT ATGGCTGGTG TTGGCGACGG ATGTCACAGA ACAGCAGCAA
CTTTGTAAAG AACTTGCAGC TAAAAATGCT GATCTCATTC AGTTAAATCG CTTAAAAGAT
GAATTTTTAG CTTGTATTAG TCATGAGTTG AAAACACCCT TAACGGCAGT TTTAGGCTTA
TCGCGGTTGC TAGTAGACCA GCAGTTGGGG GAACTGAACG AACGCCAAGC CCGTTATGCG
GGGCTAATTC ACCAAAGTGG TCGCCACCTG ATGAGCGTGG TAAATGACAT TTTGGATTTA
ACTCGCATGG AAACAGGGCA AATGGAATTG ACCCCAGTAC CAGTGAGTAT TCGGGCTGTG
TGCGATCGCG CCCTTTCTGA AGCCAAAGCG ATCCATAATC AAACTAGTAA AGGCAGCACC
ACAGAAGCCA CTCGCCAATC TACGCCAGAA TTTACACTTT CCATTGAACT GGACTTAGAC
CAGATAGTGG CTGATGAACT GCGTCTGCGG CAAATGCTAG TTCACCTGCT GTCTAACGCT
TTCAAATTTA CAGAAATATC CGGGGAAATT GGCTTAAGAG TCAGTCACTG GGAAGGATGG
ATTGCCTTTA CAGTTTGGGA TACAGGGATT GGTATTCCCG AACACCAACA ACATTTAATT
TTTCAAAAGT TCCAACAATT AGAAAATCCT CTGACTCGCC AATTTGAAGG TACAGGCTTA
GGACTAGTCT TAACTAGGGC TTTAGCTCGT CTCCACGGCG GTGATGTCAG CTTTTTGTCG
CGGGAAGGTA AAGGCAGTCA ATTTACCTTG TTGTTACCAC CTAGCCCACC CAGCAGTGGT
TTTTCTGAGT CGGAAGTGGA AGTAGAAGAT ATGCCCACCT CCAACACCCG CAATCGGGTA
ACTAATACTG GACAAAATCA TCCAGTTTCC TGTCAAAGAT TAGTCCTAGT GGTGGAAGCT
GTAGCCCGAT ATATAGAAGA TTTAACCGAC CAACTGAAAA CTTTAGGCTA TCGAGTAGTA
ATTGCTCGTT CAGGAACAGA AGCTGTCGAA AAAGCCAGGC GTTTACAACC AAAAGCCATC
TTTTTAAATC CCCTGTTACC TTTGCTGTCT GGTTGGGATG TGCTGACTCT GCTGAAATCT
GATAGCGCAA CTCGTCATAT TCCGGTAGTT GTGACCGCTA CAGGGGCAGA AAAAGAAATA
GCCTTTGCTA ACCGCGCCGA TGGTTTCCTC AGCTTGCCAG TAGAACAGCA ATCCCTGACA
CCATTGTTAG ATAAATTATG TGGCAAATCG ACGGTGCAGT CACTAGGTTT AGGGATTAAC
GAAAGGAATC AATCAAAAAA GACTCTACGG ATTCTGCGGT TAGTGGATGT GGAACTGGAG
TCTATCAATC CCCATCCTTC ACTACAAGAA CACCGGGTGA TTGAAGTGGA TGATTTAGAC
CAAGCCGAAC TTTTAGCCAG GGTTTGGCAG TTTGATGTCA TTCTGTTGGA TGTAGAAACT
TCTCTAGCCA AAGCTTATTT ACAACAGTTA ACTCAGCATC CCCGCCTAGC CGCTTTACCT
TTAGTCACTT GTGACGTGGA AACAACCTTG GCTGCTTCCC AAATCCCTGG TCTTTCGGTG
TTTCCTTGTT TAACACCCCG TGTGAAAGAT CAGGACAGCG AACTACGCAA AGGTAAAGTC
GATCCCTTAC TATCGGTGTT ACAAATTGCC TCCGGTATCT GCTACCCACC AAGTATCTTT
GTAGTAGACT TGACCATGTT AGAGGATTTA CCACAAGCTA GACGCAAGCC AGTCAAGGGT
TCTGGTACAG AGAGAAGAAC TGTCAGCCGT GGAGAAGTTG CAGAACGGGG AACTGAATGG
TTCCAAGCTT TAATTCAGTA CTTACAAACA GCTGGCTTGA AAGCCACAAT GGCGCGCTGT
TGGGCAGAAG TTTTACAACA AATTGGCCAT AATAGCGTCG ATTTACTGCT GATTTGTCTA
GGAGAATCAG CTATCCATCC AGAAGCCGTC AATGCTTTGA AAGCATTACA GGATTTACCC
TGTAATTTGC CACCGATTTT AGTTATTGAC CAACAATTAA GTCGCACTCA AGTTACTTCT
CGCAGTAGAC TGACTCATCA TAAAAAGCAG GAGTCAGAAT CCATCACAGA TGTGGCAAAA
GCGATCGCTA CCCAAATTGT CCCACGTTCC ATCTCAATGG AAGACCTGTT AACTCAAATT
CATCAAACTT TGACTCTCAA TGAACGCTAA
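
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before it can be measured. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind that underlies the "GC content" field. The helper names are hypothetical, and the example runs on a short made-up string rather than on the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input laid out like the listing above (not the real gene).
toy = "ATGC GGCC\nAATT"
seq = clean_sequence(toy)
print(seq, len(seq), gc_fraction(seq))
```

Applied to the full listing, `len(clean_sequence(...))` should agree with the "Gene Length" field if the listing is complete.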
 
Protein sequence
MIPQSLMSML QYPLDEFLAN VTSCLETSTL AMVLEVFEQQ QCDRLPPAFG DRLVIVNEQQ 
CPVGLLHSGQ LARKLLAATG NELFLNLQQP LSTFSQALIE PIQILPASHS VEQLGVFWRY
HQAHNWEYVL VDEEGKFLGI LDSVRLLAAL AKEQTANHSH APNSTYAPNS TYIDGVEADL
RGSPMESGTP TLHSLQDVAR SLRDTSPASR GDQLQPLVQL LERLPWPLML QTSAGEVLTR
NLAWWQQLGA LKDPEGVRQQ VEAILAATRI KQPEYVGQRS TTGTNHIQGY EQVGEELAQI
PATGLLPLSP YEPTTSAKTS SSRCFLDSQM GTCTCVVEVQ NGQERVWQFA KILLDNTDLK
LPNTDDLWLV LATDVTEQQQ LCKELAAKNA DLIQLNRLKD EFLACISHEL KTPLTAVLGL
SRLLVDQQLG ELNERQARYA GLIHQSGRHL MSVVNDILDL TRMETGQMEL TPVPVSIRAV
CDRALSEAKA IHNQTSKGST TEATRQSTPE FTLSIELDLD QIVADELRLR QMLVHLLSNA
FKFTEISGEI GLRVSHWEGW IAFTVWDTGI GIPEHQQHLI FQKFQQLENP LTRQFEGTGL
GLVLTRALAR LHGGDVSFLS REGKGSQFTL LLPPSPPSSG FSESEVEVED MPTSNTRNRV
TNTGQNHPVS CQRLVLVVEA VARYIEDLTD QLKTLGYRVV IARSGTEAVE KARRLQPKAI
FLNPLLPLLS GWDVLTLLKS DSATRHIPVV VTATGAEKEI AFANRADGFL SLPVEQQSLT
PLLDKLCGKS TVQSLGLGIN ERNQSKKTLR ILRLVDVELE SINPHPSLQE HRVIEVDDLD
QAELLARVWQ FDVILLDVET SLAKAYLQQL TQHPRLAALP LVTCDVETTL AASQIPGLSV
FPCLTPRVKD QDSELRKGKV DPLLSVLQIA SGICYPPSIF VVDLTMLEDL PQARRKPVKG
SGTERRTVSR GEVAERGTEW FQALIQYLQT AGLKATMARC WAEVLQQIGH NSVDLLLICL
GESAIHPEAV NALKALQDLP CNLPPILVID QQLSRTQVTS RSRLTHHKKQ ESESITDVAK
AIATQIVPRS ISMEDLLTQI HQTLTLNER
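
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds a codon table from the conventional TCAG ordering; for translation table 11 the amino acid assignments are the same as in the standard code (the tables differ in which codons may act as starts). It assumes the input is already a clean, in-frame, forward-strand string, and `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last three codons of the gene sequence above.
print(translate("ATGATCCCCCAAAGCCTTATGTCAATGCTA"))  # MIPQSLMSML
print(translate("GAACGCTAA"))                       # ER
```

Both outputs match the corresponding ends of the protein sequence listed above.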