Gene Ava_4432 details

Gene Information

Locus tag: Ava_4432
Symbol:
ID: 3680424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5549704
End bp: 5552712
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 45%
IMG OID: 637719785
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_324925
Protein GI: 75910629
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
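
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the record does not state either assumption explicitly, but both reproduce the listed lengths. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5549704
END_BP = 5552712
GENE_LENGTH_BP = 3009
PROTEIN_LENGTH_AA = 1002


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a complete CDS with one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(START_BP, END_BP))            # 3009
print(expected_protein_length(GENE_LENGTH_BP))  # 1002
```

Both results agree with the Gene Length and Protein Length fields listed above.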


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAGCC AACTTGATCA ATGTAATCGC CTCCTTTGGG AAAAATGTCC TGTTGGTCTA 
GTATTGTGGC GAAAAAATGG CGCATTGATA GATGTTAATC CCACTTATGC CACCATTTTG
GGGAGAACTG TCCCAGAAAC CCTCAACTTA AACTACTGGC AGATTACTCC TGAAAACTAT
CTCGCTTCGG AGCGAACCAT ACTAGAGCAA CTAGAACAGA CCGGTTGTCA CCAAGCCTAT
GAGCAAGAGT ATCTGCACAA GGATGGACAC TTAGTTCCGG TGAAAGTTTC CACTGTGATG
ATTGAGAAAG ATGGCGAGAA GCTGATGTGG TCGAGTGTAG AAGACATTAG CAACATCAGA
CAAACCCAAA AAGAACGTCA ACAATCAGAA AAAATCTTGA AACAGAGCGA AGCACGATAC
CGCTCTCTGG TGACAACTAA TACACAAATT ATTTGGGTCA GTTCACCAGA AGGAATTTGC
TTTGAACTCA AAGACTGGAT CGCTTACACA GGACAAACTT TAGCGGAAGC TGAGAACGGA
GGGTGGATTG ATGCTGTTCA TCCCGATGAT CGGGGCTACA CCGGGGAAGC CTGGGGTATT
GCTGTAGCCA ACCGTAGTCA ATACCAAATT GAATACCGGA TTCGTGGCAA GGATGGCAAT
TATCGCTACT TTTGGGTCTG GGGCGCTCCT GTTATTGAAG AAGATGGCAA TGTCCGGGAG
TGGATTGGCA CCTGTACAGA TATTCACGAT CGCAAGTTAG CCGAAGCCGA AAATCAACGT
CTGAAGGAAC GATATCGCTC TTTGGTAACT GCTAGTTCCC AAATTGTTTG GGGAACAACC
CCAGAAGGAT GGGGAATCAG TAGCGAAATG CTCACTTGGA TAGCTTATAC AGGGCAGAAT
GAAGCGGAGG TTGAGGGTTG GGGTTGGCTT GATCCCATTC ACCCCGATGA CCGTACCCGT
TCCTTAGAAG CCTGGAATGC GGCTGTAGCG AACCGGAGTA TTTACCAAAC CGAATATCAG
CTACGTGGCA AAGATGGCAC TTACCGCTAC TTCTCGGTTT GTGGTGTCCC TGTCTTAAAA
CAAGATGGCA GTATTCAAGA ATGGATTGGC ACCTGTACCG ATATTCATGA CCGCAAGCTG
GCAGAAATGG CTCTAAGGCA ACTCAATCAG CAACTGGAAG CCAGAGTAGC AGAACGGACG
GCGGCACTGC AAAATACCCT AGCTGAAGCC CAAGGATTAA ATGCCATTTT AGATAACTTG
GCAGATGGTT TGTTGGTAGT GGACACCACA GGACAAATTA CCCATTACAA TCCTGCGTTT
TTAGCTATGC ACGGATTAAC AGCCAATACC CTGAATGGAC ATTATCAGGA ACTACCTATA
TTCGGTTTAG CAGATTTGAT TGAACATACT CGATCCCATC CTGGAGAGGT GTTTACTGCT
GATGTAGCAC TGGCGAAAGA CCGCATTGGG CAAGCCGTAG GGACAGCCAT CTTTAAGCGG
ACAGACACCC AGGAAGTTGC CGCTTGCTTT GGTTCAGCGC TGTTGATTCG GGATGTAACG
GCGGAAAAAG AAATCGACAA AATGAAAACT GATTTTATCT CCACAGTTTC CCACGAACTC
AGAACACCAC TAACTTCTGT CCTTGGTTTT GCCTCCATCA TTCAAGAAAA ACTGCAAAAA
GATGTATTCC CCATCCTGTC CACCGAAGAT CGTAAACTGC AAAAAACCAT CAAGCGGGTG
GGTGACAATC TCAATATTAT TGTGTCGGAA GCAGAACGAC TCACATCTTT GATTAACGAT
GTTTTAGACA TTGCCAAGAT GGAAGCAGGT AAGGTGGAAT GGCAAATGCA GCCCATCGAC
CCTGGTGAGT TATTGGATTG GGCGACTACT TCCACAGCCG CATTATTTGA AACTAATGGT
CTACAGTTGC TCACAGAAAT TGAATCTGGA TTGCCGCAAA TCATAGGCGA TCGCAATCGT
CTATTACAAG TCCTGATCAA TTTAATTTCT AATGCCGTTA AGTTTACTGA ATTTGGCTCT
GTTACCTGTC GCGTCAAACA AGACAAAGAT GGTGTATGTA TCAGTGTCAT TGATACAGGT
GTTGGTATTG CCCCAGAAGA CCAGCCTAAA GTATTTGAGA AATTCCGCCA AGTTGGTGAT
ACCCTTACCG ACAAACCCAA AGGTACAGGG TTAGGACTAC CCATCTGTAA ACAAATTGTC
GAACATCATG GTGGCAGAAT CTGGGTGGAA AGTGAACCAG GTCATGGTAG CGTCTTCTCT
TTCCACATTC CCATCTACGC TAGCAATCAC AACACCAATG CTAATCTCAA TCTCGATGCC
TTAGTCAGAC AACTGAAAGA ACACGTCATC ACTGCCAACC AAGTATGCAG CGAAAACCGC
AAAACCATTC TTGTTGTCGA TGATGATGCC AATATTCGAG AACTACTCCG TCAACAACTA
GAAAATGAAG GCTACAACGT TCGGGAAGCC AAAGATGGTA TGGATGCGAT TCATCAAATC
AAAATATCCT CCCCTGATTT GATTGTTCTG GATGTGATGA TGCCCCAAAT TAACGGTTTT
GATGTAGCGG CTGTTTTGAA AAACGATCCC CTAACCGCAG ACATTCCAAT CATCATTCTC
TCAATTGTGG AAAATAAAGA ACGCGGTCAC CACATTGGGA TTGATCGCTA TCTCACCAAG
CCTATCAATA CAGAAGAACT TCTCAATGAA ATTGGCTCGC TCCTTTCCCA GGGTATTTCT
AGTAAGAAGG TTTTGGTTGT TGATCAAAAC GCGTCAACTT TAAAAACTAT ATCGGATGTC
TTGCAAACTC AGGGATACAA CGTGATTGAA GCCTCAGATC CCCAAGAATG TATCCGTAAA
GCCCTGTCAG CTAAACCGGA CATGATCATC ATCGATTCTA TCTTCTCTCA AGAAGCCGAC
TTAGTGAAAA CTCTCAAATT TGAAAAGGAG TTAGAGAATG TATTCTTCAT CATGCTCTCA
GACCGCTAA
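
The gene sequence above is printed in space-separated blocks of 10 bases, so it must be cleaned before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of any database tool) strip the whitespace and compute a GC percentage of the kind reported in the GC content field. How the source database rounds that value is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence above: 5 of its 10 bases are G or C.
print(gc_percent("ATGTCTAGCC"))  # 50.0
```

Passing the full gene sequence block to `gc_percent` gives the value to compare against the GC content field.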
 
Protein sequence
MSSQLDQCNR LLWEKCPVGL VLWRKNGALI DVNPTYATIL GRTVPETLNL NYWQITPENY 
LASERTILEQ LEQTGCHQAY EQEYLHKDGH LVPVKVSTVM IEKDGEKLMW SSVEDISNIR
QTQKERQQSE KILKQSEARY RSLVTTNTQI IWVSSPEGIC FELKDWIAYT GQTLAEAENG
GWIDAVHPDD RGYTGEAWGI AVANRSQYQI EYRIRGKDGN YRYFWVWGAP VIEEDGNVRE
WIGTCTDIHD RKLAEAENQR LKERYRSLVT ASSQIVWGTT PEGWGISSEM LTWIAYTGQN
EAEVEGWGWL DPIHPDDRTR SLEAWNAAVA NRSIYQTEYQ LRGKDGTYRY FSVCGVPVLK
QDGSIQEWIG TCTDIHDRKL AEMALRQLNQ QLEARVAERT AALQNTLAEA QGLNAILDNL
ADGLLVVDTT GQITHYNPAF LAMHGLTANT LNGHYQELPI FGLADLIEHT RSHPGEVFTA
DVALAKDRIG QAVGTAIFKR TDTQEVAACF GSALLIRDVT AEKEIDKMKT DFISTVSHEL
RTPLTSVLGF ASIIQEKLQK DVFPILSTED RKLQKTIKRV GDNLNIIVSE AERLTSLIND
VLDIAKMEAG KVEWQMQPID PGELLDWATT STAALFETNG LQLLTEIESG LPQIIGDRNR
LLQVLINLIS NAVKFTEFGS VTCRVKQDKD GVCISVIDTG VGIAPEDQPK VFEKFRQVGD
TLTDKPKGTG LGLPICKQIV EHHGGRIWVE SEPGHGSVFS FHIPIYASNH NTNANLNLDA
LVRQLKEHVI TANQVCSENR KTILVVDDDA NIRELLRQQL ENEGYNVREA KDGMDAIHQI
KISSPDLIVL DVMMPQINGF DVAAVLKNDP LTADIPIIIL SIVENKERGH HIGIDRYLTK
PINTEELLNE IGSLLSQGIS SKKVLVVDQN ASTLKTISDV LQTQGYNVIE ASDPQECIRK
ALSAKPDMII IDSIFSQEAD LVKTLKFEKE LENVFFIMLS DR
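
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to the 64 codons as the standard genetic code and differs in which codons may act as start codons, which does not matter here because the gene begins with ATG. The function and constant names are illustrative, and the sketch assumes the sequence is given 5' to 3' on the coding strand, as printed above.

```python
from itertools import product

# Compact codon table: codons enumerated in TCAG order, last base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*" and to_stop:
            break  # stop at the first stop codon
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGTCTAGCC AACTTGATCA ATGTAATCGC"))  # MSSQLDQCNR
print(translate("GACCGCTAA"))                          # DR
```

The two outputs match the first ten and the last two residues of the protein sequence listed above.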