Gene Ava_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0521 
Symbol 
ID3682351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp654685 
End bp657903 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content44% 
IMG OID637715849 
Productmulti-sensor Signal transduction histidine kinase 
Protein accessionYP_321040 
Protein GI75906744 
COG category[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAG GTGTTTCATT TTTCTCAATA TCTATGATAA GAAACTCTTA TGAGCCGCCC 
AACCCTTCAG ATTTGCCAGA TTGTCACAAC CAGCTACGCC TGACAGCACA GTATCAGAAA
ATATTGGTGA GAATTCTGGC TAAAATTCGC TCATCGGTAA ACATAGAATC TTTGTGCCGG
ACTTCTTGTC AAGATATTTG TGGGCAATTG CAGATTGAGC GATTAGCTAT TTACCGCTTC
AACAGTGATT GGAGCGGTAG TTTTATCAAT CGTTTTGGCT TTGCAGAACC ACCTTGGGAT
AAACTCACTG CTTTTGGACA GGACTTGGTG TGGCAAGATT CCCATTTGCA AGAAACTCAA
GGGGGGCGAT ATCGCAAAAA TGAGCCGTTT GCTACGGCTG ATATTTACGA TGCAGGACAC
TCTCGTTGTC ATATTGAGGT GTTAGAACAA TTTCAGATTC GAGCATATGC GATCGCTCCG
ATTGTCGTTG GTGCAAAACT GTGGGGTCTA CTAGCGGCCT ATCAGCACTC TGCGCCTCGA
CAATGGCATC AAAACGAGGT GGAATTTATA GCCCAAGCCG CTAGCTATTT GGGTGTAGCA
ATGCAGCAGG AAGAAATCAT CAAGGAAGCC AAACAACGTA CAGTAGAATT GCAAGATGCT
ATTGCCAGGC AACGAGCTTT GATGGAAGTT GTAGGCAATA TTCGCTCATC TATCAATACG
GAAATTATTT TAAACACAGC TTGCCAAGAA TTGTGCAAAC TTCTGAAGCT AGAACGGGCA
GCAGTTTATC GCTTTAATGA AGATTGGAGC GGTGAATTTG TCAGTCAATT TGGGATGGTA
GAAACACAGT GGCACAGAAT CAGCCCCTTT GGCAAAAATC TGGTTTGGGA TGATACCTAC
CTCCAAGAAA CCAAAGGTGG GCGTTACCGC CATAACGAAA CTTTTGCGGT TAATGATATC
TATGAGGCTG GACATACACG CTGTCATATT GACATTCTTG AGCAATTCAA GATTTATGCC
TATGCTTTAG CACCAATTTT CATCGGTAAA AAACTGTGGG GACTCATAGC AGCTTATCAA
CACACTGGCC CACGAGAATG GGCAAATTAT GAAGTGGAGT TTCTCGGACA GGTGGGCGCG
CAATTAGGGG TGGCAATTCA GCAAGCGGAG AATTTAGCCC AATCGAAACA ACAGGCTGAT
GCTCTGCAAA ATGCGATCGC TCGGCAACGC GCCCTCACAG AAGTAGTGGG AAAAATCCGC
TCTTCCCTAG ATATTAATTT AATCTTGAAA ACCACCTGCC AAGAAGTATG TAAAATGCTA
CGAATTGAGC GGGTAGGAGT TTATCGCTTT AATCCTGACT GGAGTGGTGA ATTCGTCAGT
AATTTCGGTA TGGTGGAGGC GCAATGGGAC AGCATTAACC CCTTTGGCCA AAATCTGGTT
TGGGAGGACA CCCACCTCCA AGAAACCAAA GGGGGACGCT ATCGCAACAA CGAAAATTTT
GCAGTCAACG ACATTTACCA AGTCGGACAC TCACGCTGTC ACTTAGATAT TCTGGAGCAG
TTTAAGATTC GAGCTTATGC CTTAACCCCA ATTTTTGTCG GACGGAATCT GTGGGGACTG
CTGGCCGCCT ATCAACATTC TGCACCGCGC CAGTGGGATA TTGTAGAAGT AGAATTTTTG
GGACAAGTAG CTAGTCAACT GGGAGTGGCG TTGCAAAGTT CTCAAATGAT GAGCCAAATT
CAAACCCGTG CTGATGAACT GCAAAAATCG GCTGAACAGC GCCGAATTTT ATTTGATTTG
GTTGTCAAAA TCCGAGAATC TTTGGATTTA GAGGCAATTT TGAAAAATAC TGTGCAGGAA
GTGAGGCGAT CGCTGCAAGC AGACCGAGTC GGCATCTTTC GCTTTGATTC TGATCAGGGT
TTTTGTAGTG GTGAATTTAT TGCTGAAGAT GTGTTACCCA AGTTTGATTC TGCTCTAGCT
GTGAAGGTGC AAGACTATTG CTTCGGTGAC CATTATGCAC CTCAATATCG CCAAGGGCAA
GTACAAGTAA TTTCCGATGT CAACAGTGTT GGCTCTAAGG TACCCCATCT GGATGTGATT
GAGCAATTCC AAGTCAAAGC TCAGATTATT GTGCCATTGA TGGAAGGTGA TAATCTGTGG
GGATTATTAT GTATTCACCA ATGCACTCAT CCACGTCATT GGGAAGAAGA CGAATTGGAA
TTTGTCACTC AAATAGCAGC TCAACTCAGT GTGGCACTAC ATCAAGCTAA CTTGTTCCAA
CAATCGAGCT TGCTTGGTGA GACTCGTGCA GAAGCAAATC AACTTGCCCA AACACTGAAG
GAACTCCGCA CTGCCCAAAT GCAAATTATC CACGCTGAAA AAATGGCCAG TTTGGGGCAG
TTGGTAGCAG GAGTTGCCCA CGAGATTAAT AATCCAATTA ATTTTATTCA CGGCAATTTA
GAACACGCCC ATCAATATAC CCAAGAGTTG CTGCGTTATG TGAAACTTTA TCAGCACTAT
CACCCCAATG CAGCTCCGGA AATACAAGAG TTTTTCCAGC AAGCCGAAAT CGAGTTTTTA
TTTGAGGACT TACCTAACTT ATTCCAGTCA ATGCAGGTAG GTACTCAGCG CATTCAGGAA
ATTGTCACCT CTTTACGTAG CTTCTCACGC CTAGACGAAG CTGACTTCAA AACAGCAAAT
ATTCATGAAT GTATCGACAG CACTTTGATG ATTTTACAAC ATCGCTTAAA GCCCTCCGCT
GATAGCCACG CCATTCATGT GACCAAGGAC TATGATGATT TACCTCTCAT AGAGTGCTAT
CCCGGTCAAT TAAACCAGGT ATTTATGAAT TTGCTGTCTA ATGCCATTGA TGCCTTAGAA
GAACGGGAAG CCAAGCTATC TCCTGAAGTA ATTGCCGCAC ATCCCAGCGA AATCCGCATT
TATACATCTC TACTCAATCA AGACTGGATG AGTATTCGGA TTACCGATAA TGGACTCGGC
ATTGATGAGC AGATAATCCC CAGATTGTTC GATCCATTTT TTACTACTAA GATGGTCGGT
AAAGGTACTG GACTGGGACT TTCGATCAGC TATCAGATTG TCACAGATAA ACACAAAGGC
AAGATTTACT GCCAATCAGA ACTTGGTAAA GGTACAGAGT TTGTCGTCGA ATTACCAATT
CTTCAGGCAA AAATTAATCC ATCAATAACG AAAGTCTGA
 
Protein sequence
MKLGVSFFSI SMIRNSYEPP NPSDLPDCHN QLRLTAQYQK ILVRILAKIR SSVNIESLCR 
TSCQDICGQL QIERLAIYRF NSDWSGSFIN RFGFAEPPWD KLTAFGQDLV WQDSHLQETQ
GGRYRKNEPF ATADIYDAGH SRCHIEVLEQ FQIRAYAIAP IVVGAKLWGL LAAYQHSAPR
QWHQNEVEFI AQAASYLGVA MQQEEIIKEA KQRTVELQDA IARQRALMEV VGNIRSSINT
EIILNTACQE LCKLLKLERA AVYRFNEDWS GEFVSQFGMV ETQWHRISPF GKNLVWDDTY
LQETKGGRYR HNETFAVNDI YEAGHTRCHI DILEQFKIYA YALAPIFIGK KLWGLIAAYQ
HTGPREWANY EVEFLGQVGA QLGVAIQQAE NLAQSKQQAD ALQNAIARQR ALTEVVGKIR
SSLDINLILK TTCQEVCKML RIERVGVYRF NPDWSGEFVS NFGMVEAQWD SINPFGQNLV
WEDTHLQETK GGRYRNNENF AVNDIYQVGH SRCHLDILEQ FKIRAYALTP IFVGRNLWGL
LAAYQHSAPR QWDIVEVEFL GQVASQLGVA LQSSQMMSQI QTRADELQKS AEQRRILFDL
VVKIRESLDL EAILKNTVQE VRRSLQADRV GIFRFDSDQG FCSGEFIAED VLPKFDSALA
VKVQDYCFGD HYAPQYRQGQ VQVISDVNSV GSKVPHLDVI EQFQVKAQII VPLMEGDNLW
GLLCIHQCTH PRHWEEDELE FVTQIAAQLS VALHQANLFQ QSSLLGETRA EANQLAQTLK
ELRTAQMQII HAEKMASLGQ LVAGVAHEIN NPINFIHGNL EHAHQYTQEL LRYVKLYQHY
HPNAAPEIQE FFQQAEIEFL FEDLPNLFQS MQVGTQRIQE IVTSLRSFSR LDEADFKTAN
IHECIDSTLM ILQHRLKPSA DSHAIHVTKD YDDLPLIECY PGQLNQVFMN LLSNAIDALE
EREAKLSPEV IAAHPSEIRI YTSLLNQDWM SIRITDNGLG IDEQIIPRLF DPFFTTKMVG
KGTGLGLSIS YQIVTDKHKG KIYCQSELGK GTEFVVELPI LQAKINPSIT KV