Gene Ava_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3004 
Symbol 
ID3681239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3725337 
End bp3728162 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content37% 
IMG OID637718350 
ProductSignal transduction histidine kinase 
Protein accessionYP_323509 
Protein GI75909213 
COG category[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTTA TCATCAATTA TTCATCAAAT AAAGCAATAA TGTTGTTTAC CGACAGTGAA 
AGCCTACGCT TAGAAGCGCT CTATCAATAT TGCATTTTAG ATACACCACC AGAAGAAGTA
TTTGACGATC TGGTAAATAT TGCTGCCGAT AGCTGTAATA CACCAATTGC TCTGATCAGT
ATTTTAGATT CTCAAAGAGA ATGGTTTAAG TCAAAAGTAG GAATTTCTGA ATTAGAAATA
CCACGTGATA TATGTCTTGG TATTCATACA ATTGGGCAAA ATGACATATT GATTATTCCT
GACACCTGGC AAGATGAACG ATTTGTAGAA AATCCCCTAG TGAGGCAAAA GCCAAGCGCT
TTCCGTTTCT ATGGAGGAGT TCCCTTAATT AACTCTGAGG GTTTTGCCTT GGGTTGTCTG
GCCGTGATAG ATTTTACCCC CCGTAACTTA AGTTTAAAAG AACAGCAAAT ACTCCAACGT
TTAGCGCGTC AAATCATCAG ACAATTAGAG TTACACAAAA AGCGAATTAA TCATGAAAGC
AGCTTCAATG CTCATCTCTT GTTCACTAAT AATCCTCGTC CTATGTGGAT ATGTGAGATG
CAAAGTCTAC AAATTTTAGA TGTTAATCAA GCTGCTATTA CACAATATGG TTACTCAAAA
ACAGAGTTTT TACAAATGCA GTTTGCCCAA GTTTTTGTAC CTGAGTTTAT ATCGGACTTA
ATCAGGGATA TAGAACAGGA ATATTCTCAA CTTCCCTTCC TAATGGAATG TCAACATCGT
CTAAGGGGTG GACAAGTTAT TGATGTTGAA TTAGCTATTA ATTATATAGA ATATTCAGGT
TATCAAGCTT GTTTAGTTGA TACCATAAAT ATTACTGAAC ATATTCAAAT AGAACGGAAT
CTACAAAAAA GTGAAGCCAG AGTTAGAACT ATTCTGGAAG CAATTCCCGT ACCTTTGGTG
ATTTCCCGCG TTGATGATGG CTTAATTTTA TATACTAATT CAGAGTTTCT GCAAACATTC
CAACTATCTG GAAATGATTT AATTAATCAC TATGCCGCAG ATTTATATGA AAACTCCGAA
GACCGACAGC AGATATTAGA AGCTCTTAGT CAACATGGAT CACTTCAGAA TTATGATATT
CAATTTAAGA AAAGTGACGG AACTTCATTT TGGGCGATCG CCTCAATTCA GTACTTAAAT
TTCAACAACG AGTATGCAAT TTTAACCGTC CTCTACGATA TTACAGAGCG CAAAAATATT
GAAGCCAAGT TACAAGAGCA AAATGCACTT TTACAAAGTA TTTTTGCTGG TATCCCGTTG
ATGATTGCAC TGATTAGTCC TGAAGGTCAG GTTCAATGGA TAAATCAAGA ATTAGAGCGT
CTTTTAGGTT GGAGTTTGAG AGATTATCAA ACCCTTGATA TTTTTGCAGA GTTATATCCT
CAGCCTGAAT ATCGTCAATC GGTCATCAAG TTTATGCAAT CAGGAGAATG TATTTGGGGT
GATTTTAGAA CTCAGACACG ATATGGGCAA GTATTAGATA CTTCTTGGAT AAATATCAAA
CTTGCCGACG GTCGAATAAT TGGTATTGGT CAAGAAATTA CCGAGCGTAA ACAAACCGAA
CGGGCTTTGA AAGGACAGAT TGAGCGAGAA CAGTTAATGC GTGCTGTTGC TCAACGAATT
CGCCAATCGC TGAATCTACA AAATATTCTG AATGCCACAG TTAAAGAAAT TAAAGACTTG
CTTGGGGTTG ATCGGGTCGT GGTTTATCAG TTTGCCCCAG ATATGAGTGG TAAAATCGTA
GCAGAATCGG TGAAGCCTGG ATGGAAAATT GCTCTAGGTG CAGATATTCA AGATAATTGT
TTCCAGTCAG GCGCAGGAGC AGATTATTGT CAAGGACATA AAAGAGCGAT CGCCAACATC
TACACAGCTG GATTAACTGA TTGTCATCTG CATTTATTAG AACAATTTCA AGTTAAAGCT
AACTTAGTTG TACCTATTCT GTTAGAAGTG AGTGAAGGCA ACACTGTGCC GCAGCTTTGG
GGTTTATTAA TAGCCCATCA ATGTTCTACC CCACGGGACT GGGAAGCGCA TGAACTAGAT
TTGCTCGACC AACTTTCCGT CCCCATTGCG ATCGCCATCC AGCAATCAAG CATACTTCAG
CAAGCCCAAA ATGAATTGGC TGAACGCCAA AAAGTAGAAG TTCGCTTGAG AAGTGCCTTA
GCGGAAAAAG AGGTTTTACT CAAAGAGGTT CATCATCGGG TTAAAAATAA TTTACAGATA
GTTTCTGGAT TATTACTACT TCATTCTCAA ACACTCAAAG ACCCAGAATT AATCAGAACT
CTGCAAGAAA GTCAAAACCG TATTGAGTCC ATCTCAATGA TTCACAAGAA CTTATATACT
TCACCAAATA TTGGGCAACT TGATGTTGTT GATTATGTTA ATAATTTAGC TACGAGTATT
TTAATATCCT ATCAATTAGA GCCAGGGAGA ATCAGTTTAG AAACTCATAT TCACCCGGTT
GATTTAAATC TTGATCAAGC CATTGCCTGC GGTTTAATTA TCAATGAACT AATTTCCAAT
TCACTGAAAC ACGCTTTTCC TCAAAATACA ACAGGTGCAA TAAAAATTGA TTTACAAAAA
GTCGATGACA AAATTGAGAT GACTATTCAA GATAATGGTA TCGGTTTACC AGATAATTTA
GATTGGCGGT ATACAGATTC TTTAGGCCTT TCGCTAGTTC ATGACTTAGT AACAGAACAA
CTAGAAGGCA CTGTTAGTAT CGAACGTCAA CCAGGAACTA CATTTAAAAT CCAATTTTCG
CATTAA
 
Protein sequence
MQLIINYSSN KAIMLFTDSE SLRLEALYQY CILDTPPEEV FDDLVNIAAD SCNTPIALIS 
ILDSQREWFK SKVGISELEI PRDICLGIHT IGQNDILIIP DTWQDERFVE NPLVRQKPSA
FRFYGGVPLI NSEGFALGCL AVIDFTPRNL SLKEQQILQR LARQIIRQLE LHKKRINHES
SFNAHLLFTN NPRPMWICEM QSLQILDVNQ AAITQYGYSK TEFLQMQFAQ VFVPEFISDL
IRDIEQEYSQ LPFLMECQHR LRGGQVIDVE LAINYIEYSG YQACLVDTIN ITEHIQIERN
LQKSEARVRT ILEAIPVPLV ISRVDDGLIL YTNSEFLQTF QLSGNDLINH YAADLYENSE
DRQQILEALS QHGSLQNYDI QFKKSDGTSF WAIASIQYLN FNNEYAILTV LYDITERKNI
EAKLQEQNAL LQSIFAGIPL MIALISPEGQ VQWINQELER LLGWSLRDYQ TLDIFAELYP
QPEYRQSVIK FMQSGECIWG DFRTQTRYGQ VLDTSWINIK LADGRIIGIG QEITERKQTE
RALKGQIERE QLMRAVAQRI RQSLNLQNIL NATVKEIKDL LGVDRVVVYQ FAPDMSGKIV
AESVKPGWKI ALGADIQDNC FQSGAGADYC QGHKRAIANI YTAGLTDCHL HLLEQFQVKA
NLVVPILLEV SEGNTVPQLW GLLIAHQCST PRDWEAHELD LLDQLSVPIA IAIQQSSILQ
QAQNELAERQ KVEVRLRSAL AEKEVLLKEV HHRVKNNLQI VSGLLLLHSQ TLKDPELIRT
LQESQNRIES ISMIHKNLYT SPNIGQLDVV DYVNNLATSI LISYQLEPGR ISLETHIHPV
DLNLDQAIAC GLIINELISN SLKHAFPQNT TGAIKIDLQK VDDKIEMTIQ DNGIGLPDNL
DWRYTDSLGL SLVHDLVTEQ LEGTVSIERQ PGTTFKIQFS H