Gene Ava_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1628 
Symbol 
ID3681872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2042468 
End bp2045548 
Gene Length3081 bp 
Protein Length1026 aa 
Translation table11 
GC content43% 
IMG OID637716968 
Productmulti-sensor Signal transduction histidine kinase 
Protein accessionYP_322146 
Protein GI75907850 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.677715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.19967 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGCTCT CCAATCTTTT TAGGAAAAGG TCAGCTAACA ACGAAAAGCT GAAAATTTCC 
TTACAATTTA TCCTCACCGT TCCCTTTGTG CTGTTAATAG GGGGAACAAC AGGTTTAGTC
AGTTATGTAT CATGGCAAAA TTCCCAAAAT TCAGTCAATA GCTTGGCATA TCAGTTGATG
AATGAGATGA GCGATCGCAT TCATTTATAT TTAAGTAATT ATTTAAAGAC TCCACATTTA
ATTAACCGTC TCAATGTTCA AGCCAGCACA CTCCAACATA TAGACGTTAC CAACCCTCAA
AGTTTAGAAC GTTATTTTTT TGCTCAGGTT CAAGAATTTG CTTCTCTGAG AGTCCATTTT
ATCAATCCTC AAGGGGGACT GATTGGAGCC GGCAACGATG AACGGGGTGT TACCATTTCC
TCGACCAAAG ATTTTAAAAA AGGAGAGCTT TATGTGTATA GCGTAGATAG TCAAAGAAAA
CGGAAGAAAT TATTAGTTCA TCAGCATAAT TATGATGGCA CTCAAAGACC TTTCTATCAA
CAGGCGCTCT CCACAGGCAA ACCAACGTGG ACATCGGTTT ATTTATATGT ACCGACTTCC
AGAGGCTTAG GAATTGCTGC TAGCTACCCG CTTTATAACC AAAGACAAGA ACTTCTGGGG
GTTTTCACCA GCGATATAGA CCTTGTGAGC ATTAGTAAGT TCCTCCAGCA ATTGCGGGTG
GGTACTCACG GACAGGTATT TATTATGGAG CGTTCAGGAT TAATGGTTGC GTCTTCAACC
CCTGAACAAC CATTCCTGAC AGGTATGGGT GGGACACAAA ATCAACGGCT CCAGGTGATA
CAAAGTCAGC AACCCCTAAT TCGTTTAGCG GGTGAACATC TGCGATCGCA CTTTGGTAAT
TTAGCTCAAA TTCAGACCGC AAAGCAGCTC AATTTTGACA TCAAAGGCAA AAAGCAATTT
CTTCTCGTTA TCCCTTACAA CGACCAATTA GGACTTGACT GGTTGATTGT GACAGTGATC
CCCGCCTCAG ACTTTACCGC AGAAATTGAT GCCAATACAC GCCTAACAAT AATTTTTACT
ATTGGGGCTT TAGCAGGAGC GATCGCTTTA GGATTATTTT TGACCCAATT TATTATCCGA
TCAATGGAAC AATTGGGTCA AACTAGCTTG GCGCTCTACA ACGAGCTGCA CTTACGCAAA
ATTGCCGAAT TAGAACTGCG ACGGCAAAAA GACTTGTGTG AAAGCATTTA TAATGAATCG
GCTGATGCTC TGTTCTTAGT AGATCCGCAA ACACTGTTAA TTGCCGACTG TAATCGTCGA
GCAGTAGAAC TATTTGAAGC TGACAGTAAA AGTGAACTAA TTAGTATTAA AGTTAACACT
CTTCAACTGC AACCATTTAC CTCAGACGAA CTGGCACAAA TCACCACCCA AATCCAGCAA
AAAGGTGTTT GGAACACGGA AATTCAATAC CTGACGCGCA AAGGAAATTT GTTTTGGGGC
AACCTCGCAG CCAAGGAAGT GACGATGGTC TGCGACCGAC CGTCGGTTAT CACTAACCAA
GTGGTTTATC TAGTACGAGT AACAGATATT ACTGAGCGCA AGCGAGCCGA AACCGCCCTG
CTACAAAGTG AAGCTCGCTT TCAAAAAATT GCCGCCGCTT CTCCGGCACA GATTTATATT
TTGGCTTATT ACCCAGATAT AAATCAAATG CGTTATGAAT ATATCAGTTC CGGGGTGCAA
GAAATTCAAG AATTAGAACC CCATCAGGTT TTGGCAGATC CTCTACTGAC TTATCAACAA
GTTCATCCTG ATGATCTCGC TCTCTACAAT CAGCTAACAA CTCGTAGCCT CAAAACCCTC
AAACCCTTTG CTCATGAATG GCGAATTATT ACACCCTCTG GCAAAGTCAA GTGGGTACGC
GCCAACTCCC GGCCAGAACG TCGCTCCAAT GGTGAAATTG CTTGGTACGG AGTTGTATTA
GATATCACTG ACCTTAAACA AGCCGAGGCC GCTTTGCGTG AGAGTGAAGA GAGATTTCGC
CACGCGTTTT ATGATGCTCC CATTGGTATG GCGTTGCTGG GATTGGAGGA TCAACAATGG
TTGCAAGCCA ATCCCATGCT CAGGGAAATG TTGGGCTATT CTGAGTTAGA ATTTTTCAGC
TTCCAGGCAT TTGAGATCAT TCACCCAGAA GATATTCATC GGCTAGAAAA CTGCATCACA
CAAGTTTTGA GTAATCAGAA TCCCAGAGTT CAGGTAGAGT TGCGTTACCT GTGTAATGGA
GGACGCATTG CTTGGGGACT AACAAGCTTG TCCCTGGTGC GAGATTGCCA GAATCAACCG
TTGTACTATG TGCTGCAAAT CCAAGATATC ACCGAACAAC AAGCTATTGA ACAGATAAAA
AATGAGTTCA TTTCTATCGT CAGTCATGAA CTGCGTACCC CACTCACAGC CATTCAAGGA
TTTTTAGGAC TGTTGAACAC TGGTATATAC GACAACAAAT CACAAAAAGC CAAATGGATG
ATCCAACAAG CTTTAACAAA TAGCGATCGC CTCGTGCGGT TGGTAAATGA TATTTTGGAT
TTAGAACGTT TGTCTTCTGG GCGAGTACAA CTTGTCAAAG AAGTCTGCCA TGCCGCAGAT
TTAATGCAAC GAGCTGTAGA AGGAGTACAG TCAATCGCCC TAGCATCTGC TATCACAATT
TCCCTGACTC CCACCACCGC TTGCGTTTGG GCTTCTCCCG ACTTAATTAT TCAAACCCTC
ATCAATTTAT TGAGCAACGC CATCAAGTTT TCTCCTCATA ACTCAGTGAT TACTTTGTCT
GCTCAACCCC AATCAGACTG GGTACTGTTT AAAGTCCAAG ACCAAGGCAG AGGTATCCCT
GCCAACAAAC TAGAAACAAT ATTTGAACGT TTTCAGCAAG TGGACATCTC TGACGCTCGT
GCTAAGGGTG GTACAGGTTT AGGTTTGGCA ATTTGTCAAA GTATTATTCA ACAGCATGAT
GGTAGTATTT GGGCAGAAAG TACCCTTGGT GAAGGCAGCA CCTTTTATTT CACTTTGCCA
ATATCAGTAA AAGAACTATG A
 
Protein sequence
MRLSNLFRKR SANNEKLKIS LQFILTVPFV LLIGGTTGLV SYVSWQNSQN SVNSLAYQLM 
NEMSDRIHLY LSNYLKTPHL INRLNVQAST LQHIDVTNPQ SLERYFFAQV QEFASLRVHF
INPQGGLIGA GNDERGVTIS STKDFKKGEL YVYSVDSQRK RKKLLVHQHN YDGTQRPFYQ
QALSTGKPTW TSVYLYVPTS RGLGIAASYP LYNQRQELLG VFTSDIDLVS ISKFLQQLRV
GTHGQVFIME RSGLMVASST PEQPFLTGMG GTQNQRLQVI QSQQPLIRLA GEHLRSHFGN
LAQIQTAKQL NFDIKGKKQF LLVIPYNDQL GLDWLIVTVI PASDFTAEID ANTRLTIIFT
IGALAGAIAL GLFLTQFIIR SMEQLGQTSL ALYNELHLRK IAELELRRQK DLCESIYNES
ADALFLVDPQ TLLIADCNRR AVELFEADSK SELISIKVNT LQLQPFTSDE LAQITTQIQQ
KGVWNTEIQY LTRKGNLFWG NLAAKEVTMV CDRPSVITNQ VVYLVRVTDI TERKRAETAL
LQSEARFQKI AAASPAQIYI LAYYPDINQM RYEYISSGVQ EIQELEPHQV LADPLLTYQQ
VHPDDLALYN QLTTRSLKTL KPFAHEWRII TPSGKVKWVR ANSRPERRSN GEIAWYGVVL
DITDLKQAEA ALRESEERFR HAFYDAPIGM ALLGLEDQQW LQANPMLREM LGYSELEFFS
FQAFEIIHPE DIHRLENCIT QVLSNQNPRV QVELRYLCNG GRIAWGLTSL SLVRDCQNQP
LYYVLQIQDI TEQQAIEQIK NEFISIVSHE LRTPLTAIQG FLGLLNTGIY DNKSQKAKWM
IQQALTNSDR LVRLVNDILD LERLSSGRVQ LVKEVCHAAD LMQRAVEGVQ SIALASAITI
SLTPTTACVW ASPDLIIQTL INLLSNAIKF SPHNSVITLS AQPQSDWVLF KVQDQGRGIP
ANKLETIFER FQQVDISDAR AKGGTGLGLA ICQSIIQQHD GSIWAESTLG EGSTFYFTLP
ISVKEL