Gene Ava_4401 details


Gene Information

Locus tag: Ava_4401
Symbol:
ID: 3680528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5514154
End bp: 5516079
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 41%
IMG OID: 637719754
Product: multi-sensor Signal transduction histidine kinase
Protein accession: YP_324894
Protein GI: 75910598
COG category: [T] Signal transduction mechanisms
COG ID: [COG2203] FOG: GAF domain
        [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
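
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon, neither of which is stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 5514154
end_bp = 5516079
gene_length_bp = 1926
protein_length_aa = 641


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks print True when the table fields agree with each other.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions the reported gene length matches the coordinate span, and the protein length matches the codon count minus the stop codon.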


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000108821
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000304156
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGACCAT CAAATAATCA AGTTGCCAAA CTACCAGTTC ATCAGACAAA AGCGCCTCTG 
CAAATTGGAG AATCTGGAAA AAAATACTTT AACTTCAAAG TTAGTTTAGA AACCGAAAAC
AATTTATATC AAGTTATAGA AAATTTTCAT AAAATTATTA TTGTATTCAC GAGTGATGGT
CATGTTTGCT ACACTTCACC TAATGTCACA GAAACTTTAG GGTATGAAGT AATAGAACTG
GAAGGTAAAT CTTTCGCTTC TTTTGTCCAC AATGATGATG TGCGACTATT TACTGATTAT
TTATCCACAG TTGCCAAATC TGGCAACAAG CATCAGCCGC TAGAATACCG GATAAAAGCC
AAAGATGGTA GCTGGCGATG GCAGGAAATT AGCACATCTG TCTTTAAAGA TGAAAATGGT
AATGTTGTGT ATTTTGTTGG CATTACTCAC GATATAACTG ACCGCAAACT TACAGAAGCG
GCGCTAGCAG AAAGAATCTT GTTAGCTAAC TTTCGGACAG CAATTGATAA TGTTTTTTCT
CAAAATCATA CATTACAACA GTTAATGCGT GGCTGTACTG AGACTATGGT GACACATCTC
AATGCAGCCT TTGCCCGCAT CTGGACACTA AATAAACAAA ATAACATCCT CGAATTGCAA
GTTAGTTCGG GGATGTATAC CCACATCGAT GGCCCCCATA GATTTGTACC AGTCGGTAAA
TTCAAAATAG GGTTAATCGC CGAAGAAGCC AAACCCCATC TCACCAACTC TGTACAAACC
GACCCCCGTG TAGGGAATAA AGAGTGGGCA AAGCAAGAGG GAATGATTGC CTTTGCTGGC
TATCCCTTGA TTGTGGAAGG AGAGATATTA GGGGTCATCG CCATGTTCTC TCGCCAAGTA
CTGAGCGAAT CTACCTTTGA AGCTTTGAGA ATTACAGCTC ATGAAGTTGC TATCGGCATT
AAGCGCAAGC AGATTGAAGA AGAACTAAGA AAATCCGAAG CTAAATACCG AGAAATTGCC
CAAGCGTCCC AAGAAAAAGC CCAAAAATTA GAAGCAGCTT TATGGGAACT CCAACAAACC
CAGGCACAAT TAATTCAAAC GGAGAAAATG TCCAGTTTAG GACAGTTAGT CGCGGGTGTT
GCCCATGAGA TTAATAATCC CGTGAATTTT ATCTACGGTA ATATCACCCA TACCCGTGAA
TATATAGAGG ATTTGCTTTA CTTGGTAAAA CTCTATCAAA GTCACTACAA CCCGGTAGCA
CCAGAAATCC TAGACCATAT CTACGGGATG GATTTAGAGT TTATTTCTCA AGATTTGCCC
AAAGTCCTCA ATTCAATGCA CATGGGAGCA GAACGTATTC GACAGATAGT CCTCTCTTTG
CGTAACTTTT CTCGCCTAGA CGAAGATGGC ATGAAAGCAG TAGATATTCA TGAAGGTATC
GATAATACAT TGCTATTATT GCAAAATCGT CTGAAAGCTA AACCAGGCTG TAGCGAGATT
CAAGTAATTA AAGAGTACGG CAACCTACCG AATATCTTAT GTCACGCCGG ACAACTCAAT
CAAGTATTTA TGAATTTACT GACTAATGCA ATTGACGCTT TGGAAGAGTC TGTTGCCAGT
AGTCAGTTGT CAGTGGTAAA TAGTAAAACA ACTAACAATC CCCGAATTCT GATTCGGACT
GAACTTACCA CCGAAAATCA GGTGATGATC TGCATTGCTG ACAACGGGAT GGGAATGGCA
GAGAAAGTTC GTAGCCAGCT ATTTGACCCT TTCTTTACCA CTAAACCCAT AGGTAAAGGC
ACTGGTATGG GACTATCAAT TAGTTACCAA ATTGTGGTGA AAAACCATCA GGGACAGTTA
CAGTGTATCT CTGCGCCAGG AAAGGGGGCT GAGTTTATCA TTACAATTCC AACTGGTGAT
GGGTGA
 
Protein sequence
MRPSNNQVAK LPVHQTKAPL QIGESGKKYF NFKVSLETEN NLYQVIENFH KIIIVFTSDG 
HVCYTSPNVT ETLGYEVIEL EGKSFASFVH NDDVRLFTDY LSTVAKSGNK HQPLEYRIKA
KDGSWRWQEI STSVFKDENG NVVYFVGITH DITDRKLTEA ALAERILLAN FRTAIDNVFS
QNHTLQQLMR GCTETMVTHL NAAFARIWTL NKQNNILELQ VSSGMYTHID GPHRFVPVGK
FKIGLIAEEA KPHLTNSVQT DPRVGNKEWA KQEGMIAFAG YPLIVEGEIL GVIAMFSRQV
LSESTFEALR ITAHEVAIGI KRKQIEEELR KSEAKYREIA QASQEKAQKL EAALWELQQT
QAQLIQTEKM SSLGQLVAGV AHEINNPVNF IYGNITHTRE YIEDLLYLVK LYQSHYNPVA
PEILDHIYGM DLEFISQDLP KVLNSMHMGA ERIRQIVLSL RNFSRLDEDG MKAVDIHEGI
DNTLLLLQNR LKAKPGCSEI QVIKEYGNLP NILCHAGQLN QVFMNLLTNA IDALEESVAS
SQLSVVNSKT TNNPRILIRT ELTTENQVMI CIADNGMGMA EKVRSQLFDP FFTTKPIGKG
TGMGLSISYQ IVVKNHQGQL QCISAPGKGA EFIITIPTGD G
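
Both sequences are printed in space-separated blocks of ten characters, and the record lists translation table 11. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below strips the block spacing and translates a CDS codon by codon. It assumes the input is a complete, in-frame CDS. The start-codon set is a small subset chosen for illustration, and all function names are hypothetical. Table 11 shares its codon assignments with the standard genetic code, so the standard 64-codon layout is used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of table 11 initiator codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_line = "GTGAGACCAT CAAATAATCA AGTTGCCAAA CTACCAGTTC ATCAGACAAA AGCGCCTCTG"
print(translate_cds(first_line))  # MRPSNNQVAKLPVHQTKAPL
```

The translated fragment matches the first two blocks of the protein sequence, with the GTG start codon read as methionine. Passing the full gene sequence to gc_fraction would give a value to compare against the GC content field.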