Gene Ava_4086 details

Gene Information

Locus tag: Ava_4086
Symbol:
ID: 3681609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5077980
End bp: 5079218
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 42%
IMG OID: 637719437
Product: histidine kinase
Protein accession: YP_324585
Protein GI: 75910289
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
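
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not say so explicitly) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5077980
end_bp = 5079218
reported_gene_length = 1239      # bp
reported_protein_length = 412    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the final codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # coordinates match Gene Length
print(remainder == 0)                             # length is a whole number of codons
print(protein_length == reported_protein_length)  # matches Protein Length
```

Under these assumptions all three checks agree with the reported values: 1239 bp is 413 codons, that is 412 amino acids plus one stop codon.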


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.3591
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGTAG TTATGGGACT GTCATCGGTG GTAGTGTATC ACTTTTTTGC TTATAGTCTC 
AGTCAGCAAT TAGATAGGCA GTTGTTGACG TTGGCGGATG CAGCAGCTCA TAATTTATCG
GCTATTAAGG TGGATAAGAT GGCTGTAAAC CGCAAGATGC CGCGAATTTT AGATAATGAT
GGGGATTTAG ATATTCCTTG GCAAGATTTA CGGTTATATC GTCAGAGTGT GGAATGGTTT
GATGCTGGGC AGCAATTATT AGGAAAGGCA GGAAAGCCAT TTCCCGAAAC ACCATTTCTA
ACTAATTTTC ACTCATGGCA GCAAAATGGC ATCAGGATAT TAACTATTCC GGTTTATTCT
TCCAGAAAAA ATCAACAACT TTTAGGTTAT GTGCGTGTCA GTGCATCAAC AGTTGAAATA
CAAAAAGAAC TGGAGAGACT GTTGATGGGT TTGGGTATTG GTGGTGTTTT GGGGATGGTT
TTAATTAGTG GTACAGGTTG GTGGCTAACG AGTAAAGCCT TGCAACCGAT TGAGCAGAGT
TTCCAGCAAT TACAACAGTT TACGGCGGAT GCGTCCCATG AATTACGCAG TCCTCTGACG
GCGATTAAAA CGACTGTGGA AGTTATCCAA AGTCACCCAG AACGTATTCA TCCCAGTGAT
GTCAAAAAAA TCGACATCAT AGAGGGTGCA ACACAGCAGA TGACGCACTT AGTAGAGGAT
TTACTATTGT TAGCCAGAAG TGATTCTGCA CCTATAAGTT TGCCTAAAAC CGCAATTCCC
ATACCCATAG ATGAAATTTT AATTGATTTA ATTGATACCT TACAGCCGCA GGCAAAATCT
CAAGAAATTA CTTTAGAGGC TAACTTGATT GATGCGGTGT GGGTAAAGGG GGATGCACAT
CAGTTACAAC GACTATTTGG TAATTTATTA GAAAATGCCC TGCAATATAC GTCTAATGGT
GGTTTAGTCA GGGTAGAAAT CGTTAAAAGG GATGATTTTG TAGTGATTGA AGTGGCAGAT
ACTGGTATTG GTATCGCACC TGAAAATCTG CCTTTTGTAT TTAATCGCTT TTGGCGAGCT
GAAAAAGCCC GTTCTCGTCG TCAAGGTGGT TCGGGTTTGG GTTTAGCTAT TGCCCAAGCT
ATTACTCATG CTCATGGTGG TGAGATTTCT GTGACGAGTA AAGTCGGTGT GGGAAGTTGT
TTTCGCGTGA AGTTACCAGT ATTTAGGTTG GGCAATTAG
 
Protein sequence
MMVVMGLSSV VVYHFFAYSL SQQLDRQLLT LADAAAHNLS AIKVDKMAVN RKMPRILDND 
GDLDIPWQDL RLYRQSVEWF DAGQQLLGKA GKPFPETPFL TNFHSWQQNG IRILTIPVYS
SRKNQQLLGY VRVSASTVEI QKELERLLMG LGIGGVLGMV LISGTGWWLT SKALQPIEQS
FQQLQQFTAD ASHELRSPLT AIKTTVEVIQ SHPERIHPSD VKKIDIIEGA TQQMTHLVED
LLLLARSDSA PISLPKTAIP IPIDEILIDL IDTLQPQAKS QEITLEANLI DAVWVKGDAH
QLQRLFGNLL ENALQYTSNG GLVRVEIVKR DDFVVIEVAD TGIGIAPENL PFVFNRFWRA
EKARSRRQGG SGLGLAIAQA ITHAHGGEIS VTSKVGVGSC FRVKLPVFRL GN
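
The sequence blocks above are printed in groups of ten characters, so they need to be joined before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, translated, and checked for GC content. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code (the tables differ in which codons may act as start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Simplification: an alternative start codon (for example GTG or TTG)
    is not converted to M here. This gene starts with ATG.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = clean(
    "ATGATGGTAG TTATGGGACT GTCATCGGTG GTAGTGTATC ACTTTTTTGC TTATAGTCTC"
)
print(translate(first_line))  # MMVVMGLSSVVVYHFFAYSL
```

The printed peptide matches the first twenty residues of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the reported GC content field.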