Gene Ava_1149 details


Gene Information

Locus tag: Ava_1149
Symbol:
ID: 3683403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1405817
End bp: 1407232
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 42%
IMG OID: 637716485
Product: histidine kinase
Protein accession: YP_321668
Protein GI: 75907372
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
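
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 1405817
end_bp = 1407232

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed 1416 bp and 471 aa.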


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.116621
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGACT CTCTGCATAA GGATTTGTCC GTTTATCAGC TGGCTTTAGG AGTGGAAGCG 
CCTTCTCAAA CAGTGACTCT CAGTCCGGCG ACCTTGCTAT CACTGGTGAG AGCGCAAATA
GACTTACTGA TTGAGCAGCA ACTCAGTGCA ACTTTATGGA TTAAGCTGCC ACCAGGAAAA
GTTTGGTCTA CAGAAATAGT CCGTTATCAG TCATCGGTGG ATAACTCTGG TCGTATTTAC
AATTGCCAAG TTGGAGAAAG TAAATCAGAG GTGGGAGAAG AAAATTTAAC ATCTTTATCT
TGTTCGTATG AGCATTATTT TGATGTGCCA TTACTACCAA ATAATCACTT ACAACGAGAA
TATTTTTTGG TAGTCTTATC GACAAAGTTT TGCAGTTTAA TAGCAGCTTA TCGCCCACGT
CAAAATCACA AAATTGGCCC CTTGGGTAAG ACACAAAATA ATAAAAGTCA ACCGCTATTA
ACGATTACTT CGCTTGAACA TCAAGTTATT CAGCGTGTGT TAGATGGGAT AAAACAAGAG
GTTACCTCTG AATTAAAGCC CATAGCACCA AGCGATTTTA TCTGCTCCAC TGCTTGCGAG
CCAGCCCTCA TCAGCCAAAT GCTCACCAAG CAACTCCAAC GACAAAACGA AATTAATCGT
CGCATCAGTG TTGAACGCAT TGCTAAATTG CAGCAGCACA ATCAAAAATT ACAAACAAAA
GAGCAATTGA AAGATGAATA TTTAACTAAT GTGTGTCAAG AACTGCGTAC ACCCCTGACG
CAAATGAAAA CTGCGCTTTC ACTGTTGAAT TCCCCCACCC TGAAACCACA CCAAAGACAG
AGGTATTTGC AGATGTTAAA TACCCAGTGC GATCGCCAAA GTGTTTTGAT TGCCGGTCTC
ATGACTTTGG TAGAACTAGA GCATAATTTG GAAACAACAA CTTTAGAACT CGTCAAACTT
GCAGATATTG TCCCTGGAGT CGTCAGCACT TACCAACCAG TAGCCCAAGA AAAAGGCATC
ATGCTAGGCT ATACCATCCC TACAGACTTG CCAAGCGTTT TATGTGTCAA TGGTGGCTTG
CGACAAATAG TCATTAATCT GCTACACAAC AGCCTCAAGT TTACAGCCAA TGGTGGTCGA
GTGTGGGTGA GAGCTAGAGT CCAAGGCGAA TACGTCATAC TAGAAATTCG TGACACAGGT
ATCGGTATTG CTGAAAGCGA AATTCCCAAA ATATTCGACT GCTTTTATCG TGTGCGATCG
GGACTAATTG ATGAGACGAA TGGCGCAGGT TTAGGACTCA CAATTGTCCA GAGATTGTTA
TGGCATTGTG GTGGTTCTGT TAATGTGAGA AGCAAGGTTG ATGAAGGTAC TATGGTAATA
GTGCAAATGA AGATAGGAAG CACCTCGCCA ACTTAG
 
Protein sequence
MNDSLHKDLS VYQLALGVEA PSQTVTLSPA TLLSLVRAQI DLLIEQQLSA TLWIKLPPGK 
VWSTEIVRYQ SSVDNSGRIY NCQVGESKSE VGEENLTSLS CSYEHYFDVP LLPNNHLQRE
YFLVVLSTKF CSLIAAYRPR QNHKIGPLGK TQNNKSQPLL TITSLEHQVI QRVLDGIKQE
VTSELKPIAP SDFICSTACE PALISQMLTK QLQRQNEINR RISVERIAKL QQHNQKLQTK
EQLKDEYLTN VCQELRTPLT QMKTALSLLN SPTLKPHQRQ RYLQMLNTQC DRQSVLIAGL
MTLVELEHNL ETTTLELVKL ADIVPGVVST YQPVAQEKGI MLGYTIPTDL PSVLCVNGGL
RQIVINLLHN SLKFTANGGR VWVRARVQGE YVILEIRDTG IGIAESEIPK IFDCFYRVRS
GLIDETNGAG LGLTIVQRLL WHCGGSVNVR SKVDEGTMVI VQMKIGSTSP T
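
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below is not the database's own tooling. It builds the codon table by hand and translates a DNA string up to the first stop codon, using the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the inputs are the first and last lines of the gene sequence shown above.

```python
from itertools import product

# Codon assignments in the usual T, C, A, G ordering. Table 11 shares these
# assignments with the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 36 bases of the gene sequence above.
first_block = clean("ATGAATGACT CTCTGCATAA GGATTTGTCC")
last_block = clean("GTGCAAATGA AGATAGGAAG CACCTCGCCA ACTTAG")

print(translate(first_block))  # MNDSLHKDLS
print(translate(last_block))   # VQMKIGSTSPT
```

Both fragments match the start and end of the listed protein sequence. Applying `gc_content` to the full cleaned gene sequence would give the value to compare against the reported GC content.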