Gene Ava_0062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0062 
Symbol 
ID3683474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp76513 
End bp79437 
Gene Length2925 bp 
Protein Length974 aa 
Translation table11 
GC content42% 
IMG OID637715389 
ProductCheA Signal transduction histidine Kinases (STHK) 
Protein accessionYP_320583 
Protein GI75906287 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.172563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.120258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAG ATAAAGAATT AGAAATCCAG ATGCAGTTTC TGGAAGAAGC TACAGATTAC 
ATTAATACTC TAGAAGGGGT ATTACTAGAA ATCGACACAA GTCACCGCAT TGACTTAGAA
AAAATTAACG GCGCATTACG CGCTGCTCAC TCAATTAAGG GCGGGGCAGC GATGATGGGA
TTTCGCGCTT TAAGTGATTT ATCCCATCGC TTGGAAGATG CTTTCAAGGT TTTAAAAACC
CGCAAAAATT CCCTAGAAAT TGATACCAAA TTACAAAGTT TATTACTTTC TGCTGTTGAT
TGGTTACGGC AAATTGTCCA GTTATTAGGA GAAGGAAATG CCGTAGAAGA AGATTGGTTA
GCGACATTTT GTCACCCAGT CTTTGATGAA TTGCGCGATC GCCTCGGTGA CCCTACTCCA
GAAGATGCTG CAACCATGCT ATCACCAGAA GAGGGGCAAG ATATCATTCC TCTATTATTT
GAATCAGAAA TCGAAGGATG TCTCCAGCGT CTAGAGTCGG TATTGGCAGA TCAAAACCAA
CCTTGTTTAC AAGAAGAAGT CGCCATCATG GCGGCGGAAT TGGGCGGTTT GGGGGAAATG
TTGCAGTTAC CAGCTTTTAC CCAATTATGT GAATCTATTA AATATCATAT AGAAAATGCT
AACGGTAATG TCTCAGAAAT TGCCCAATTA GCTTTACAAA CATGGCGGCG ATCGCAAGCT
TTAGTTTTAA CAAATCAACG TGATAAATTA CCAACAGAAC TTCAGTTTGG TGAATTAGTT
CACATTCCAG TATACACTCC ATCTCTACCA CAATTTCAGG CAGATAAATT ATCGGAAACA
CCTATAGAAT CTAATTTTAT TCCTGAAGAT TTGACAGTAT CAAAAAATAA TAATCATCAT
CCCCTCAATA TTATTGATTT CCCCGAAATA CTGACAGAAT TTGCCAAAGA AAATATCACA
ACAGATTATA CATTAATTGC CAACAAAGAC CGTGAACATC AAGAAAATAC AGTCCGAGTT
CCTAGTAAAC AATTAGAAGA AATAAATGAT TTATTTGGTG AATTAATTAT CCAACGTAAT
GGGCTGAACT CCCAAATTGA AAGATTACGC AAACTTGTAC GCAACTTAAA CCAGCGTGTC
CATACCCTAG AACGAGAAAA TCAAGAATTG CGGACAATTC ACGAGCGAAA AACCACCAAA
GACTTGGTAA ATGCTCATCA TGGGGCAGAA TATCCCCAGA CCCAAGACCC AGAGAGCGAA
TTTGATGCCC TAGAGATGGA TCGCTATAAT GAGTTAAACT TGCGATCGCA AACAGTAGTG
GAAACTATTG TCCAAGTCCA AGAAGTCACC ACAGATATTC AACTCAGCGT TGACGACACA
GACCACGTCG CCCGTAAACT CAATAAAACA TCAAAACAAT TACAAACCAA GCTCAACCAT
ATCCGAATGC GCCCCATGTC GGACTTAGTT GAGCGTTTCC CCAGAGCCAT CCGCGATTTA
AACGTAGAAT ATGGTAAAAA TGTTCAATTG AAAATTACAG GTGGTAACAC CTTAATTGAA
CGCAGCATTT TAGAAGCCTT AAATGAGCCG TTAATGCACT TATTAAGGAA TGCCTTTGAT
CATGGTATCG AAGAATCTGC CACCCGTCAG GCATTAGGCA AACCAGAACA AGGCTTAATT
GAAATTACAG CCACACACCG CAGTAATCGT ACTATCATCA CCATGCGCGA TGATGGCCGG
GGCATTTCTT TAGATAAAAT TCGCACCCGC GCCCAAGGAA TGGGGATAGA TGCAGCTTTA
CTAGCCAACG CCACCGATGA AGAACTCCTA TCACTAATTT TTGAACCAGG GTTTACCACC
TCCGAACAAG TCACCGCCTT ATCTGGTCGT GGTGTGGGGA TGGATGTAGT GCGTAATAAC
CTCAAACTAG TACGGGGCGA TATTACAGTT GATACAGAAT TAGGCGTTGG TACAACCTTT
ACCTTATCAG TACCATTTAC CCTTTCCATC GCCAGAGTTT TGCTCATAGA AACCAACAAA
ATGCTGTTAG CATTTCCTAC CGATGTGGTT TCGGAAATCT TCTTATTGCA AAACGAGCAA
GTATTCCCAA TGGCAGGAAA TGAAGTCCTT AATTGGCAAG GGACGATGCT CCCCCTAATT
CGCCTCGGTC GCCACCTAGA ATTTAACTGT CCCCGCTACG ATAGCCCAGA ACTAGAAACA
CCCCCAGCAA TTAACGCCCA TAGCGTCTTA ATAGTTAAAG GCAATAATCA ACCAGTTGCT
ATATTGGTAG ACCGTTGCTG GGGTGAACAG GAAGTTGCTA TCCGCCAAGT CGAGGGCAAT
ATACCCTTAC CCCAAGGCTT TAGTAATTGC ACCATTTTAG GTGATGGTCG AGTTGTGGCA
TTAGTTAATA CCAACGAACT CCTGTATAAA ATTGCCACCA ATCAACACCC CACCAAAAAT
CTTCAATCAT CATCAGCTAC ACTAAAAACG CCCTTCCTCT TCTTTGATAG TGCCAAACTA
CCAGCACCCC CCACTCAAAA CAAAGGCACA ATTTTAATCA TTGATGACTC CATTAATGTG
CGTCGCTTCC TAGCATTAAC TTTAGAAAAA GGAGGATATC AAGTAGAACA AGCAAAAGAT
GGTGAAGATG CTTGGGAAAA ATTGGAGAGT GGCTTAAAAG TGAAAGCTGT AATTTGTGAT
ATTGAAATGC CACGCCTTGA TGGCTACGGC TTTTTAGACC GGATTAACTC CAACGTGGAC
ACAAAAAATA TCCCAGTAGC CATGCTCACC TCTCGTAGTA GCAATAAACA TCGGCAACTA
GCCATGCAGC TAGGCGCTAG AGCTTATTTT TCTAAACCTT ACAATGAGCA AGAATTACTC
GGTACCCTAG AAGAACTAAT CTTTAATGTA GCGGTAAGTA GTTAG
 
Protein sequence
MSKDKELEIQ MQFLEEATDY INTLEGVLLE IDTSHRIDLE KINGALRAAH SIKGGAAMMG 
FRALSDLSHR LEDAFKVLKT RKNSLEIDTK LQSLLLSAVD WLRQIVQLLG EGNAVEEDWL
ATFCHPVFDE LRDRLGDPTP EDAATMLSPE EGQDIIPLLF ESEIEGCLQR LESVLADQNQ
PCLQEEVAIM AAELGGLGEM LQLPAFTQLC ESIKYHIENA NGNVSEIAQL ALQTWRRSQA
LVLTNQRDKL PTELQFGELV HIPVYTPSLP QFQADKLSET PIESNFIPED LTVSKNNNHH
PLNIIDFPEI LTEFAKENIT TDYTLIANKD REHQENTVRV PSKQLEEIND LFGELIIQRN
GLNSQIERLR KLVRNLNQRV HTLERENQEL RTIHERKTTK DLVNAHHGAE YPQTQDPESE
FDALEMDRYN ELNLRSQTVV ETIVQVQEVT TDIQLSVDDT DHVARKLNKT SKQLQTKLNH
IRMRPMSDLV ERFPRAIRDL NVEYGKNVQL KITGGNTLIE RSILEALNEP LMHLLRNAFD
HGIEESATRQ ALGKPEQGLI EITATHRSNR TIITMRDDGR GISLDKIRTR AQGMGIDAAL
LANATDEELL SLIFEPGFTT SEQVTALSGR GVGMDVVRNN LKLVRGDITV DTELGVGTTF
TLSVPFTLSI ARVLLIETNK MLLAFPTDVV SEIFLLQNEQ VFPMAGNEVL NWQGTMLPLI
RLGRHLEFNC PRYDSPELET PPAINAHSVL IVKGNNQPVA ILVDRCWGEQ EVAIRQVEGN
IPLPQGFSNC TILGDGRVVA LVNTNELLYK IATNQHPTKN LQSSSATLKT PFLFFDSAKL
PAPPTQNKGT ILIIDDSINV RRFLALTLEK GGYQVEQAKD GEDAWEKLES GLKVKAVICD
IEMPRLDGYG FLDRINSNVD TKNIPVAMLT SRSSNKHRQL AMQLGARAYF SKPYNEQELL
GTLEELIFNV AVSS