Gene Information |
Locus tag | Ava_0062 |
Symbol | |
ID | 3683474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 76513 |
End bp | 79437 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637715389 |
Product | CheA Signal transduction histidine Kinases (STHK) |
Protein accession | YP_320583 |
Protein GI | 75906287 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
|
|
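The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 76513, End bp 79437.
length = gene_length_bp(76513, 79437)
print(length)                     # 2925, matching "Gene Length"
print(protein_length_aa(length))  # 974, matching "Protein Length"
```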
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.172563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.120258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAAG ATAAAGAATT AGAAATCCAG ATGCAGTTTC TGGAAGAAGC TACAGATTAC ATTAATACTC TAGAAGGGGT ATTACTAGAA ATCGACACAA GTCACCGCAT TGACTTAGAA AAAATTAACG GCGCATTACG CGCTGCTCAC TCAATTAAGG GCGGGGCAGC GATGATGGGA TTTCGCGCTT TAAGTGATTT ATCCCATCGC TTGGAAGATG CTTTCAAGGT TTTAAAAACC CGCAAAAATT CCCTAGAAAT TGATACCAAA TTACAAAGTT TATTACTTTC TGCTGTTGAT TGGTTACGGC AAATTGTCCA GTTATTAGGA GAAGGAAATG CCGTAGAAGA AGATTGGTTA GCGACATTTT GTCACCCAGT CTTTGATGAA TTGCGCGATC GCCTCGGTGA CCCTACTCCA GAAGATGCTG CAACCATGCT ATCACCAGAA GAGGGGCAAG ATATCATTCC TCTATTATTT GAATCAGAAA TCGAAGGATG TCTCCAGCGT CTAGAGTCGG TATTGGCAGA TCAAAACCAA CCTTGTTTAC AAGAAGAAGT CGCCATCATG GCGGCGGAAT TGGGCGGTTT GGGGGAAATG TTGCAGTTAC CAGCTTTTAC CCAATTATGT GAATCTATTA AATATCATAT AGAAAATGCT AACGGTAATG TCTCAGAAAT TGCCCAATTA GCTTTACAAA CATGGCGGCG ATCGCAAGCT TTAGTTTTAA CAAATCAACG TGATAAATTA CCAACAGAAC TTCAGTTTGG TGAATTAGTT CACATTCCAG TATACACTCC ATCTCTACCA CAATTTCAGG CAGATAAATT ATCGGAAACA CCTATAGAAT CTAATTTTAT TCCTGAAGAT TTGACAGTAT CAAAAAATAA TAATCATCAT CCCCTCAATA TTATTGATTT CCCCGAAATA CTGACAGAAT TTGCCAAAGA AAATATCACA ACAGATTATA CATTAATTGC CAACAAAGAC CGTGAACATC AAGAAAATAC AGTCCGAGTT CCTAGTAAAC AATTAGAAGA AATAAATGAT TTATTTGGTG AATTAATTAT CCAACGTAAT GGGCTGAACT CCCAAATTGA AAGATTACGC AAACTTGTAC GCAACTTAAA CCAGCGTGTC CATACCCTAG AACGAGAAAA TCAAGAATTG CGGACAATTC ACGAGCGAAA AACCACCAAA GACTTGGTAA ATGCTCATCA TGGGGCAGAA TATCCCCAGA CCCAAGACCC AGAGAGCGAA TTTGATGCCC TAGAGATGGA TCGCTATAAT GAGTTAAACT TGCGATCGCA AACAGTAGTG GAAACTATTG TCCAAGTCCA AGAAGTCACC ACAGATATTC AACTCAGCGT TGACGACACA GACCACGTCG CCCGTAAACT CAATAAAACA TCAAAACAAT TACAAACCAA GCTCAACCAT ATCCGAATGC GCCCCATGTC GGACTTAGTT GAGCGTTTCC CCAGAGCCAT CCGCGATTTA AACGTAGAAT ATGGTAAAAA TGTTCAATTG AAAATTACAG GTGGTAACAC CTTAATTGAA CGCAGCATTT TAGAAGCCTT AAATGAGCCG TTAATGCACT TATTAAGGAA TGCCTTTGAT CATGGTATCG AAGAATCTGC CACCCGTCAG GCATTAGGCA AACCAGAACA AGGCTTAATT GAAATTACAG CCACACACCG CAGTAATCGT ACTATCATCA CCATGCGCGA TGATGGCCGG GGCATTTCTT TAGATAAAAT TCGCACCCGC GCCCAAGGAA TGGGGATAGA TGCAGCTTTA 
CTAGCCAACG CCACCGATGA AGAACTCCTA TCACTAATTT TTGAACCAGG GTTTACCACC TCCGAACAAG TCACCGCCTT ATCTGGTCGT GGTGTGGGGA TGGATGTAGT GCGTAATAAC CTCAAACTAG TACGGGGCGA TATTACAGTT GATACAGAAT TAGGCGTTGG TACAACCTTT ACCTTATCAG TACCATTTAC CCTTTCCATC GCCAGAGTTT TGCTCATAGA AACCAACAAA ATGCTGTTAG CATTTCCTAC CGATGTGGTT TCGGAAATCT TCTTATTGCA AAACGAGCAA GTATTCCCAA TGGCAGGAAA TGAAGTCCTT AATTGGCAAG GGACGATGCT CCCCCTAATT CGCCTCGGTC GCCACCTAGA ATTTAACTGT CCCCGCTACG ATAGCCCAGA ACTAGAAACA CCCCCAGCAA TTAACGCCCA TAGCGTCTTA ATAGTTAAAG GCAATAATCA ACCAGTTGCT ATATTGGTAG ACCGTTGCTG GGGTGAACAG GAAGTTGCTA TCCGCCAAGT CGAGGGCAAT ATACCCTTAC CCCAAGGCTT TAGTAATTGC ACCATTTTAG GTGATGGTCG AGTTGTGGCA TTAGTTAATA CCAACGAACT CCTGTATAAA ATTGCCACCA ATCAACACCC CACCAAAAAT CTTCAATCAT CATCAGCTAC ACTAAAAACG CCCTTCCTCT TCTTTGATAG TGCCAAACTA CCAGCACCCC CCACTCAAAA CAAAGGCACA ATTTTAATCA TTGATGACTC CATTAATGTG CGTCGCTTCC TAGCATTAAC TTTAGAAAAA GGAGGATATC AAGTAGAACA AGCAAAAGAT GGTGAAGATG CTTGGGAAAA ATTGGAGAGT GGCTTAAAAG TGAAAGCTGT AATTTGTGAT ATTGAAATGC CACGCCTTGA TGGCTACGGC TTTTTAGACC GGATTAACTC CAACGTGGAC ACAAAAAATA TCCCAGTAGC CATGCTCACC TCTCGTAGTA GCAATAAACA TCGGCAACTA GCCATGCAGC TAGGCGCTAG AGCTTATTTT TCTAAACCTT ACAATGAGCA AGAATTACTC GGTACCCTAG AAGAACTAAT CTTTAATGTA GCGGTAAGTA GTTAG
|
Protein sequence | MSKDKELEIQ MQFLEEATDY INTLEGVLLE IDTSHRIDLE KINGALRAAH SIKGGAAMMG FRALSDLSHR LEDAFKVLKT RKNSLEIDTK LQSLLLSAVD WLRQIVQLLG EGNAVEEDWL ATFCHPVFDE LRDRLGDPTP EDAATMLSPE EGQDIIPLLF ESEIEGCLQR LESVLADQNQ PCLQEEVAIM AAELGGLGEM LQLPAFTQLC ESIKYHIENA NGNVSEIAQL ALQTWRRSQA LVLTNQRDKL PTELQFGELV HIPVYTPSLP QFQADKLSET PIESNFIPED LTVSKNNNHH PLNIIDFPEI LTEFAKENIT TDYTLIANKD REHQENTVRV PSKQLEEIND LFGELIIQRN GLNSQIERLR KLVRNLNQRV HTLERENQEL RTIHERKTTK DLVNAHHGAE YPQTQDPESE FDALEMDRYN ELNLRSQTVV ETIVQVQEVT TDIQLSVDDT DHVARKLNKT SKQLQTKLNH IRMRPMSDLV ERFPRAIRDL NVEYGKNVQL KITGGNTLIE RSILEALNEP LMHLLRNAFD HGIEESATRQ ALGKPEQGLI EITATHRSNR TIITMRDDGR GISLDKIRTR AQGMGIDAAL LANATDEELL SLIFEPGFTT SEQVTALSGR GVGMDVVRNN LKLVRGDITV DTELGVGTTF TLSVPFTLSI ARVLLIETNK MLLAFPTDVV SEIFLLQNEQ VFPMAGNEVL NWQGTMLPLI RLGRHLEFNC PRYDSPELET PPAINAHSVL IVKGNNQPVA ILVDRCWGEQ EVAIRQVEGN IPLPQGFSNC TILGDGRVVA LVNTNELLYK IATNQHPTKN LQSSSATLKT PFLFFDSAKL PAPPTQNKGT ILIIDDSINV RRFLALTLEK GGYQVEQAKD GEDAWEKLES GLKVKAVICD IEMPRLDGYG FLDRINSNVD TKNIPVAMLT SRSSNKHRQL AMQLGARAYF SKPYNEQELL GTLEELIFNV AVSS
|
| |
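The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own pipeline: it builds a codon table in NCBI base order (T, C, A, G) and translates until the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, so a single table serves here. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGTCAAAAG ATAAAGAATT AGAAATCCAG"))  # MSKDKELEIQ
print(translate("GCGGTAAGTAGTTAG"))                   # AVSS
```

The two outputs agree with the first ten and last four residues of the listed protein sequence. Running the full gene sequence through `translate` would be the complete check.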