Gene Information |
Locus tag | Ava_1005 |
Symbol | |
ID | 3679917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1214166 |
End bp | 1216106 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637716340 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_321524 |
Protein GI | 75907228 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
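The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, not part of the database record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1214166
END_BP = 1216106


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1941, matching "Gene Length"
print(protein_length_aa(length))  # 646, matching "Protein Length"
```

Under these assumptions, 1941 bp corresponds to 647 codons: 646 residues plus the stop codon.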
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.442988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000217029 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCACAAC CGCTAACCGT TTTAATTATT GATGATTGTG CTGAAGATCG CCAAGTTTAT CGCCGCTATC TCCAGCAAGA CCAAGAGCAT CACTATACAA TTCTGGAACA AGAATCAGGA GAGGAGGCAC TGGAATTATG TCGGCAGTTG CAACCTGATG TAATTTTATT AGACTTTTTA CTGCCAGACC AAGATGGACT AGAACTTCTG GCTGAATTAC AAAAGCTGCT GCAAGGAAGT ATGCCAGCTA TAGTCATGTT AACGGGTTAC GGTAATGAAG CGATCGCCGT ACAAGCAATG AAAAGTGGTG TTCATGACTA TTTAGTGAAA GAACAAACGA CAGCAGAACG TTTGCGTTCC ACTCTTCACT CTGCAATTAT CCAAACGCGA TTACGCCAAG AACTTTCCCG GAGTCAGGAG CAATTACACG ACAGCCAAAA ATTTATTGAA CGCATTGCAG AAACCACTCC GGGGATTTTG TACGTTCACG ATTTAGTCGA AAAGCGGAAT GTTTATATCA ATGGTAAGGT TGGTAATTTA TTAGGTCATA CTCCCCAAAC AATTCAAAAT TTGGGAAAAG AATTTCTCAT CACATTCATG CACCCTGAGG ATTTAGTTCG GCTTCCGCAA GTATTTCAGC AGTTTGACTC AGCCAAAGAT GGAGACATCA TCGAACATGA GTATAGGATG CGACACGCTG ATGGTGGATG GTGTTGGTTT TTTAGTCGTA ATTCTGTGTT CATGAGAAAC AGTGATGGTT CACCACGCCA AATAGTCGGT ACAGCCTTTG ATATCACCGC TCGTAAGCAA GCTGAAGAAG AACTACGCTC TAGCAATGAA CGTTTCCGAC TAGCTGCCGC CGCCGTGAAT TCTCTCATTT ATGAATGGGA TATAAAAAAA GGTACGGTCA CTAGGACGGA GGGATTGACC CGGATTTTAG GCTACTCTGT GGAGGAAGCC ACCCCAAATA TTGAATGGTG GCAAGAGCAA ATTCATCCTG ACGACCAGGA CTTTTTGGTT GAGCAATTCC AAAATCCACC TGTTAATCAA ACTCATTATA CTTTTGAATA TCGCATCCGA CACAAAAACA ATCAGTATTT ATACGTATTA GATCAAGGGA TAATCACCAG AGATGAAAAT GGCCAGCCAG TGCGAGTAGT CGGTAGTGCA ACCGATATTA GCGATCGCAA ACTTGCGGAA GCAGAACGCG ACCAACTATT GCAATTAGAA AAATTAGCCC GTGCGGAAGC AGAAGCCGCC AATCAAACGA AAGATGAGTT TGTGGCGATG GTATCTCATG ATTTGCGATC GCCATTGAAT GCAATTCTCG GCTGGTCACA ATTGTTACGA ACTCGCCAGC TTGATGAAAG CACATTTACC CGTGCGTTAG AAACCATTGA ACGCAACGCA CAATCACAGT CAAAACTTTT GGAGGACTTA CTGGATATGT CTCGTATCCT CAGGGGTAAG TTACAGTTGG AGTTTTGCCT AGTGGAGTTA CCTGCTATTA TCGGCGTAGC AATTGAGACT GCTTATCCCT CAGCCAAAGC TAAAGATATT CATTTAAAGT CGGTGATAGA TGAATCAATT CCACTGATTC CTGGTGATAT CAATCGCTTG TTACAAGTTT TAGGCAATCT GCTTTCAAAT GCGATTAAAT TCACACCTCC AGAAGGACAG ATCACAGTCC AACTATCATA TACAGACTCT GAAGCTCACA TCACAGTGAT TGATACAGGT ATTGGGATTA AGTCAGAATT TTTACCCTAC GTTTTTGAGC GCTACCGTCA AGCGGACTGC 
CAACATAAAC AACATGGTTT GGGTTTAGGG TTAGCGATCG CCCGTTATCT CATAGAATTA CACGGCGGGG CGATTCATGT GAATAGCCTG GGTGAGGGAC AAGGAACTAC TTTCACTATT AAATTACCAT TGCACCAATA A
|
Protein sequence | MAQPLTVLII DDCAEDRQVY RRYLQQDQEH HYTILEQESG EEALELCRQL QPDVILLDFL LPDQDGLELL AELQKLLQGS MPAIVMLTGY GNEAIAVQAM KSGVHDYLVK EQTTAERLRS TLHSAIIQTR LRQELSRSQE QLHDSQKFIE RIAETTPGIL YVHDLVEKRN VYINGKVGNL LGHTPQTIQN LGKEFLITFM HPEDLVRLPQ VFQQFDSAKD GDIIEHEYRM RHADGGWCWF FSRNSVFMRN SDGSPRQIVG TAFDITARKQ AEEELRSSNE RFRLAAAAVN SLIYEWDIKK GTVTRTEGLT RILGYSVEEA TPNIEWWQEQ IHPDDQDFLV EQFQNPPVNQ THYTFEYRIR HKNNQYLYVL DQGIITRDEN GQPVRVVGSA TDISDRKLAE AERDQLLQLE KLARAEAEAA NQTKDEFVAM VSHDLRSPLN AILGWSQLLR TRQLDESTFT RALETIERNA QSQSKLLEDL LDMSRILRGK LQLEFCLVEL PAIIGVAIET AYPSAKAKDI HLKSVIDESI PLIPGDINRL LQVLGNLLSN AIKFTPPEGQ ITVQLSYTDS EAHITVIDTG IGIKSEFLPY VFERYRQADC QHKQHGLGLG LAIARYLIEL HGGAIHVNSL GEGQGTTFTI KLPLHQ
|
| |
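The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the start codons they allow), and it strips the spaces that separate the 10-character blocks in this record. The helper names are hypothetical, and the GC helper is only one plausible way the GC content field could be computed.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acid string
# for the standard code; table 11 is assumed to share these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCACAAC CGCTAACCGT TTTAATTATT"))  # MAQPLTVLII
# Last 12 bases of the gene sequence, including the stop codon.
print(translate("TTGCACCAATAA"))  # LHQ
```

The two printed fragments match the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.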