Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5122 |
Symbol | |
ID | 5737080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 162686 |
End bp | 165547 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641282287 |
Product | signal transduction protein |
Protein accession | YP_001547878 |
Protein GI | 159901632 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5635] Predicted NTPase (NACHT family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATA GCACTGATGG TTCCGTCAAC GCTGATGATT CTGAGTTCTA CGGCCCCGTG GTGGGGGTCA ATCTTGGCAC GATCATCTAT GGCCGTCCAC CCGAAGATGC CGAGCGCCAA CGCTTAGTGG CCTATTTGGA GCAGGTGACG AAAAGCCACA ATACCTTGCG GGTAGTTGGG GTTGGCTCGT CGCATCTCGC GTCAGGCATT GACCTCGCAT CCGCCTATAT GATGCTAGCG GTGCAGGGAC GGCAGCGGAT GCTGCGGCCA CTCACGGCGG AAGAAGTCGA GGCATATCAG CAACACAGGT TTAAAATTCC CAAGGAACTG AGTGCTGATC GCTGTTTGCC CGATCACGCC GTGCTTGCGG TCGTTGAGGA TAGTCAGTCT GGTCAATTGG CGTTGTTTCG CGCTGAGTTG GCCACGGAAA CCGTCTTAGC GCATCCCTAC CTCGTGCTGT GTGGCCCGCC GGGGAGCGGG AAATCAACCT TCGCCAAGCA TCTCGTGTGG GCTTTGGCCC AACGTGGCCG TGACCAGATC AACCACCATA CAGGCTTACT GGGCTGGAAT GATCACCAGC GCGTGTTGCC CGTGTTCATG TCCTTGCGAA CCTTGGCAGG CGCATTAATT GGGAAGGATT TAGGGTTGAC CGACACACCA AACATTGGGC TGTTGCTCGA TGCGGTGTGT GCGCACCTGC AAACTAAGTA TGGACTTGAA CAGCCGCGCG AGCTGCTGAA GGCGGGGTTA AAAGGTTCGC TGACGGTGTT GTTTGTCTTT GATGGCTTGG ATGAAGTGCC ACTGGAGGCG ACCGCAGCGA GCCTTGATCG TCGCTCGCTG TTGACCTTTG TGCGGTTGTT TGCCAGTGCC TATGCTGCTC GTATCCTCAT CACCTGCCGC TCGCGGGCGT GGACAGAAGA CTATCGCCAG ATCACGCAGT GGCCCATGGT CGAATTGGCT CCGCTGAGCG GTGGCCAAAT GACCCAGTTT ATCAATACAT GGTTTCCGTT GTTGCACGCC AAGGGCGTGA TTGAGCACGA GGCCATTGCA CGTTATGGTG CGCAGTTGAT GCAGGCGTTG CGCGATCCCC AGCGCCGCCG CTTACGGGAC ATGGCCGACA ATCCGTTGTT GCTGAGCATG ATGATCTTTG TGTTGGCTCG CAAGGGTGTC TTGCCGCGTG ACCGCCATAG CTTGTACGAC GATATTCTGA AGCAACTCTT GGGCGAGTGG GATACCACCA GTCGCAATGG GCAGAATTTG GGGCAAGCGG TTGGGGATGA TCGGATTACG GGCGACGAGG TGCGCGATCA GGTGTTGGAT CGGTTGTGTT ATCAGGCGCA TTTAACCGCC ACGTCAGCGG ATGGGCGGGG GCGGATTCCA AGCCGCGAGC TTCAAATTGC CTTGATGGAG TATTTTGCAC GCGTCAACGT GGCTGATCCC TATCGAGCGG CAGAACGCTG TGTTGCCTAT ATCGATCAAT GCAGCGGCTT GCTTCAGCCC GAGGATGAGG GGATGGTCTA TGCCTTTGCC CACTTAACCT TGCAAGAACA GAGCGCTGGT CGCCACTTGG TGTTTTCTGA ATCGCTCGAT CAATTGTTGG CCTTACGTCG TGATGACCGT TGGCGTGAGC CGATCTTCTT AGGGGTTGGC TGCCTGACCA AAGCGAGGCT TGGCAGTGCC AAAATTGAGC AACTCCTGAC AACGTTGGTT GATTCTGATG CCTATGAAGC GGGGGAGATG CACCAATACG ATTGGTATCG TGATCTGATT TTGGCCGCTG AGTTAGGCGC GGATTGTGAT TGGGGCTTGC TGCATGGCAA GCAGATCAAG GTGGATCGCA TCCAGCGACG GTTGCGGGCG GGGCTGGTTA ACCTGCTTGA AGACTATGAC CATGCGCAAG CGGCGCTTGC CTATTATAAC GGTCAAGCGA TGGAGCCAGC GCCGTTGTTG GTGCGTGAAC GGCAAAAGGG TGCCGAACTC TTGGCAGGTT TGGGTGATGC ACGTTATCCG GTGAGTATCG AGCAATGGCA ACAGGTAACC TGCCAGCTTT CCACCCAGTT TGGTCGCGAG GGTACTCATT ATTGGCGGTA TATCCCCGCA GGCTGCTATC GGGTTGGTGG TTGGGATGGA GATGAACAAG CCACAACCGT CGAACTTCCA TCCTACTGGG TCGGACGATT TATGGTGACC GTTGATCAAT ATCGGGCGTT TATCGAGGCA GGCGGCTATA CCAACGATGC ATGGTGGACA ACGCAAGGCT TAGCTTGGAA AAAGGAAACA AACCGAACAG AACCATGGGG TTGGAATGGT CAAATCGAGC AGGAATACCG GAATCAGCCT GTTTATGGGG TGAGTGGGTA TGCAGCGATG GCCTATTGTC AGTGGTTGAG CGAGCAGCTT ACGCCATGGC TGCCGCAGGG GTATTGCATT CGGTTGGCCA GTGAGGCGGA ATGGGAAGTT GCAGCAGCGT ATAATGCCGA TGGCCAGCGC CATACCTATC CGTGGGGCGA GCAGCCTGCC ACACCGGAGC ATGCGGTCTA CGATTGGAGC GATGAACGGC GACCGCTATC AGTGGGTTTA GGGCTGCTGG GCCAAGCGGC TTGTGGTATG CTGGATAGCG TTGGGAACCT GTGGGAATGG GCCGCCGTGC GGTATCAGGA CAATGGTGGC GATAGGCAGC AGGTGCTTGC GGATAGTAAC GATTGGATGG TACTGCGTGG TAGCTTATAT TACAACAATA GTACAAAGAT TCTTTGCGCG GCGCGTGACT GGTGTCGTCC CGACGACGAC GACGTCTACA ACTGCCCTGG ATTTCGTTGT TTTTTAGCCC CTCGTTCATA TGTTTTGCAT GCTGCATCCT GA
|
Protein sequence | MADSTDGSVN ADDSEFYGPV VGVNLGTIIY GRPPEDAERQ RLVAYLEQVT KSHNTLRVVG VGSSHLASGI DLASAYMMLA VQGRQRMLRP LTAEEVEAYQ QHRFKIPKEL SADRCLPDHA VLAVVEDSQS GQLALFRAEL ATETVLAHPY LVLCGPPGSG KSTFAKHLVW ALAQRGRDQI NHHTGLLGWN DHQRVLPVFM SLRTLAGALI GKDLGLTDTP NIGLLLDAVC AHLQTKYGLE QPRELLKAGL KGSLTVLFVF DGLDEVPLEA TAASLDRRSL LTFVRLFASA YAARILITCR SRAWTEDYRQ ITQWPMVELA PLSGGQMTQF INTWFPLLHA KGVIEHEAIA RYGAQLMQAL RDPQRRRLRD MADNPLLLSM MIFVLARKGV LPRDRHSLYD DILKQLLGEW DTTSRNGQNL GQAVGDDRIT GDEVRDQVLD RLCYQAHLTA TSADGRGRIP SRELQIALME YFARVNVADP YRAAERCVAY IDQCSGLLQP EDEGMVYAFA HLTLQEQSAG RHLVFSESLD QLLALRRDDR WREPIFLGVG CLTKARLGSA KIEQLLTTLV DSDAYEAGEM HQYDWYRDLI LAAELGADCD WGLLHGKQIK VDRIQRRLRA GLVNLLEDYD HAQAALAYYN GQAMEPAPLL VRERQKGAEL LAGLGDARYP VSIEQWQQVT CQLSTQFGRE GTHYWRYIPA GCYRVGGWDG DEQATTVELP SYWVGRFMVT VDQYRAFIEA GGYTNDAWWT TQGLAWKKET NRTEPWGWNG QIEQEYRNQP VYGVSGYAAM AYCQWLSEQL TPWLPQGYCI RLASEAEWEV AAAYNADGQR HTYPWGEQPA TPEHAVYDWS DERRPLSVGL GLLGQAACGM LDSVGNLWEW AAVRYQDNGG DRQQVLADSN DWMVLRGSLY YNNSTKILCA ARDWCRPDDD DVYNCPGFRC FLAPRSYVLH AAS
|
| |