Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5163 |
Symbol | |
ID | 5737121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 235604 |
End bp | 238513 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641282328 |
Product | signal transduction protein |
Protein accession | YP_001547919 |
Protein GI | 159901673 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5635] Predicted NTPase (NACHT family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGATA CTACCATCGG TTCGGTCAAT ACCAATTATT CAATGATTGA TGGCCCCGTT GTTGGCATTA ATTTAGGCAC GATTATCTAT GGACGGGCAC CAGAGGAAGG TGAGCGGCAG AGCTTAGTCC GATATTTGGA ACAACTTTCC AACAGCCATC GCAAAATCCG AGTCATTGGA CTTGGTCCTT CGCGCCTTGA ATCGGGCATT GATCTTGCAT CCGTCTATAT TATGCTGGCG GTGCAAAAAC GGTATCGCAT TGTTCGCAAA CTGACTTCGT TTGAAATTAT TGACTATCAA CGTCAAAAGC TCAGGATTCC CCATGAGTTG AAGCCTGATC GCTGTTTACC CGATCAGGCG ATTATCAAGA TTGGGAAACA TCAACGATAT GGTTGGTTGA TGTTCCGCGC GGAATTAGCG ACTGAAACCA TTGCGCAGTA TCAGTATCTC ATCCTCTGCG GTGCGCCTGG GAGTGGAAAA TCTACATTTG CTAAACACTT GGTCTGGGCA TTGGCGCAGC GGGGACTTGA TCAGATCAAT CATCAAACAC ACCTCCGTGG TTGGACTGAT AAACGGCAGC TTCTCCCTAT CTTTATGCCA CTACGGCAGC TGGCAGGAGC TTTAGCGGGC AATGATCTGG GTTTGCATGC TGAACCAAAA ATTGGGTTAT TGCTCGATGC ACTCTGTGAC TATTTACAGA CACACTATGG GTTAGATGAA CCACGTACCC TGTTAACGGC TGGTTTGAAC CAGCGTCACA AGGTCTTGTT TGTGTTTGAT GGACTTGATG AAGTTCCGGT TGAAGCCAAT GAGCATAGCC TTGATCGCGC GTCGTTGCTG CGGTTCTTGC GGATTTTTGC CGATCATCAG CCGAACGCTC GTATGCTTAT TACCTGTCGT TCACGGGCGT GGACATCGGA ATATCGCATG ATCACCCAAT GGCCGATGCA CGAGTTAGCT CACTTGACTG GGGGGCAAAT TACTCATTTT GTTCATTATT GGTTTCCGCA GTTGGTATTA AGTGGGGTTA TTGGTCATGA CGAAGCACAG CGGTATAGTA CCGAACTTTT GAAAGCCTTA CAGCACCCCA AACGCCAAAA ATTACGGCTG ATGGCAGAAA ATCCCTTGTT ACTAAGTATG ATGATTTTTG TACTGGCCGA AAATGGTGTC TTGCCCCGCG ACCGTCATAG TCTCTACGAG CAGGTGCTCG GTCAATTGTT GGGGCAATGG GATGCGAAAC GTGAAGGGGA CAACTTAGGA CAAGCCATCG GCGATGAACG AATTACCAGC CAAGAGCTCC GTAATCGGGT GCTTGATCGG CTGTGCTATC ACGCACATAT GCAGGCCTTG TCGGTTGATG GGCGTGGACG AATTCATGGC CGTGAGCTGC GCCTTGAGTT AATGGATTAT TTTAATCGGG TCAAAGTTGC CGATCCCTAT CGGGCAGCGG AGCGCTGTAT TGCCTATATC GATCAGCGTA GTGGTCTATT ACATCCGGAA GATGCAGGTA TGGTGTATGC CTTTGCGCAC CTGACCTTGC AAGAACATAG TGCAGGCCGC CATTTGTTGT TTTATGAATC AATTGGACAA ATTTTAGCCT TACGGCATGA TGATCGGTGG CGTGAGCCGA TCTTTTTAGG TGTAGGGTGC TTGACGAGTG AAAGTTTAGG ATCAAGCAAG ATCAGCGAAC TGTTAACCGC ATTAATCGAT CGCTATGACT ATGGAAGTGA TACCTGCAAA CCGTCTCATG TATGGTATCG CGACGTAGTG CTGGCTGCTG AATTGGGATT TGATCGTGAT TGGGGGTTAT TGAGCGGTAC AGGGATTGAT GTTCGCCGTA TTAAACGCGA AATACGGCTC GGCGTAGTGC AAATGCTTCA TGATCGTCAG CATGCGCAAT CCGCTCTTGA GTATTTCTAT GGGGCAGCCA TGAAACCGAC ACCGCTCTTA GTCAAGGAAC GCCAACACGC TGCCGAATTG TTGGCAGGGC TGGGAGACCC ACGCTATCCT ATCGATGGAA CGCAATGGCA GCAGGAGACA ACACACCTAT CACAACAGTT CGGACGCGAG GGGACCCATT ACTGGCGCTA TATGCCTGCG GGTCAGTACC AGCGTGGTGA CGCAGGAACA GACATAGCCG AGACCATAGA AAAACATCTG GGTGATTTTG ATCCTGCTGG GATGAATCAC CGTGGTCAGG GTGATACTCA GATCGGGGAT GTAGGGGTTG TCCCTCATCC ATATTGGATT GGACAATTTA TGGTGACGGT GGAGCAATAC CAGGCCTTTA TCCAAGCAGG AGGGTATCAC ACTGATCGAT GGTGGTCGAC GCATGGCAAG GCATGGAAAA CAATGATTGC ATGTAGCGAG CCTTGGTGGT GGGAACAACA AACGCTCCAG CAATATATCA ACCAACCCAT CTATGGAGTA AGTTGGTATG AAGCCGTGGC ATATTGTAAC TGGCTGAATC ACTACCTCCA GCCAATGCTA CCAGTAGGCT ATCGCGTCTG TGTACCAAGT GAGACGGAAT GGATGAGCGC GGCCTATAAT GATGAACATG GGCAATTTCA TAACTACTCA TGGGGAAATC AGCCCTTAAC TCCTGAGCAT GCAGTCTATG ATTGGGTTGA GGAACGGCGG CCAGCCCCAG TCGGCTTAAG TAGGATGGGT GATGCACCAT GTGGTGCTGC GGACATGACA GGCAATCTGT GGGAATGGAC AGCGACACTG GATGGAAAGC AGGACGAGCA CATTGATGAA TCTGCGGTGA ATGATGCCTG TTTGATCACA CTACGCGGTG GATCTTGTTA TGATAATGTT ACAACGATTC TTTTTGCTGC GAATGATACA TCGCTCCCGA TAAATGTTAG TTACAATCGT GGATTTCGGT GTGTGATTGC CCGGCGTTGA
|
Protein sequence | MPDTTIGSVN TNYSMIDGPV VGINLGTIIY GRAPEEGERQ SLVRYLEQLS NSHRKIRVIG LGPSRLESGI DLASVYIMLA VQKRYRIVRK LTSFEIIDYQ RQKLRIPHEL KPDRCLPDQA IIKIGKHQRY GWLMFRAELA TETIAQYQYL ILCGAPGSGK STFAKHLVWA LAQRGLDQIN HQTHLRGWTD KRQLLPIFMP LRQLAGALAG NDLGLHAEPK IGLLLDALCD YLQTHYGLDE PRTLLTAGLN QRHKVLFVFD GLDEVPVEAN EHSLDRASLL RFLRIFADHQ PNARMLITCR SRAWTSEYRM ITQWPMHELA HLTGGQITHF VHYWFPQLVL SGVIGHDEAQ RYSTELLKAL QHPKRQKLRL MAENPLLLSM MIFVLAENGV LPRDRHSLYE QVLGQLLGQW DAKREGDNLG QAIGDERITS QELRNRVLDR LCYHAHMQAL SVDGRGRIHG RELRLELMDY FNRVKVADPY RAAERCIAYI DQRSGLLHPE DAGMVYAFAH LTLQEHSAGR HLLFYESIGQ ILALRHDDRW REPIFLGVGC LTSESLGSSK ISELLTALID RYDYGSDTCK PSHVWYRDVV LAAELGFDRD WGLLSGTGID VRRIKREIRL GVVQMLHDRQ HAQSALEYFY GAAMKPTPLL VKERQHAAEL LAGLGDPRYP IDGTQWQQET THLSQQFGRE GTHYWRYMPA GQYQRGDAGT DIAETIEKHL GDFDPAGMNH RGQGDTQIGD VGVVPHPYWI GQFMVTVEQY QAFIQAGGYH TDRWWSTHGK AWKTMIACSE PWWWEQQTLQ QYINQPIYGV SWYEAVAYCN WLNHYLQPML PVGYRVCVPS ETEWMSAAYN DEHGQFHNYS WGNQPLTPEH AVYDWVEERR PAPVGLSRMG DAPCGAADMT GNLWEWTATL DGKQDEHIDE SAVNDACLIT LRGGSCYDNV TTILFAANDT SLPINVSYNR GFRCVIARR
|
| |