Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3459 |
Symbol | |
ID | 5735320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4351458 |
End bp | 4353092 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280606 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001546223 |
Protein GI | 159899976 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGTG GTCAAGAAAC GATTCTAGTG ATCGACGATA GCGAAGCAAT TCGGGTCAAA CTTCAGGCTC AATTGCGTTC GCTGGGCTAT CAGGTCGTGT TGGCTGAAAC TGGGCGAAAT GGGCTAAACG CGATTACCCA ACATCATCCG CACCTCATTT TGCTCGATTA TCAGCTGCCT GATACCACTG GGTTGGATTT GCTGCGTAAG CTGCGCACCG ATGGCAACAC CGTGCCAATT TTGCTGATGA CTGCCGAAGG TTCCGAACGC ATCGCCGTAA CGGCCTTCAA AATGGGCGTG CGCGATTATT TGATCAAGCC TTTTGAGCCG CAAGATGTTG CTCAGGCGAT CGATCGGGCG CTGCGTGAGT GGCGTTTGCA ACGTGAAAAA GAAATTTTGC TTGGTCAATT ACAAGGCCAA GTGCGCCAAT TAACGGTTTT GCATCGGGTT GGTAAGGCGG TAACTGCCCA ACTTGATGCC AGCAATTTGC TTGAACGAAT TGTTGAAGCC TCAGTTTTTC TTTCGAATGC CGATGAAGGT TTTGTGCAAT TAATTGATGA TAATCAGTTG GTTGTGCGAG CCTCACACAA TATTAACCCA TTGCATTTGC GTGAGCTAAG CAAACATACC GATTATGAAT TGGCAACGCG CACAATTAAA ACCAACAAAC CAATTCGGAT CAATTCCGAG CGCGATGGAA TTCGGGTGCA AGCCAATTAT TTGGCCCAAG CTGTGCTCAT GGTGCCGTTG TTGGTGGGGA CTGAAGCCTT GGGCGTGCTG ACCGTGGCTG CGACAACCCA TCGCCGCAAT TTTGATGAAG GCGATGAGCG CTTGATGCAG ATGCTGGCCG ACTACGCCTC GATTGCCTTG CACAACGCCC GCACCTACAG CGCACTGCGC GAAACCCAAG GTCGCTTGGT TGAAGCTGAA AAATTATCGG GCATGGGGCG CATGGCGGCT TCGCTAGCCC ACGAAATCAA CAATCCACTG GCGATTATTC GCTCAGGCTT GGAGTTGGTG GCGCAACAAC ACACACCTGG CACGGCGCTT GGCGATTTGG TGCAAGGACT GGATGAAGAA GTGGCGCGGA TTGCGCGGTT GCTCTATACC TTGGTGAATT TCTATCAGCC CAATAACGAT GGTGTGCCGC CTGATCTCAA TCATTTGATC ATTTCACTGA TGCACATCAC CAAACCACAA CTTGATAAAG CCAATGTTAA ACTGTATCAG GAGTTGGCGA CCGATCTGCC AGCGCCAAAC ATTAGCAGCG ATGCTTGTAA GCAAGTTTTG ATCAATTTGG TGCGCAATGC GATTGATGCG ATGCCCGATG GCGGCAAATT GACCATTCGC ACGGCCCATC AAAAGGGTCA AATTTTTGTC AATGTTGAAG ATAGCGGAAT TGGGATTCCG CCCGAACATC GCGAACGTAT TTTTGAGCCA TTCTTTAGCA CTAAAGGTGT GACGGGAACG GGGCTTGGGC TTTCAGTTGT TTATGGTATT TTGCAACAAG TTGGCGGCGC AATTTCGGTA GAGAGCATCG TGGATAAAGG CTCGAACTTT ACCTTGCGCA TTCCGGTTGC AGCCCAACGC AGCCAATCGC CCGATCTTGA TTCCGATGAA TTGTTGATTG GTTAA
|
Protein sequence | MASGQETILV IDDSEAIRVK LQAQLRSLGY QVVLAETGRN GLNAITQHHP HLILLDYQLP DTTGLDLLRK LRTDGNTVPI LLMTAEGSER IAVTAFKMGV RDYLIKPFEP QDVAQAIDRA LREWRLQREK EILLGQLQGQ VRQLTVLHRV GKAVTAQLDA SNLLERIVEA SVFLSNADEG FVQLIDDNQL VVRASHNINP LHLRELSKHT DYELATRTIK TNKPIRINSE RDGIRVQANY LAQAVLMVPL LVGTEALGVL TVAATTHRRN FDEGDERLMQ MLADYASIAL HNARTYSALR ETQGRLVEAE KLSGMGRMAA SLAHEINNPL AIIRSGLELV AQQHTPGTAL GDLVQGLDEE VARIARLLYT LVNFYQPNND GVPPDLNHLI ISLMHITKPQ LDKANVKLYQ ELATDLPAPN ISSDACKQVL INLVRNAIDA MPDGGKLTIR TAHQKGQIFV NVEDSGIGIP PEHRERIFEP FFSTKGVTGT GLGLSVVYGI LQQVGGAISV ESIVDKGSNF TLRIPVAAQR SQSPDLDSDE LLIG
|
| |