Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2137 |
Symbol | |
ID | 5734039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2687754 |
End bp | 2690585 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279278 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001544905 |
Protein GI | 159898658 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAATT CGCTACGACC CGCCGAGCCA CGCATCGATC AAGCTACGAT AACCAAGCTG CTGCATCAGT TTGATTGGAG CACAACGCCG CTTGGGGTGA TCGATACTTG GCCAACTGCC ATGCGCAGTA TCGTTGATAT GATGTTGGCG GTTCCGGTTG CCATGACAAC CATGTGGGGT AAGCAGGGCA TCATGATCTA TAACGCTGGC TATGCGGTCA TTGCTGGCTC ACGTCACCCC GAAGTATTTG GGCTGCCGGT AACCGCCGCT TGGCCTGAAG CCGCCGATTT CAATCAATCG ATCCTCGATC AGGTCTTTGC GGGCAAAAGC CTTACTTACC AAAAACGCAA TTTTATGTTG CACCGTAATG GCCGTAACGA ACAAGTCTGG TTTGATCTTA GTTATAGCCC GATTTTTGAC CAAAACGGCG AAATTATTGG GGTGATGGCG CTGGTGGTCG AAATAACTGC CCAAGTTTTA GCTGAACAAC AACGCCATCG CGCCGAAGAA CGCTTTCAAT TAGCCTTGGA TGCTGGCTTG CTGATCGGCA CATGGGATTG GGATATCGTA GCCGATCGCT GTATCGTCGA TCCACGTTTT GCCGAATATT TTGCGCTTGA CCCGACCCTT GCCAGCCAAG GTGTGAGCGT CGAAACCATG CTTGAGGCGA TTCATCCCGA TGATCAAGCA ACCATTGCCC AATTAATTCA ATCGGCAATC CATAGCGGCC AATCATATCG CGCTGAATAT CGCGTGCTCC ACCGCGACGG CGATTATCGC TGGGTTGAGG CCAATGGCTT TTGCATTTTT GAGCACAATC AGCCGCAACG CTTTCCTGGC GTATTAATCG ATATTACCGA GCGCAAGCGC CGCGAAGATG CCTTGGAGCA TAGCGAGGCT CGCTTACGCG CAATTTTCGA TACCTTGCCC GTCGGCTTGG TTTTTGCCGA AACCCCAACT GGACGGATCA CCGATGGCAA TGCCCATGTC GAGCAGATTT TGCGCCATCC GGTACTGCCA TCGCCTGCGA CCGAGGCCTA TGGCGAGTGG ATCGCTTACG ACGAACACAA TCAACTTGTA CCAATTGAAG AATATCCATT GGCGATTGCC GTCAAAACTG GCCAAGTTTC CGAGCGCGAT TTCCACTACC AGCGGGGCGA TGGCACACGG GCTTGGATCA AGGTTATCGG CGGCCCAGTC CGCGATCTTG ATAACACGAT CACTGGCGGT TTGATCACAA TTATCGATAT TGACCGTGAA AAACGCACCG AAACGCAACT TCAAGCGCTC AACAACGATT TAGAAGGCTT GGTTGCCCAA CGCACCCGTG AGCGCGACCG AATTTGGCTG GTCAGTCAAG ATCTGTTAGG CATTGCCGAT CCAGCGGGCA ATTGGCTCGC GATCAATCCT GCTTGGCAAC GCACGCTCGG CTGGGATGAT AGCGATATTC TTGGGCGCAC CAGCGAATGG CTTGAACACC CCGACGACCA TGAACGCACG CGCCAAGAGG TTGCCAAGCT AGCCAATGGC ATTCCCACCA GCTATTTTGA AAATCGTTTT CGCAGTCGCG ATGGCGAGTA TCATTGGCTT TCGTGGACGG CAGTGCTGGA TAATGGCTAT CTCTATTGCG TTGCCCGTGA TATTAGCAAC GAAAAACAGC GCCAAGCTGA AATCGAACGC ATGCAAACCC AATTGCGGCA ATCACAAAAA ATGGAGGCGA TTGGACAACT CACAGGCGGC ATCGCCCACG ATTTCAATAA TATTTTGACC AGCATTTTGG GTGGGCTCGA TTTGCTGCAA CGGCGGATCA ACGTAGGCCG TTTCGATACA ATTGAGCGCT ACATCAACAG TGCAATTAAA TCGGCCAAAC GTGCTGCCGC ATTAACTCAA CGTTTATTGG CATTTTCGCG CCAGCAAGCG CTGGATGTGC AACCAATCAA TATCAATAGC CTGATCCAAT CGCTCGACGA TTTACTCCAA CGCAGCCTTG GCGAGCAAAT TCAGGTGGCA ACCAACCTCA GCGATGACGT TTGGCGGGTG CGCACCGATG CCAATCAGCT TGAAAACGCC TTGCTCAACT TGGCGATCAA CGCCCGTGAT GCCATGCCCT ACGGCGGAAT TCTGACGATT AGCACGAGCA ACATCGACGC GCAACAAGCA AATCAACTGC AACTTGATCC GCATGAATAT GTATTAATCG AAGTCATCGA TACCGGCACT GGCATGAGTA GTGAGGTCAT CGAACGCGCT TTCGATCCAT TTTTTACCAC CAAACCTTTG GGGCAAGGCA CAGGTTTAGG TTTATCGATG ATTTATGGCT TTATCAAACA AATTGGCGGG CATATTCAAA TTGAGAGCCA ACTCGAACAA GGCACCAGCG TCAAGCTCTA TTTGCCGCGT GACCAAAGTG GGCTTGAGCA TGGCTTTGCC GCCGAATCGG CCCAAGCTCG CAGCATCGAG GGAGCCACAA TTTTGGTGGT TGAAGATGAT GAAGCGGTAC GCATGGTGTT AATCGATGTA CTCGACGAGT TAGGTTACCA TACGCTTGAG GCTGAGGATG CCAGCAGTGC CTTGACCTTC TTCGAGCAAC CAACAACAAT TGATTTGGTG ATTAGCGATA TTGGATTGCC TAAAACCGAT GGCTATGATC TAGCCCTCCA GATCCGCCAG CGCTACCCAA CCTTGCCAAT TTTGCTGGTC AGTGGCTACA CCGATCGGGC GGCAGTGCGC AGCGGCGAGC TTGAACCACA GATGGAATTG CTCAGCAAAC CATTTGAAAT TACCGACCTG GCCAATAAAA TTCACGATTT GCTACAACAA GGCCACACGT AA
|
Protein sequence | MPNSLRPAEP RIDQATITKL LHQFDWSTTP LGVIDTWPTA MRSIVDMMLA VPVAMTTMWG KQGIMIYNAG YAVIAGSRHP EVFGLPVTAA WPEAADFNQS ILDQVFAGKS LTYQKRNFML HRNGRNEQVW FDLSYSPIFD QNGEIIGVMA LVVEITAQVL AEQQRHRAEE RFQLALDAGL LIGTWDWDIV ADRCIVDPRF AEYFALDPTL ASQGVSVETM LEAIHPDDQA TIAQLIQSAI HSGQSYRAEY RVLHRDGDYR WVEANGFCIF EHNQPQRFPG VLIDITERKR REDALEHSEA RLRAIFDTLP VGLVFAETPT GRITDGNAHV EQILRHPVLP SPATEAYGEW IAYDEHNQLV PIEEYPLAIA VKTGQVSERD FHYQRGDGTR AWIKVIGGPV RDLDNTITGG LITIIDIDRE KRTETQLQAL NNDLEGLVAQ RTRERDRIWL VSQDLLGIAD PAGNWLAINP AWQRTLGWDD SDILGRTSEW LEHPDDHERT RQEVAKLANG IPTSYFENRF RSRDGEYHWL SWTAVLDNGY LYCVARDISN EKQRQAEIER MQTQLRQSQK MEAIGQLTGG IAHDFNNILT SILGGLDLLQ RRINVGRFDT IERYINSAIK SAKRAAALTQ RLLAFSRQQA LDVQPININS LIQSLDDLLQ RSLGEQIQVA TNLSDDVWRV RTDANQLENA LLNLAINARD AMPYGGILTI STSNIDAQQA NQLQLDPHEY VLIEVIDTGT GMSSEVIERA FDPFFTTKPL GQGTGLGLSM IYGFIKQIGG HIQIESQLEQ GTSVKLYLPR DQSGLEHGFA AESAQARSIE GATILVVEDD EAVRMVLIDV LDELGYHTLE AEDASSALTF FEQPTTIDLV ISDIGLPKTD GYDLALQIRQ RYPTLPILLV SGYTDRAAVR SGELEPQMEL LSKPFEITDL ANKIHDLLQQ GHT
|
| |