Gene Information |
Locus tag | Hhal_2167 |
Symbol | |
ID | 4709712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2379876 |
End bp | 2381987 |
Gene Length | 2112 bp |
Protein Length | 703 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639856642 |
Product | CheA signal transduction histidine kinases |
Protein accession | YP_001003733 |
Protein GI | 121998946 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
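
The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2379876
end_bp = 2381987

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed
# to be a stop codon and therefore contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2112 0 703
```

The result matches the listed Gene Length (2112 bp) and Protein Length (703 aa).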
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0949772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTAG ACCTGAGCCA GTTCCTGCAG ACCTTCTTCG AGGAGAGCTT CGAGGGGCTG GATACCATGG AGTCGGGCCT GCTGGAACTC GATCCAGAGA GCCCGGATCC CGAGGCCCTC AACAACGTCT TCCGCGCGGC CCACTCCATC AAGGGGGGCG CCGGAACCTT CGGTCTCAGC GCGGTCTCGG ACTTCACCCA CCGCATGGAG ACCCTGCTCG ATCGACTGCG TGACGGCAAA CAGGCGGTTA CGCCCGACTG CGTCAATGTG TTGCTGAATG CCGTCGATTG CCTGCGAGGC ATGCTGGTGG CGATCCAGAG TGACCAAACC CTGGACGCCG ACGCGATCAG CACTGCCCAG CAGCGGCTCG ACGAGCAGCT CGGGCAGGCA CCCGTGGGCG GTGGGGGGGC GACCGGCGCC GCGGCCGGAC CGACAGCCGG CGGCGGTGAT GCCGGCGGTG ACGATGCGCC GCCGCCCCGC GGGGGAGGCG GCGGCGGCGG CTGGTTGATT CGCTTCGAGC CGCAGCCCCA CCTGTTCGCC ACGGGCAACG ACCCGCGTCG GCTCTTCTTG GCCCTCCAGG ATCTCGGCGA GCTTGAGGTC GAATGCGACA CCTCGGGTTT GCCGCCCTTC GAGCAGCTCG ATCCGGAAAC CTGTCAGCTG GCTTGGACCC TGCGCCTCTA CGCCGACGTG CCCGAGGCGG CGGTGCGCGA GGTCTTCGAG TGGGTCGAGG ACGATGCCCG GCTCGAGATC CAGCCCCTGG AGGCCGCAGG TACCGAGGAG GTAGCTCCGG TAGCACCCGG GCAGCCCCCG ATGCCGACCT CTGAGGAGAG TGGCCCTGCG GCGCCCGCAG CCGGCGGTGG CGAAGCGCGC AAGCCGGCCG CACGCCGTGG GGGAGGGAAT AGCTCCATCC GGGTGGATAC CGAGAAGATC GACGCCCTCA TCGATATGGT CGGTGAGCTG GTGATCACCC AGTCGATGCT CAGCCAGGTC GGCAAGGAGT TCACCGCCGA GCGCCTGGAA GAACTCCAGG ACGGCCTGGC GCAGCTCGAG CGCAACACGC GCGAACTGCA AGAGAACGTC ATGCGCATCC GCATGGTGCC AATCAGCTTC GCCTACTCGC GGCTGCCGCG CATCGTCCAC GACACCAGCC GCGCCCTGGG CAAGGCCGTC GACTTCCAGA TGGAGGGCGA GCAGACCGAG CTGGACAAGA CGGTGATGGA GAAGATCATC GATCCGCTGG TCCATCTGGT CCGCAACAGC GTCGACCACG GCATCGAGCC GCCGGAGGAG CGGGCCGCTG CCGGCAAGCC CGAGACCGGC ACCATCACCA TCGAGGCCTA CCACAAGGGC GGCAATATCA TCATCGAGAT CGCCGATGAT GGTCGCGGCA TCAATCGCGA CAAGCTGCTG GCCAAGGCGC GCAGCTCCGG GCTGCTCGAG GACGGCACCG AACTGCCCGA TGACCAGGTC TTCGACCTGA TCTTCCATCC CGGGCTCTCC ACTCATGAGC AGGCCACCGA ATACTCCGGC CGGGGTGTGG GGATGGATGT GGTCAAGCGC AACGTCCGCT CCCTGTCCGG CAATATCCAT GTGCGCTCAG CTCAAGGGCA GGGCACGACC ATCACCATCT CGCTGCCGCT GACCCTCTCG ATCCTGGACG GCCAGCTGTT CCGGGTAGGC GACCAGACCT ACATCGTGCC GCTGGTTTCG GTCATCGAGT CGCTCCAAGT CGACGGCAGC AAGCTCAGCC GGGTTACCGG CCGGGGCGAG GTCTACCACT GGCGCGAGGG GTACGTCCCC 
ATCGTGCGCC TCCACGAGCT CTTCGACACC GAACCGGTTC GCCGCGAGCT CGCCGGTGGG CTGATGGTGA TCGTCGAGGA CGAAGACACC TACCTGGGAG TCTTTGTCGA CGACCTCCTT GATCAGCAGC AGGTGGTCAT CAAGAGCCTG GAGGCCAACT ACCTGCAGGT GCCCGGCATC GCTGGCGCCA CCATCCTCGG CGACGGCACC GTGGCCCTGA TCCTCGATAT CGCCGGGCTC ATCGAGATGA GCCGCGGTGG CCGTCGTCAG CCCTACATCC CCACCCCGGA GGACAGCGAT GAGGCCGCCT GA
|
Protein sequence | MSVDLSQFLQ TFFEESFEGL DTMESGLLEL DPESPDPEAL NNVFRAAHSI KGGAGTFGLS AVSDFTHRME TLLDRLRDGK QAVTPDCVNV LLNAVDCLRG MLVAIQSDQT LDADAISTAQ QRLDEQLGQA PVGGGGATGA AAGPTAGGGD AGGDDAPPPR GGGGGGGWLI RFEPQPHLFA TGNDPRRLFL ALQDLGELEV ECDTSGLPPF EQLDPETCQL AWTLRLYADV PEAAVREVFE WVEDDARLEI QPLEAAGTEE VAPVAPGQPP MPTSEESGPA APAAGGGEAR KPAARRGGGN SSIRVDTEKI DALIDMVGEL VITQSMLSQV GKEFTAERLE ELQDGLAQLE RNTRELQENV MRIRMVPISF AYSRLPRIVH DTSRALGKAV DFQMEGEQTE LDKTVMEKII DPLVHLVRNS VDHGIEPPEE RAAAGKPETG TITIEAYHKG GNIIIEIADD GRGINRDKLL AKARSSGLLE DGTELPDDQV FDLIFHPGLS THEQATEYSG RGVGMDVVKR NVRSLSGNIH VRSAQGQGTT ITISLPLTLS ILDGQLFRVG DQTYIVPLVS VIESLQVDGS KLSRVTGRGE VYHWREGYVP IVRLHELFDT EPVRRELAGG LMVIVEDEDT YLGVFVDDLL DQQQVVIKSL EANYLQVPGI AGATILGDGT VALILDIAGL IEMSRGGRRQ PYIPTPEDSD EAA
|
| |
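
The relationship between the gene sequence and the protein sequence listed above can be illustrated with a small translation sketch. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with the standard code apart from its additional start codons. Alternative start codons are not handled here, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCGTAG ACCTGAGCCA GTTCCTGCAG"))  # MSVDLSQFLQ
```

The output agrees with the first block of the listed protein sequence. Applying the same function to the closing bases `GCCGCCTGA` yields `AA`, consistent with the protein ending in `EAA` followed by a TGA stop codon.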