Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_0226 |
Symbol | |
ID | 3761626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 268014 |
End bp | 271070 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637784931 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_390496 |
Protein GI | 78484571 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTTAA GGCTATATAA AGAATCTGTT GTTACTCATT GGAAGAATGG TGGAAAGGTT TTTGCGTTAA CCTTTTTGCT CGGCATCTGT TTGGTATTCT TTTTTATTTA TATTTATTTG CTGGTTGATG CCGAAGGCAA AAAAGAGCGC ATCCAACTTC ACAATGAACA CCTGTCTTTA GCGGCCGATA CTCTCATTAA TCAGTGGGTT AGCGGTATGG CTCAAAATGT TCTGTTTCTG GCGGAAGAAA CCACCAGCCT GACTCAGAAT ACCTCTCACA TTAACCTGAC ACCCTTAAGA AATCTCTACT TTAACTTTAT CAATCACCAG AAAGTACTGG GACAGATTCG ATATATCAAT AATGCGGGTC AGGAAATGAT TCGTTTTAAC CAGACGCCTG CGGGGTTAAG GGAAGTAATG TCTGAAAATC TGCAGGATAA GTCATCGCGT TACTATTTCA AAGATGCGGT GGATCTTAAA GATGGGCAGT TGGCCATTTC GAAGTTGGAT TTAAATATTG AAAATAATAG AGTTGAGATT CCTATAAAAC CGACCTTAAG AATTTCAACG CCCATTTTTG ACAGTGAAGG TCATCGTATG GGGGTTTTGG TGGTCAACTT TTTAGCTCAT GATCTTCTAG CAAAACTCGA TAATTTGAAA AAGGAATCAC AGCAACAACT CTGGATTGTC AATCAGCAAT TTGACTGGGT GTTGGCACCG CCAGATCAAA TGATTCTTGG TGAGCAACAA GGCTTTCATC AGGATGATTT GTTTGCAAAA TACCCTGAAC TTGCGAAAGT TCTAAGTCTT GAAGATTCCC CGCCTTCTTT TTGGCACGAA GCGGGGCGTT TACTTTATGT CCATGCAATT AAGTTGTTCG ACTCTCCTGA AATGATGCAA AACGAAGTCA TCAAAAATCG CTCCGGCATG TTTTATATCA TTTCTGAAAT GCCGGATTTG CCAGTTTGGT ATGCTCAATT ATTAGATGAT GACCGTTTAA AACAATGGAC GATTCAACTC AGTCTTTTAA TTTTTATGTT TGCATTGATG TTGGGATTTT ATGCCAACAA AAGTGCCTAC TTACAACGCC ATAATTTTTA TCAAAAAAAA TTGTTTGATA ATTTTTTTAA CCGATCTCCC AACGGATTGT TTTTATGCGA TCAAGACGGA GCGATTGTGT TTCAAAATGA AGCCGGTCAA ATTTTGATGT CAGAACTTGC GTTAAACAAA GTCTATACTG GGCTACAATT TTTTCATAAA TCATCGCGTC GTATTTTATG GCAGCAACTT GCCGATGGAA ACCCACCTGT TCAGAACGAA TTAACGATGA TGGTGGACCG CAAAAAACGC TGTTTCAGAG TGCAATTCTT TTTGATGAAT GCGGATGTGT TAGAGGCGCC ACTTTTGGCA GTGGTTTTTT ATGAGATTAC ACAACTTGTA GATGCACAAC AAAAAATTAA AGACAGTGAA GGACAGATTC GGACACTATT AGACAGCGCA CCTGATGCAA TTTTACTGTC GGATATAAGC GGCACCATAT ACATGGCCAA CAAAAAAGCT CAACAGCTGT TTGATATGAC GTCAGAAGAG TTTTTAAATG CGACGATTGA ATCACTGGTT CCGATAGAAC TTAGAGAGCA TCATGCGGTT TTACGAGAAA AATACGCTGA AAATCCTCAA GAACGGGTGA TGTCAATGGG CACCGACTTT AAGGCACAAA AAGCCAATGG TGCAACGTTT GATGCTGAAA TTAGCCTCAG TCCGATTACG ATCAAAGGAA AGCAGCATGT CATCAGCATT ATTCGAGATA TTACAGAGCG TAAAAAACTG GAAAACGATG TGCGGCAGTC ACAAAAAATG GATGCGCTTG GAAAGCTGAC CGGTAATTTA GCGCATGACT TCAATAACTT TCTAACCACA ATTATCGGAA ACCTCGATTT ATCCAAATTA TTGTTGGAAC AGCCCGACAT TGACAAATTA AAGCTTGAAG AGAAATTAAA CGCGGCGGTA TCGGCGTCTG AAAAAGCTTC GAAGTTAACT CGGCGCTTAT TAACCTTTTC CCGACAACAA CCTGTTTCGG AAAACTGCGT GTTGTTATTA CCATTCCTGG AAGAGGAATC TCGTATCTTG GCAGCCGCTT CAGGAAAGCT AGTAGAGTTT AAGATTTGTC CAAGAAAGTT TGCTTGGCCC ATCATGGTTA ACCGTGATGA ATTGATGACG GCAATGCTTA ACCTGCTGAC AAATGCGAAG GATGCCATGC CTCAAGGAGG GAGCGTTTTT ATTGATGTGG AAAACTTTAT GCTGGAGGAC AAAGGCATTG ATGTTTTAGG GGGTGAAATT CCTGTAGGCG ATTATGTCAT TTTATCGGTA TCGGATACCG GAACTGGGAT TGAATCCAAA AATTTGAATA ATATTTTTGA ACCCTTTTTC ACCACCAAAC CTAAAAATAA AGGCACCGGC TTTGGCTTGG CACAAGTCTT CAGTTTCATG AAACAATCCC AGGGCTTTAT TAAGTTGTAT TCCGAAGAGG GGCTTGGCAC CACTTTTCAA TTGTTTTTCC CGCGAAATGA AAGTGAAGAA TGCATGAAAT TAATGGCCGG GGTTCAGCCT GAAAAGCCCG ATTTTTCTCC GGCCGACATG GCAGTCGACG ATTTGTTTGA CGCATCACAA TTCTGTATTT TGGTGGTCGA TGATGATGTG AGTGTTCGTG CATTGGCCGT TGAATATTTA GAAAGGGCCG GTTATAACGT CGTTATGGCT TACAGTGCCG ACAACGCCTT GTCGGTGTTA AAGAAGCATT CAGTGGATTT GATGTTGACC GATGTGGTGA TGCCGGAAAA AAATGGATTT GAACTGGCGA ACATTGTTGA ATCTGAATAC CCCGATGTGG ATATTGTTTT CAGTTCAGGG TTTCCTAAAG ATATCTTGAA TCAGGCCCGA TTACTTAAGA ATAAGATGAT TTTACTTGAT AAACCTTATC GAAAAGCTCA TTTGCTAAGC ATTATACAAA GCCGTTTGAT GTTACGTAAA ACCACAGCTA GCTCGGACAA TGAATAA
|
Protein sequence | MPLRLYKESV VTHWKNGGKV FALTFLLGIC LVFFFIYIYL LVDAEGKKER IQLHNEHLSL AADTLINQWV SGMAQNVLFL AEETTSLTQN TSHINLTPLR NLYFNFINHQ KVLGQIRYIN NAGQEMIRFN QTPAGLREVM SENLQDKSSR YYFKDAVDLK DGQLAISKLD LNIENNRVEI PIKPTLRIST PIFDSEGHRM GVLVVNFLAH DLLAKLDNLK KESQQQLWIV NQQFDWVLAP PDQMILGEQQ GFHQDDLFAK YPELAKVLSL EDSPPSFWHE AGRLLYVHAI KLFDSPEMMQ NEVIKNRSGM FYIISEMPDL PVWYAQLLDD DRLKQWTIQL SLLIFMFALM LGFYANKSAY LQRHNFYQKK LFDNFFNRSP NGLFLCDQDG AIVFQNEAGQ ILMSELALNK VYTGLQFFHK SSRRILWQQL ADGNPPVQNE LTMMVDRKKR CFRVQFFLMN ADVLEAPLLA VVFYEITQLV DAQQKIKDSE GQIRTLLDSA PDAILLSDIS GTIYMANKKA QQLFDMTSEE FLNATIESLV PIELREHHAV LREKYAENPQ ERVMSMGTDF KAQKANGATF DAEISLSPIT IKGKQHVISI IRDITERKKL ENDVRQSQKM DALGKLTGNL AHDFNNFLTT IIGNLDLSKL LLEQPDIDKL KLEEKLNAAV SASEKASKLT RRLLTFSRQQ PVSENCVLLL PFLEEESRIL AAASGKLVEF KICPRKFAWP IMVNRDELMT AMLNLLTNAK DAMPQGGSVF IDVENFMLED KGIDVLGGEI PVGDYVILSV SDTGTGIESK NLNNIFEPFF TTKPKNKGTG FGLAQVFSFM KQSQGFIKLY SEEGLGTTFQ LFFPRNESEE CMKLMAGVQP EKPDFSPADM AVDDLFDASQ FCILVVDDDV SVRALAVEYL ERAGYNVVMA YSADNALSVL KKHSVDLMLT DVVMPEKNGF ELANIVESEY PDVDIVFSSG FPKDILNQAR LLKNKMILLD KPYRKAHLLS IIQSRLMLRK TTASSDNE
|
| |