Gene Tcr_0226 details


Gene Information

Locus tag: Tcr_0226
Symbol:
ID: 3761626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiomicrospira crunogena XCL-2
Kingdom: Bacteria
Replicon accession: NC_007520
Strand:
Start bp: 268014
End bp: 271070
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 41%
IMG OID: 637784931
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_390496
Protein GI: 78484571
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
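
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record; coordinates are assumed to be 1-based and inclusive.
start_bp = 268014
end_bp = 271070

# An inclusive range covers end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# A coding sequence should split into whole codons (remainder 0).
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and does not add a residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 3057 0 1018
```

Both results agree with the Gene Length and Protein Length fields.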


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCTTAA GGCTATATAA AGAATCTGTT GTTACTCATT GGAAGAATGG TGGAAAGGTT 
TTTGCGTTAA CCTTTTTGCT CGGCATCTGT TTGGTATTCT TTTTTATTTA TATTTATTTG
CTGGTTGATG CCGAAGGCAA AAAAGAGCGC ATCCAACTTC ACAATGAACA CCTGTCTTTA
GCGGCCGATA CTCTCATTAA TCAGTGGGTT AGCGGTATGG CTCAAAATGT TCTGTTTCTG
GCGGAAGAAA CCACCAGCCT GACTCAGAAT ACCTCTCACA TTAACCTGAC ACCCTTAAGA
AATCTCTACT TTAACTTTAT CAATCACCAG AAAGTACTGG GACAGATTCG ATATATCAAT
AATGCGGGTC AGGAAATGAT TCGTTTTAAC CAGACGCCTG CGGGGTTAAG GGAAGTAATG
TCTGAAAATC TGCAGGATAA GTCATCGCGT TACTATTTCA AAGATGCGGT GGATCTTAAA
GATGGGCAGT TGGCCATTTC GAAGTTGGAT TTAAATATTG AAAATAATAG AGTTGAGATT
CCTATAAAAC CGACCTTAAG AATTTCAACG CCCATTTTTG ACAGTGAAGG TCATCGTATG
GGGGTTTTGG TGGTCAACTT TTTAGCTCAT GATCTTCTAG CAAAACTCGA TAATTTGAAA
AAGGAATCAC AGCAACAACT CTGGATTGTC AATCAGCAAT TTGACTGGGT GTTGGCACCG
CCAGATCAAA TGATTCTTGG TGAGCAACAA GGCTTTCATC AGGATGATTT GTTTGCAAAA
TACCCTGAAC TTGCGAAAGT TCTAAGTCTT GAAGATTCCC CGCCTTCTTT TTGGCACGAA
GCGGGGCGTT TACTTTATGT CCATGCAATT AAGTTGTTCG ACTCTCCTGA AATGATGCAA
AACGAAGTCA TCAAAAATCG CTCCGGCATG TTTTATATCA TTTCTGAAAT GCCGGATTTG
CCAGTTTGGT ATGCTCAATT ATTAGATGAT GACCGTTTAA AACAATGGAC GATTCAACTC
AGTCTTTTAA TTTTTATGTT TGCATTGATG TTGGGATTTT ATGCCAACAA AAGTGCCTAC
TTACAACGCC ATAATTTTTA TCAAAAAAAA TTGTTTGATA ATTTTTTTAA CCGATCTCCC
AACGGATTGT TTTTATGCGA TCAAGACGGA GCGATTGTGT TTCAAAATGA AGCCGGTCAA
ATTTTGATGT CAGAACTTGC GTTAAACAAA GTCTATACTG GGCTACAATT TTTTCATAAA
TCATCGCGTC GTATTTTATG GCAGCAACTT GCCGATGGAA ACCCACCTGT TCAGAACGAA
TTAACGATGA TGGTGGACCG CAAAAAACGC TGTTTCAGAG TGCAATTCTT TTTGATGAAT
GCGGATGTGT TAGAGGCGCC ACTTTTGGCA GTGGTTTTTT ATGAGATTAC ACAACTTGTA
GATGCACAAC AAAAAATTAA AGACAGTGAA GGACAGATTC GGACACTATT AGACAGCGCA
CCTGATGCAA TTTTACTGTC GGATATAAGC GGCACCATAT ACATGGCCAA CAAAAAAGCT
CAACAGCTGT TTGATATGAC GTCAGAAGAG TTTTTAAATG CGACGATTGA ATCACTGGTT
CCGATAGAAC TTAGAGAGCA TCATGCGGTT TTACGAGAAA AATACGCTGA AAATCCTCAA
GAACGGGTGA TGTCAATGGG CACCGACTTT AAGGCACAAA AAGCCAATGG TGCAACGTTT
GATGCTGAAA TTAGCCTCAG TCCGATTACG ATCAAAGGAA AGCAGCATGT CATCAGCATT
ATTCGAGATA TTACAGAGCG TAAAAAACTG GAAAACGATG TGCGGCAGTC ACAAAAAATG
GATGCGCTTG GAAAGCTGAC CGGTAATTTA GCGCATGACT TCAATAACTT TCTAACCACA
ATTATCGGAA ACCTCGATTT ATCCAAATTA TTGTTGGAAC AGCCCGACAT TGACAAATTA
AAGCTTGAAG AGAAATTAAA CGCGGCGGTA TCGGCGTCTG AAAAAGCTTC GAAGTTAACT
CGGCGCTTAT TAACCTTTTC CCGACAACAA CCTGTTTCGG AAAACTGCGT GTTGTTATTA
CCATTCCTGG AAGAGGAATC TCGTATCTTG GCAGCCGCTT CAGGAAAGCT AGTAGAGTTT
AAGATTTGTC CAAGAAAGTT TGCTTGGCCC ATCATGGTTA ACCGTGATGA ATTGATGACG
GCAATGCTTA ACCTGCTGAC AAATGCGAAG GATGCCATGC CTCAAGGAGG GAGCGTTTTT
ATTGATGTGG AAAACTTTAT GCTGGAGGAC AAAGGCATTG ATGTTTTAGG GGGTGAAATT
CCTGTAGGCG ATTATGTCAT TTTATCGGTA TCGGATACCG GAACTGGGAT TGAATCCAAA
AATTTGAATA ATATTTTTGA ACCCTTTTTC ACCACCAAAC CTAAAAATAA AGGCACCGGC
TTTGGCTTGG CACAAGTCTT CAGTTTCATG AAACAATCCC AGGGCTTTAT TAAGTTGTAT
TCCGAAGAGG GGCTTGGCAC CACTTTTCAA TTGTTTTTCC CGCGAAATGA AAGTGAAGAA
TGCATGAAAT TAATGGCCGG GGTTCAGCCT GAAAAGCCCG ATTTTTCTCC GGCCGACATG
GCAGTCGACG ATTTGTTTGA CGCATCACAA TTCTGTATTT TGGTGGTCGA TGATGATGTG
AGTGTTCGTG CATTGGCCGT TGAATATTTA GAAAGGGCCG GTTATAACGT CGTTATGGCT
TACAGTGCCG ACAACGCCTT GTCGGTGTTA AAGAAGCATT CAGTGGATTT GATGTTGACC
GATGTGGTGA TGCCGGAAAA AAATGGATTT GAACTGGCGA ACATTGTTGA ATCTGAATAC
CCCGATGTGG ATATTGTTTT CAGTTCAGGG TTTCCTAAAG ATATCTTGAA TCAGGCCCGA
TTACTTAAGA ATAAGATGAT TTTACTTGAT AAACCTTATC GAAAAGCTCA TTTGCTAAGC
ATTATACAAA GCCGTTTGAT GTTACGTAAA ACCACAGCTA GCTCGGACAA TGAATAA
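
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that groups the sequence into blocks and rows."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCCTTAA GGCTATATAA AGAATCTGTT GTTACTCATT GGAAGAATGG TGGAAAGGTT"
seq = clean_sequence(first_row)

print(len(seq))                     # number of bases in this row
print(f"{gc_fraction(seq):.1%}")    # GC share of this row only
```

Passing the full gene sequence to the same two functions would give the GC fraction for the whole gene, which can then be compared with the GC content field.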
 
Protein sequence
MPLRLYKESV VTHWKNGGKV FALTFLLGIC LVFFFIYIYL LVDAEGKKER IQLHNEHLSL 
AADTLINQWV SGMAQNVLFL AEETTSLTQN TSHINLTPLR NLYFNFINHQ KVLGQIRYIN
NAGQEMIRFN QTPAGLREVM SENLQDKSSR YYFKDAVDLK DGQLAISKLD LNIENNRVEI
PIKPTLRIST PIFDSEGHRM GVLVVNFLAH DLLAKLDNLK KESQQQLWIV NQQFDWVLAP
PDQMILGEQQ GFHQDDLFAK YPELAKVLSL EDSPPSFWHE AGRLLYVHAI KLFDSPEMMQ
NEVIKNRSGM FYIISEMPDL PVWYAQLLDD DRLKQWTIQL SLLIFMFALM LGFYANKSAY
LQRHNFYQKK LFDNFFNRSP NGLFLCDQDG AIVFQNEAGQ ILMSELALNK VYTGLQFFHK
SSRRILWQQL ADGNPPVQNE LTMMVDRKKR CFRVQFFLMN ADVLEAPLLA VVFYEITQLV
DAQQKIKDSE GQIRTLLDSA PDAILLSDIS GTIYMANKKA QQLFDMTSEE FLNATIESLV
PIELREHHAV LREKYAENPQ ERVMSMGTDF KAQKANGATF DAEISLSPIT IKGKQHVISI
IRDITERKKL ENDVRQSQKM DALGKLTGNL AHDFNNFLTT IIGNLDLSKL LLEQPDIDKL
KLEEKLNAAV SASEKASKLT RRLLTFSRQQ PVSENCVLLL PFLEEESRIL AAASGKLVEF
KICPRKFAWP IMVNRDELMT AMLNLLTNAK DAMPQGGSVF IDVENFMLED KGIDVLGGEI
PVGDYVILSV SDTGTGIESK NLNNIFEPFF TTKPKNKGTG FGLAQVFSFM KQSQGFIKLY
SEEGLGTTFQ LFFPRNESEE CMKLMAGVQP EKPDFSPADM AVDDLFDASQ FCILVVDDDV
SVRALAVEYL ERAGYNVVMA YSADNALSVL KKHSVDLMLT DVVMPEKNGF ELANIVESEY
PDVDIVFSSG FPKDILNQAR LLKNKMILLD KPYRKAHLLS IIQSRLMLRK TTASSDNE
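
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator. It assumes the standard codon assignments, which translation table 11 shares with the standard code for sense and stop codons (the two tables differ in their permitted start codons). The function name `translate` is illustrative, and only the first and last rows of the gene sequence are used.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block and row separators
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCCCTTAA GGCTATATAA AGAATCTGTT GTTACTCATT GGAAGAATGG TGGAAAGGTT"
last_row = "ATTATACAAA GCCGTTTGAT GTTACGTAAA ACCACAGCTA GCTCGGACAA TGAATAA"

print(translate(first_row))  # prints: MPLRLYKESVVTHWKNGGKV
print(translate(last_row))   # prints: IIQSRLMLRKTTASSDNE
```

The two outputs match the first 20 and last 18 residues of the protein sequence above. The last row translates independently because every full row holds 60 bases, a multiple of three, so each row starts in frame.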