Gene Hore_21850 details


Gene Information

Locus tag: Hore_21850
Symbol:
ID: 7313733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2376050
End bp: 2377504
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 36%
IMG OID: 643612638
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_002509926
Protein GI: 220933018
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
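
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(2376050, 2377504)
print(gene_len, protein_len)  # 1455 484
```

The result matches the listed gene length (1455 bp) and protein length (484 aa).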


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value: 0.925753
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAGA AAAACGAGTT ATACCAACAA AATCTTTTGT TATTAAAAAT CATGTGGTTA 
ATAACCGTAA TTGGAGTCGT GTTTTTACTT TATAGAAGTG CTCCTGTGAT TATGATCACT
GGTTTTACCA TAACATGCCT GATTGTAAAT ATATTACTAA CATTTTTAAT GTGGAAAAAT
ATCCTTACCC AGTACATACA GTATTTCATT ATTGTCGGAT TATCGGTTCT AGCATGGTTA
TTAATTTCCA CTTCTAATAG TTTTTATAGT TACCTGATTT TATTTGCCAA TCTGGTGTTG
ATTGCCCTTT ATACTAACTA CTGGCCAACT TTGATTATGG GGATAAGCAA TTTGATTATT
GTTAATTCTC TTCTGGGAGG GATAGGCAGG AACAATCTTT TAATTATTGA TTTATATCTG
GTTTTAATAA CTGTCCTGGT ATTGACCCAG ACCCATCTCC GGATTAAAAT TCAGAGAAGA
ATAGATGAAA AACATCAGCA GGTCATCAAG ACTGAAAAGG AAAAAGAATC TCTCTTAGAG
GAAATAATGA CTACTGTTAA AGAGTTGAGC AACTTTAACT CCAGTTTGAA GGAAAACATT
AAGGTAACAG ACCGTATTTC CAGTGAAGTT ACTAAAGCTT TTACTGATAT TGCAAAAGGT
ATTGAAGATC AGGCCCATAG TGTAAATGAA ATAAATGAAT CAGCCCGGTC CAGCAACCAT
AATGTCCGGG CCCTTGCCAG TGCTTCAAAT GAAATGCTTA ATTTATCTAA TAAAACAGTG
AATGCTACTT CAAAGGGGAA TGAACAGGTA GACCTTCTTG ATACCAAGAT GCAGGATGTT
GATCAGATTA TAAATGAGAG TGTTGACCTT ATTAATAGAT TAAATGAACA GGCCCAGCAA
ATAGGTGATA TTGTAGAAAC CATTAATAAT ATAGCCAAAC AGACTAATCT TCTGGCTTTA
AACGCAGCTA TAGAGGCAGC CCGGGCCGGG GAAGCAGGTC AGGGTTTTGC TGTGGTGGCC
GACGAAATAC GTGAACTGGC TGAAGATTCT CACCGTTCTA CTGATAAAAT TGCCTCTATA
CTGCAGGATA TTAAAGCCAA AACAGGACAG GTAACTGAGC AGATAAATAT CGGTCATGAT
GCTGTTTTAT CAAGCAAGAA CTCTACTGAA GAAGTAAAAA GAGTATTTAA AGAGATTGAT
GATAATACTA AAAATGTAGT AGAACAGGCT CAATACCTGG AAGAATCAGT AAGGGGCCTG
GAGAAGGAAT CACAGGCAAT TGTTGAAGAA ATATCTTCTA TTTCCAGTAT TATCGAGGAA
ACATCTGCTT CTGTTGAAGA AGTACTGGCC GGGAGTGAAG AACAGACAAA TAAAGTTAAA
AATATAGTAG AAAGCTTTAA ACAACTGGAT AAACTAATTA ATAAGTTACA GAGTTTAACT
AAAGGAGGTA GCTAA
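
The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation such as the GC content reported in the gene information. The following is a minimal sketch, assuming the sequence has been pasted into a string. The helper names (`clean_sequence`, `gc_percent`, `looks_like_cds`) are hypothetical, and the stop codons checked are those of translation table 11 (TAA, TAG, TGA). The ATG start check is a simplification, since table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough sanity check: whole codons, ATG start, recognised stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input for illustration; paste the full listing to check the real gene.
toy = clean_sequence("ATGGCC TAA\n")
print(toy, looks_like_cds(toy))
```

Applied to the full listing, `gc_percent` gives a value to compare against the GC content field, and `len(clean_sequence(...))` gives one to compare against the stated gene length.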
 
Protein sequence
MDKKNELYQQ NLLLLKIMWL ITVIGVVFLL YRSAPVIMIT GFTITCLIVN ILLTFLMWKN 
ILTQYIQYFI IVGLSVLAWL LISTSNSFYS YLILFANLVL IALYTNYWPT LIMGISNLII
VNSLLGGIGR NNLLIIDLYL VLITVLVLTQ THLRIKIQRR IDEKHQQVIK TEKEKESLLE
EIMTTVKELS NFNSSLKENI KVTDRISSEV TKAFTDIAKG IEDQAHSVNE INESARSSNH
NVRALASASN EMLNLSNKTV NATSKGNEQV DLLDTKMQDV DQIINESVDL INRLNEQAQQ
IGDIVETINN IAKQTNLLAL NAAIEAARAG EAGQGFAVVA DEIRELAEDS HRSTDKIASI
LQDIKAKTGQ VTEQINIGHD AVLSSKNSTE EVKRVFKEID DNTKNVVEQA QYLEESVRGL
EKESQAIVEE ISSISSIIEE TSASVEEVLA GSEEQTNKVK NIVESFKQLD KLINKLQSLT
KGGS