Gene Hore_21590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_21590 
Symbol 
ID7313706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2349741 
End bp2351243 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content36% 
IMG OID643612611 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_002509899 
Protein GI220932991 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGTTC CTGTTTTTGA TTATAGTGAT CAGTATACTG GGGTAATCTG TGCTGATATA 
TCCATTGAAT ATTTAAAAGA TATTATTAAA TGGAAACTGG GAGAAACCGG TTATGTTTAT
ATGATTGATA AGAATGGCAA CATTATAGCA CATCCCGAAC ATGATGACAA CAATAAAAAG
ATAAATATGG GTAAATATTT TAATATTAAC CATATTTATG AATCAAAGAA GGGAGATTTC
AATTATTCAG TTGATAATGA AAAATTACTG GTTTCCTATG TAACCCTTGA TAAGGTTGGA
GCATTGCTGG CCCAGATTCC TGAAAAAGAA GCCTATATAG TTCAGGATTT GATTAAAAAT
AAAATAATTA AAGTAGGAAT TTCTATACTA GTTGTGGTTA TGGTCACCAT ATTTATACTG
GTTAGTTTTT ATTTAATCAG GCCGGTTTTA AAAATAAAAG ATGAAATGCA AAAGGTATCC
CGGGGCAACC TGAATGTTGA ATTAACAATT AATCATAAAG ATGAGCTGGG GATTCTGGCC
GGGGCCTTTA AAAAGATGGT TGGCCAGATG AGACATATCA TACAAAGTAT TGATGATACT
GCCAGGCAGG TTGAGAGTGC TTCCCAAGAT ATGAAAGAAT CATCTAATAT GATCAGTCAG
GTTTCTGAAC AGGTGGCTTC TTCTATTCAG GAAGTATCAA GTGGAGCCTA TGAGCAGGCC
AATAATGTTG AAGAAGTTGA AGAAAAAATA AAAAATCTCT CAGAGCAAAT GGAAGAACTA
GCTACTACAA ATAAACTGGT GGAAGATTTA TCATCTGAAA TGGATATTGC CAGCATTCAG
GGACAGGAAG AAATGACTAA AGTAAAAGAA CAAATGGTTA ATATTAATGA TTCAATCAGT
GAAGTAGCTA TAAAAATAAA AAACCTGGAG CAGATATCCA GTGAAATAGA TTCTATTCTT
GAAATTATTA ATGGAATTGC TGAACAGACC AACCTGCTGG CATTAAATGC AGCCATAGAA
GCAGCCCGGG CTGGTGAGAC CGGGAGGGGA TTCAGTGTTG TTGCTGAAGA GATCAGGGAA
TTATCCGAAG AGTCAGCCCG TTCTTCGAGT AAAATAAGGA AATTAATTAC CGATATCCAC
CGGGAAACCG ATGAAGTCTC CCGGAAGATG AAAGAAGGGA CCCGGCAGAT AAAATATGGA
GAAGAAGTTG TCCAGTCAGC CAACCAGGCC TTTATTAGAA TTAGAGAATC AATTGAGGAA
GTTACAGGAG GTATCAAACA TTCCAGTACA AATGTCCAGA CAGCAAAGGT CCAGAGTGAA
GAGATTAGTA GACATATTAA AAGGATAGCG GATATTTCGG AAGAATTTTC TGCCAGTGCA
GAAGAAGTTG CTGCTGCCAG TGAAGAGCAA ACAGGCTCAA TTGAGGAAAT AAACAACAGA
TCAAGGAAAT TGTTTCAAAT GACCGAGGAA TTAAATAAAC TTATCAGGAA TTTTCAAATT
TAA
 
Protein sequence
MAVPVFDYSD QYTGVICADI SIEYLKDIIK WKLGETGYVY MIDKNGNIIA HPEHDDNNKK 
INMGKYFNIN HIYESKKGDF NYSVDNEKLL VSYVTLDKVG ALLAQIPEKE AYIVQDLIKN
KIIKVGISIL VVVMVTIFIL VSFYLIRPVL KIKDEMQKVS RGNLNVELTI NHKDELGILA
GAFKKMVGQM RHIIQSIDDT ARQVESASQD MKESSNMISQ VSEQVASSIQ EVSSGAYEQA
NNVEEVEEKI KNLSEQMEEL ATTNKLVEDL SSEMDIASIQ GQEEMTKVKE QMVNINDSIS
EVAIKIKNLE QISSEIDSIL EIINGIAEQT NLLALNAAIE AARAGETGRG FSVVAEEIRE
LSEESARSSS KIRKLITDIH RETDEVSRKM KEGTRQIKYG EEVVQSANQA FIRIRESIEE
VTGGIKHSST NVQTAKVQSE EISRHIKRIA DISEEFSASA EEVAAASEEQ TGSIEEINNR
SRKLFQMTEE LNKLIRNFQI