Gene Hhal_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2159 
Symbol 
ID4709811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2371281 
End bp2372525 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content65% 
IMG OID639856634 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001003725 
Protein GI121998938 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGAGG TTGCCGAGGA TCGCGACGGC GAGTCCATCG TCTACCACCC GAAGACGGCG 
TGGCTGCGCC ACTATGGTGC CCCGTCGGTG TTGACCGCGG TGGCGATGGC GATCGCCTAC
GCCTGGCCGG CGACAGGACC GCTGGCGGTG GTGCTCGGCG CCAGCGGTTG GGCCGCGGTC
GGCTATCTGC GCCGACGCGA GACGGGGTGG AACGAGGCGA CGGATACCGT GGAGGATCTG
GAAGACGCCC TGCGCGATCT GCTCCACGAC ATCGACGATA GTCTCAACGC CGAGTTCCGT
ACGGTGAACA GTGACCTGGA GCAGATCCGC GGGCTCGTCG GCGACGCCGT GCAGTCGCTC
AATCAGAGCT TCAACGGCAT GGATCAGGCC ACGGATGAAC AGGAACGCCT GGCTCGGGCG
GTGATCGAGC AGACCGGCGG GGATTCGGCC GTGGAACAGT TCGGCATCGC CGAGTTCGTC
CACGAGACCG AATCCTTCCT GAACAACTAC GTCGAGATGG TCGTCGACAT GAGTCGGCGC
AGCGTGAAGA CCGTCGAGCG CATCGACGAT ATGGTCCACC AGATGGACCG GATCCACAAG
CTGCTCGCCG ACCTCAAAGG GATCGCCAGC CAGACTGACC TGCTCGCTCT CAATGCCAGC
ATCGAGGCGG CCCGCGCCGG TGAGTCCGGC CGCGGTTTCG CGGTGGTGGC CGAGGAGGTC
CGCAAGCTCT CCGAGAAGGC CAACCAGTTC AACGAGCAGA TCGCTCAGGA GGTCAAGACC
ATCAGTAACC TGGTGGACGA GGCGCGCACC GAGGTCGGCG AGATGGCCTC CAACGACATG
AACGTGACCC TCACCACCAA GGAGCAGATC TCGGGGATGA TGAAGAGTCT CCAGGACGTG
GATCAGCAGG TGGAGCAGCA GGTCAAGCGC ATCTCTGAGG TCAGTGGACA GATCGATCAC
CACGTGGCCG ACGCCGTTCG TGCCCTGCAG TTCGAGGACA TTGTGACGCA GTTGGTGGAT
GGCTCGCGCG CCGGGGTCGA GGGTTTGGAT GACTACCTCG ATGGCGTGCG CAATGTCCTG
CAGGCCATCG CCGAGGAGGA CGTCCACGGC AGCCAGTACG CGGCGCGTCT ACGCGAGGCC
CGCGAGCGGC TGGCGCAGCA GCGTCAGGAG CGCGAGACGG CGCGGGCCAG CCAGCGCAAG
GTGGAGCAAC ACTCCATGGA TCACGGCGAC GTGGAGCTAT TCTGA
 
Protein sequence
MVEVAEDRDG ESIVYHPKTA WLRHYGAPSV LTAVAMAIAY AWPATGPLAV VLGASGWAAV 
GYLRRRETGW NEATDTVEDL EDALRDLLHD IDDSLNAEFR TVNSDLEQIR GLVGDAVQSL
NQSFNGMDQA TDEQERLARA VIEQTGGDSA VEQFGIAEFV HETESFLNNY VEMVVDMSRR
SVKTVERIDD MVHQMDRIHK LLADLKGIAS QTDLLALNAS IEAARAGESG RGFAVVAEEV
RKLSEKANQF NEQIAQEVKT ISNLVDEART EVGEMASNDM NVTLTTKEQI SGMMKSLQDV
DQQVEQQVKR ISEVSGQIDH HVADAVRALQ FEDIVTQLVD GSRAGVEGLD DYLDGVRNVL
QAIAEEDVHG SQYAARLREA RERLAQQRQE RETARASQRK VEQHSMDHGD VELF