Gene Hhal_0167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0167 
Symbol 
ID4710833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp196185 
End bp197807 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content66% 
IMG OID639854625 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001001763 
Protein GI121996976 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCTA ACTGGCCCGT GCGCTACAAG CTCGGTATCT CCTTCGGCCT CATCGTGATC 
CTGATGGGGA CCAGCGGCCT GCTCTCTCAC TGGGCCCTGC AGGACACCGC GAACGAGAAC
CGGGACGCCA TGGCCTACAT GGAGCAGTCC GCCATGCTGG TCGAGCGTGA GGTCGAGCAC
CTAAGCTGGA CCAACGAGCT GGCGGACAGC TTTTTGCTCG AAGAGACCTT CACCGGGGAG
CTGGACTTCA CCTCATGCGC CTTCGGCGAG TGGTTCTACG ACTACCAGGG GTCGGAGCAC
TACGAATCGG CCAGTGAGGC GTTGCGCGAG GCCGTCGACG GCCTCGAACA ACCGCACATT
GCGCTGCACG AGGTCGGCGA GCGGATCGTC GAGCTGCAGG AAGCCGGGCG TTTCGACGAG
GCCGAGGCCT TCTACCATGA AGAGGCACAG CCCATTCGGC GGGTATTCCA GGATCAGCTC
GGCGAACTGC GCGAAATCCT CGAAGCCGAG CGCGACCGCC ACGCCAAGCA GGCGGCGGCG
CAGCAGACCC AGGCGGCACA GATCACGGTG GGTAGCCTGG CGATCACCGT GCTCTTTGCC
ATCGGCCTGG CGGCGCTGGT GTCCCGTCAC CTCACCGCCC GGCTGCGGAG TGCCGTCGAT
AGCCTGGAGG ACATCGCCGC CGGGGATGGC GACCTGACCC GGCGTCTGGA CGCCGAGGGC
CGCGACGAGA TCGCCGAGCT CAGCGCGGCC TATAACCGCT TCGTCGACAA GATCCACGAG
ATGGTGCGCG TCGTCCGCGA CTCCGCCACC CAGATGGCCT CGGCCACGGA ACAGCTGCAG
CGCAGCGCCG AGGAGGATCA ACAGGGGGTG CAGAACCAGC AAGAGCGGAC CTCCGAGGTG
GCCACGGCGA TGAACGAGAT GAGCGCCTCG ATCAACGAGA TCGCCCGGAA CACCCAGGAC
ACGGCGAACG CCGCCGGGGA TGCCAGCCAG CAGTCCCGAC AGGGTCAGGA GGGCGTCGGC
CACACCATCG AGACCATCCA CGACCTGGCG ACCAATGTCC GTGACTCCGC CGAGGCCATC
CGCGCCCTGG ATGAGCAGAG CGCACGCATC GGACAGGTCC TTGAGGTCAT CCGGGGGATC
GCCGACCAGA CCAACCTGCT CGCCCTCAAC GCAGCCATCG AGGCGGCACG GGCTGGCGAG
AGCGGACGCG GGTTCACCGT AGTCGCCGAG GAGGTGCGCA AGCTCGCGCA GAAGACCCAG
GACTCCATTG GCAGCATCCA GGAGATGGTC GAAGGCATCC AGAACGGCAC CCAGCAGGCG
GTGCAAGCTA TGGAGCGCAA CCGTCAGCAG GCCGAGGGGA CCGTGGAAGA GGCCTCCGCC
GCCGGGAACA CCCTCCAGGA GATCACGCGC GTGGTGACGC AGATCGAGGA CATGACCAAC
CAGGTCGCCA GCGCGGTGGA ACAGCAGTCC CAGGTGGCCG AAGAGGTCAA TCGCAACCTG
ACGCTGATCA GCGATACGGC CGAGCAAACG CGGAGTTCAG TGCAGGACAC CGGCCAGGTC
TCCCAGCGAC TTGCCCAGCT GGCCGAGGAC CTACGGGATC AGGTCGGGCG TTTCCGGGTG
TGA
 
Protein sequence
MLANWPVRYK LGISFGLIVI LMGTSGLLSH WALQDTANEN RDAMAYMEQS AMLVEREVEH 
LSWTNELADS FLLEETFTGE LDFTSCAFGE WFYDYQGSEH YESASEALRE AVDGLEQPHI
ALHEVGERIV ELQEAGRFDE AEAFYHEEAQ PIRRVFQDQL GELREILEAE RDRHAKQAAA
QQTQAAQITV GSLAITVLFA IGLAALVSRH LTARLRSAVD SLEDIAAGDG DLTRRLDAEG
RDEIAELSAA YNRFVDKIHE MVRVVRDSAT QMASATEQLQ RSAEEDQQGV QNQQERTSEV
ATAMNEMSAS INEIARNTQD TANAAGDASQ QSRQGQEGVG HTIETIHDLA TNVRDSAEAI
RALDEQSARI GQVLEVIRGI ADQTNLLALN AAIEAARAGE SGRGFTVVAE EVRKLAQKTQ
DSIGSIQEMV EGIQNGTQQA VQAMERNRQQ AEGTVEEASA AGNTLQEITR VVTQIEDMTN
QVASAVEQQS QVAEEVNRNL TLISDTAEQT RSSVQDTGQV SQRLAQLAED LRDQVGRFRV