Gene Hhal_1125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1125 
Symbol 
ID4710089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1222069 
End bp1223667 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content69% 
IMG OID639855597 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001002703 
Protein GI121997916 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGAGTCA ATGACCCGGT CACGCAGCGA CGTGTTGCCG TCTGGGATGG CGCCAACATC 
CTGTCGACTA CCGACTCCCG CGGACGGATC CGCTATATCA ACGAGGACTT TGTGCGCATC
AGCGGCTACC AGCCGGAAGA GCTGATCGGG CAGCCGCACA ACGTCATCCG CCACCCGGAC
ATGCCCCGGG TGGTCTTCGA GCACATGTGG CAACGCCTGC AGGCCGGGCA GCCGTGGATG
GGAATCATCA AGAATCGCTG CAAGAACGGC GATCACTACT GGGTTCACGC CTACGCAACG
GCCATCCGCG ACGACGCGGG CAACATCACC GAGATCCAGT CGGTACGTCA GCAGATCAAT
GACGAGGCGG TGGTCGCCCG AGCCGAGCGC GCTTACAAGC GCCTGCGCGC CGCCGAACCG
GACAAGGGGG CACTGCCCGC CGGACTGATC GGCCGGCGGG CAGTGGGGAG CGGCGTGTGG
CTCGCCGCCG GGGGGACGGC CGCACTACTC GGCGTCATCC TGGCCGCGCT GCTGCCTATC
GGCACCGGGC TCCAGCTGCT GGTCGGCATC GCCGGCGTCT CCGCCTTCGG CGCAGCCAGC
CTGCCCATGC TGCGCCAACT CCGCGGTGCG CGCGACCAGG CCCGCGCCTT GCTCGACGAC
CCACTCAGCG AGGAGATCTA CCTCGGGCGC CGCGACCACG GCGCCTCGAT CCAGCTGGCC
CTGATCCACC AGGCCTCCGA GACCCAGGCC ATCGCCAAAC GCCTGGGTGA CGACGCCCGT
CAGCTGTCCC AGGAGGCCGC CGGGGCCCGC CAGTCGATGC AGGCGGTACG CGACGAGGCG
CAGCAGCAGA GCGACGAGAC CCGCAGCGTG GCCACCGCCA TGGAACAGAT GAGCTCCACG
GTGCAGGAAG TGGCCCAGAA CGCCTCGGCC ACCGCCGATG CCACCGAGCG GGCTGGCAAG
CAGACCGACC GGGGTCGGCA GACCGTCGAG CAGAGCACCG CTGCGGTGCG CTCACTGGTC
CAGGGCATCG AGAACGCCGC GACCACCATC GAGCGGGTCA ACGGCGAGGC CGAGCGCATC
GGCAAGGCGG CCACGCTCAT CGGCAAGATC ACCAAGCAGA CCCACCTTCT GGCCCTCAAC
GCCTCGGTCG AGTCGGCGCG GGCTGGCGAG GCGGGACGCA GCTTCACCGT CGTGGCCGAG
GAGGTGCGCA AGCTGGCCGG GCAGACCGCG GAATCGACCC GCGAGATCGA TGCCATCATC
GAGTCGCTAC AGAGCGGCTC AGCGGAGGCC GTCGAAGCCA TGCGCGAGAG CCGCAACCGT
GCCGAGCAGA CCCTCGCCCA CGCCGACGAG TCCAGCCAGT CGCTGCAGGA GATCCAGGCC
GCGGTGGACG AGATCCGCGA TATGGCCGGC CAGATCGCCA CCGCCACCGA GCAGCAGGGG
GCCACCTCGC AGGAGATCGC GCGCAGCGTC TCCAGCATCG AGGGGGTGGC CGAACGGGTG
ACCAGCGAGT CGTTGCAAAC CGACCAGCGC CTACAGGCGG TCATCGAGCG CATCGCCGGC
ATCGAGGCCC TGACCGGTCG ATTCGTGCGT CGCCGCTGA
 
Protein sequence
MRVNDPVTQR RVAVWDGANI LSTTDSRGRI RYINEDFVRI SGYQPEELIG QPHNVIRHPD 
MPRVVFEHMW QRLQAGQPWM GIIKNRCKNG DHYWVHAYAT AIRDDAGNIT EIQSVRQQIN
DEAVVARAER AYKRLRAAEP DKGALPAGLI GRRAVGSGVW LAAGGTAALL GVILAALLPI
GTGLQLLVGI AGVSAFGAAS LPMLRQLRGA RDQARALLDD PLSEEIYLGR RDHGASIQLA
LIHQASETQA IAKRLGDDAR QLSQEAAGAR QSMQAVRDEA QQQSDETRSV ATAMEQMSST
VQEVAQNASA TADATERAGK QTDRGRQTVE QSTAAVRSLV QGIENAATTI ERVNGEAERI
GKAATLIGKI TKQTHLLALN ASVESARAGE AGRSFTVVAE EVRKLAGQTA ESTREIDAII
ESLQSGSAEA VEAMRESRNR AEQTLAHADE SSQSLQEIQA AVDEIRDMAG QIATATEQQG
ATSQEIARSV SSIEGVAERV TSESLQTDQR LQAVIERIAG IEALTGRFVR RR