Gene Hhal_1643 details

Gene Information

Locus tag: Hhal_1643
Symbol:
ID: 4709923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1793861
End bp: 1795762
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 67%
IMG OID: 639856108
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001003209
Protein GI: 121998422
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
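
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 1793861
END_BP = 1795762
GENE_LENGTH_BP = 1902
PROTEIN_LENGTH_AA = 633


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.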


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.231435
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGACCG AACAACAGAC CCTGCTCGAG GTCGCCCGGG CCACCACCGC CATCGGCGGC 
TGGATGCTCG ACCGCTCCAC CCAGGCGGTA CACCTCTCCA CAGAGACCCG GCGGCTACTT
GAACTACCCC CGGAGCACCG GATGGATCTG CGCACCGGAC TCGGCCTCTA CCCACCCGAG
TATCGGCCGA TCGTCACCCG CGCTATCCAC ACCACCCTCC ACCACGCCCG CCCCTTCCAG
CTCGACGTCG ACATCCACAC CGCCACCGGC AAACGGCTCC ACGTGCGCAA ACTCGGCAAG
CCGCTGATGG ACGACTACGG CAATGTCCTG GGTCTGCGCG GCGCGCTGCA GGACCTGACC
GAGATCAAGA CCGCCGAGCG CCAAGCCCGC CGCCTGTCCC AGCGCCTGGA GACCACCCTG
GAGAGCATCA CCGACGCGTT CTTCTTGGTC GACCCGGCCT GGCGTTTCAC GTTCCTCAAT
CGCGAGGCGG AGCGCCAGCT CAACTGTCAG CGCGAGGACG TGCTCGGCCG CGTGGCCTGG
GAGGTCTTCC CGGAGGCGCT GGGGACCGCC TTCGAGGAGC ACTACCGATA TGCCTTGGCG
CAGCAGGAGC CCGTGGTCTT CGAGGCGTAC TTCGCGCCCG TCGAGACCTG GTTCGAGGCC
CGGGCCTACC CATCGGACGA GGGACTTGCG GTCTACTTCC GCGACATCAG CCAACGCAAA
CAGGCGGAAG CGGAGATGGC CCAGCTGAAC GAAGAGCTAC ACCAGGCCCG GGAACGCGCC
GAACAGGCCA GCCGAGCCAA GTCGGAATTC CTCGCCGCGA TGAGCCACGA GCTGCGCACT
CCGCTGAACG CCATCACCGG CTTTGCGCAA CTGCTCCACC AAACCCCGCC CGACGAAACG
CAAGACCGCG ATCAGCACCT TGAGCAGATC CTCTCGGCCG GCTGGCACCT GCGGGACCTG
ATCGGCGACG TGATCGACTT CGCCCAGATC GAGACCGGTC AACTGGCGGT TCATGCGGAG
CCCATGGCCA TCGACGCCCT CGCCCGCAAC AGCGTGGCGA TGATCACCGA GCAGGCGCGT
CGGCGTGGTC TCACAGTGCA CTGCCAGAGC GACCCCATTG CGCCCCCTTG GGCCCGCGCC
GACCCGGTTC GCAGCCGACA AGTGCTGCTT AACCTGCTCT CCAACGCCGT CAAGTACAAC
AACCCGCACG GCACAATCAC CGTTACCTGG TCCACCGCCG GTGGCAGTGT TCAGGTAGAC
GTCGCCGATA CGGGCAGCGG CATCCCGCAC GGTCGCCGCG ATCGTCTCTT CGAGCCCTTC
GAGCGACTCG GTCGTGAAGC CGGCTCCATC GAGGGCAGCG GCATCGGTCT GTCATTGTCC
CAACGACTCG CCCGGATGAT GAGCGGGGAT CTGCAGCTGC TTCACAGCAG CCCCGATCGC
GGGACCACCA TGCGACTGAC CCTGCCTCGC TACAGCGATG ACCACTCAGC GACTAGCACG
CCGCCCGTCA GCCTAAGCGA AGCCGCGCCC GCCGCGCCCA TGCGCATCTG CTACATCGAG
GACAATGAGC TCAACATGAT GGTGGTGCGC GGTCTGATCC GCCGGAAAAC GACCGTCACC
CTAGACGAAG CCTGGACCGG CGCCGAGGGG CTGGCGCGAA TCCGCCAGGG CCCACCCGAC
CTGGTTCTGC TGGACATGCA CCTGCCCGAC ATGCACGGCC GGCAGGTGCT GCGCGAGATC
CGCGCCGACC CGGTACTGAC TCACCTCCCG GTCGTGGCCC TGAGCGCGGA CGCCGCCAAC
GAAACCCTGG AAACCGCCGA CGGCACCCTG GACGGCTACC TGCTCAAGCC GTTCCGGCTG
GCCGATCTGG ACGCCCTGAT CGCCCGCTTC GCCCCGGCAT GA
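
The gene sequence above is laid out in space-separated blocks of ten bases. As a rough sketch of how a GC-content figure like the one in the record could be recomputed from it, the helpers below (hypothetical names, not taken from the database) strip the layout whitespace and count G and C bases. The exact rounding the database applies is not stated in the record, so the result is returned as a plain fraction.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # Paste the full gene sequence here; only its first two blocks are shown.
    blocks = "ATGGTGACCG AACAACAGAC"
    print(gc_content(clean_sequence(blocks)))
```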
 
Protein sequence
MVTEQQTLLE VARATTAIGG WMLDRSTQAV HLSTETRRLL ELPPEHRMDL RTGLGLYPPE 
YRPIVTRAIH TTLHHARPFQ LDVDIHTATG KRLHVRKLGK PLMDDYGNVL GLRGALQDLT
EIKTAERQAR RLSQRLETTL ESITDAFFLV DPAWRFTFLN REAERQLNCQ REDVLGRVAW
EVFPEALGTA FEEHYRYALA QQEPVVFEAY FAPVETWFEA RAYPSDEGLA VYFRDISQRK
QAEAEMAQLN EELHQARERA EQASRAKSEF LAAMSHELRT PLNAITGFAQ LLHQTPPDET
QDRDQHLEQI LSAGWHLRDL IGDVIDFAQI ETGQLAVHAE PMAIDALARN SVAMITEQAR
RRGLTVHCQS DPIAPPWARA DPVRSRQVLL NLLSNAVKYN NPHGTITVTW STAGGSVQVD
VADTGSGIPH GRRDRLFEPF ERLGREAGSI EGSGIGLSLS QRLARMMSGD LQLLHSSPDR
GTTMRLTLPR YSDDHSATST PPVSLSEAAP AAPMRICYIE DNELNMMVVR GLIRRKTTVT
LDEAWTGAEG LARIRQGPPD LVLLDMHLPD MHGRQVLREI RADPVLTHLP VVALSADAAN
ETLETADGTL DGYLLKPFRL ADLDALIARF APA
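
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which codons may act as starts), and it assumes the sequence is given in the coding orientation. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid string is
# the standard assignment, which table 11 shares for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a coding DNA sequence; unknown codons become 'X'."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached; do not include it in the protein
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed in this record.
    print(translate("ATGGTGACCG AACAACAGAC CCTGCTCGAG"))  # MVTEQQTLLE
```

The first ten residues produced this way match the start of the protein sequence shown above.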