Gene Hhal_1904 details


Gene Information

Locus tag: Hhal_1904
Symbol:
ID: 4710674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2096159
End bp: 2098957
Gene Length: 2799 bp
Protein Length: 932 aa
Translation table: 11
GC content: 69%
IMG OID: 639856377
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001003470
Protein GI: 121998683
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0591] Na+/proline symporter
        [COG5002] Signal transduction histidine kinase
TIGRFAM ID:
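
The listed lengths follow from the coordinates by simple arithmetic. The Python sketch below reproduces that check; it assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon is a stop codon. The function names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
START_BP = 2096159
END_BP = 2098957


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 2799 bp; 932 aa
```

Both values agree with the Gene Length and Protein Length fields.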


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.636099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGCCGG CGTGGACCAT CCTGCTGGTT TCGGCGCTCT ACGTCGGGCT CCTGTTCGCC 
GTCGCCTATA CGGCGGACAA GCTCGGCGAT CGCGGCCTGT CGCTGGTCAA CAACCCGTAC
GTCTACACCC TGTCCATCGC CGTCTACTGC ACGGCCTGGA CCTTCTACGG CAGCGTCGGC
CTGGCGGCCG AACAAGGGCT GGCCTTCCTG ACCATCTACC TCGGCCCGAC CCTGATGGCC
ATCGTCTGGT GGGTGGTGCT GCGCAAGATC GTGCGCATCA GCAAGCTCTA CCGGCTCACC
TCCATCGCCG ATTTCATCGC CGCCCGTTAC GGCAAGAGCA TGCTCCTCGG CGGCCTGGCG
GCATTCATCG CCCTAGTCGG CACCACCCCG TACATCGCGC TGCAACTCCA GGCGGTCTCC
TCGAGCTTCG ACGTCCTGCT CCATTTCCCG GACATGGAGG CGGTCCAGGC GGGCAGTGCC
GGGCTTCTGG ACGACAAGGC GCTCTACGTG GCGGCCCTCC TGGCCCTGTT CACCATCCTC
TTCGGCACCC GCCACATCGA CGCCACCGAA CGCCTGGAGG GCATGGTGGT GGCGGTGGCC
TTCGAGTCGG TGGTCAAGCT GGTCGCCTTC CTGGCGGTCG GGATCTTCGT CACCTTCGTC
CTCTTTGCCG GGCCCGTGGA GCTCTTCCGC GAGGCAGCGG CCTCGCCGGC CATCGCGGAG
CTCGGCCTGC TCACCGGACT ACCCGGCGGC TACCAGAACT GGATCGCCAT GCTGGCGCTC
TCGGCCCTGG CGATCCTCTT CCTGCCGCGG CAATGGCAGG TCAGCGTGGT GGAGAACGTC
AACGAGGACC ACATCCGCAC GGCCTCCTGG CTCTTCCCGC TCTATCTACT GGTCATCAAC
CTGTTCGTCC TGCCCGTGGC CCTGGCGGGG CTGCTGTTCT TCGGCGACGA GATCCCCGGC
GACGACTACG TGCTGGCCCT GCCCCTGGCC GTCGATCAGG GGCTGCTCGG CCTGCTGGTC
TACATCGGCG GCTTCTCGGC GGCCACCGGC ATGGTGATCG TCGCCACCAT CGCCGTGGCC
ATCATGGTCA GCAATGACAT TGTGACCCCG GCGCTGCTGC GCCTGCGCCG CTTTCACGCC
CGGGGGCGCC ACGATCTGTC GAGGCTGATC GTCACCATCC GGCGCCTGGT GATCTGCGCG
ATCCTCGCCC TCGGCTACGC CTACTACCTG CTCATCGCCG ACACCCACGC CCTGGTCTCC
ATCGGGCTGA TCTCCTTTGC CGCGGTGGCG CAGTTCGCGC CGGCGGTCCT GCTCGGGCTG
TTCTGGAAGG GGGCGAGCCG GCGCGGCGCC CTGGCCGGGC TGGTGGCCGG GTTCCTGCTC
TGGGCGTACA CCCTGCTGCT GCCTTCGCTG GCCGAGGCGG GGCTGCTCGG CGAGCGGCTG
CTCACCGACG GCCCGTGGGG CATTGAATGG CTGCACCCCT ACGGCCTGTT CGGTATGGAG
GGCATGGACC CCATCGCCCA CGCCTTCTTC TGGACCCTGC TGGCCAACCT CGGGCTGCTC
CTCGGTGTCT CGCTGTTCGA TCGCCAGGGC GACATGGAAC GCATCCAGGC CACGCTGTTC
GTCGACGTCT TCCTGCGCTC GGAGCGCGAC GCCCGCTTCT GGGAGGGCAC CGCCACCGTC
GGCGACTTGC AGGACCTGCT CGGCCGCTTC CTCGGCCCGG AGCGGGCCGC CGGGGTCATC
CGTGAGTACG GCGCCCACCG CGGACGTCCG CTGGATGAGG CCGAGCAGGC ACAGCCCGAG
CTGGTCAATC AGGTCGAGCG CCTGCTCGCC GGCTCCATCG GCTCCGCCTC GGCCCGGGTG
ATGGTGGCCT CCATCGTCAA GGGCGAGGCG CTCTCCTACG AGGGGGTCAT GGAGATCCTC
GACGCCACCT CGCGGGCCAT CGAGTACTCG CGGCGGCTGG AGGAGAAATC CCACGCCCTG
GAGAGCGCCA CCGAGGAGCT GCGCGCCGCC AACGAGCGGC TCAAGGAGCT GGACCAGCTC
AAGGACGAGT TCGTCTCCAT GGTCAGCCAC GAGCTGCGCA CGCCGCTGAC CTCGATCCGC
GCCTTCGGCG AGATCCTACT CAACAACCCG GAGATGGACG CCGATCAGCG CCGCGAGTTC
CTGGAGGTGG TCGTCCGCGA GAGCGAGCGG CTGACCCGGC TGATCAACCA GGTACTGGAC
CTGTCCAAGA TCGAGAGCGG CTCGGCCCAG TGGCAGCTGG AGGACGTGGA TCTCTCGCGC
CTGGCCCGGG AGGCGGCGGA GTCCACCCAG CAGCTGTTCA CCGACCGGCA GACGCAGCTG
CACATCGAGA TCGCCAGCGA CGACAACGCC ATCAAGGGCG ATCCGGACCG CCTGATGCAA
CTGATCATCA ACCTCCTGTC CAACGCCGCC AAGTTCACCG AGCCCGGCGA GGGGCAGGTG
TGGCTGCGCC TGGAGAAGGG CCGCGGCGAC ACCCTGCGCC TGTCGGTCAC GGATAACGGA
CCGGGGATCA GTGCCGAGGA TCAGCGGCGG ATCTTCGACA AGTTCCACCA GATCTCTCAG
CAGCAGGCCG GCAAGCCCAA GGGCAGCGGC CTGGGGCTGG CCATCTGCCG GCTGATCGCC
GACGCCCACT GGGGCACGCT CTGGGTCGAC AGCGAGCCCG GCGCCGGCGC CAGCTTCGTC
TGCAAGCTGC CCCGCGCCGG CGGCGAGCAC TGCAACGTCA CCGGGTACGT GGCGTTCTCC
GAGCGCGGCC CGGCACCCCC TGAGGAGGAG CCGTCGTGA
 
Protein sequence
MLPAWTILLV SALYVGLLFA VAYTADKLGD RGLSLVNNPY VYTLSIAVYC TAWTFYGSVG 
LAAEQGLAFL TIYLGPTLMA IVWWVVLRKI VRISKLYRLT SIADFIAARY GKSMLLGGLA
AFIALVGTTP YIALQLQAVS SSFDVLLHFP DMEAVQAGSA GLLDDKALYV AALLALFTIL
FGTRHIDATE RLEGMVVAVA FESVVKLVAF LAVGIFVTFV LFAGPVELFR EAAASPAIAE
LGLLTGLPGG YQNWIAMLAL SALAILFLPR QWQVSVVENV NEDHIRTASW LFPLYLLVIN
LFVLPVALAG LLFFGDEIPG DDYVLALPLA VDQGLLGLLV YIGGFSAATG MVIVATIAVA
IMVSNDIVTP ALLRLRRFHA RGRHDLSRLI VTIRRLVICA ILALGYAYYL LIADTHALVS
IGLISFAAVA QFAPAVLLGL FWKGASRRGA LAGLVAGFLL WAYTLLLPSL AEAGLLGERL
LTDGPWGIEW LHPYGLFGME GMDPIAHAFF WTLLANLGLL LGVSLFDRQG DMERIQATLF
VDVFLRSERD ARFWEGTATV GDLQDLLGRF LGPERAAGVI REYGAHRGRP LDEAEQAQPE
LVNQVERLLA GSIGSASARV MVASIVKGEA LSYEGVMEIL DATSRAIEYS RRLEEKSHAL
ESATEELRAA NERLKELDQL KDEFVSMVSH ELRTPLTSIR AFGEILLNNP EMDADQRREF
LEVVVRESER LTRLINQVLD LSKIESGSAQ WQLEDVDLSR LAREAAESTQ QLFTDRQTQL
HIEIASDDNA IKGDPDRLMQ LIINLLSNAA KFTEPGEGQV WLRLEKGRGD TLRLSVTDNG
PGISAEDQRR IFDKFHQISQ QQAGKPKGSG LGLAICRLIA DAHWGTLWVD SEPGAGASFV
CKLPRAGGEH CNVTGYVAFS ERGPAPPEEE PS
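
The protein sequence can be related to the gene sequence with a short Python sketch. It assumes translation table 11 as stated in the record: the standard codon assignments, with alternative start codons such as GTG read as methionine when they open the reading frame. The helpers `clean`, `translate_cds` and `gc_fraction` are hypothetical names introduced for this illustration. The sketch takes the gene sequence as a string with the 10-base blocks and line breaks still in place.

```python
from itertools import product

# NCBI-style codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons of translation table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_line = "GTGCTGCCGG CGTGGACCAT CCTGCTGGTT TCGGCGCTCT ACGTCGGGCT CCTGTTCGCC"
print(translate_cds(first_line))  # MLPAWTILLVSALYVGLLFA
```

Applied to the first line of the gene sequence, the sketch yields the first 20 residues of the protein sequence; note that the opening GTG is reported as M, not V. Passing the complete gene sequence to `gc_fraction` gives a value to compare with the GC content field.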