Gene Lcho_0236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0236 
Symbol 
ID6160555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp251255 
End bp254062 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content71% 
IMG OID641662980 
Producthistidine kinase 
Protein accessionYP_001789276 
Protein GI171056927 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0591] Na+/proline symporter
[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00442274 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTCTCGC CGGCCCTGGT GCTGGGTGCT TCGTTCGCCT ACCTGACGCT GCTGTTCGCG 
GTCGCCTGGT GGGGCGACCG GCGCGCCGCG CAGGGCCGAT CGGTGATCGG CAACGCCTGG
GTCTACGCGC TGTCGATGGC GGTCTACTGC ACCGCCTGGA CCTATTTCGG CAGCGTCGGC
CGGGCCGCCT CGGGCGGGGT CTGGTTCCTG CCGATCTACC TCGGCCCGAT GCTGGCGATG
GTGCTGGCCT GGTCGGTGGT GCGCAAGATG ATCCGCATCG CGCGCACCTA CCGCCTGACC
TCGATCGCCG ACTTCGTCGC CAGCCGCTAC GGCAAGAGCC CGCTGCTGGC CGGCCTGGTG
ACGCTGATCA CGGTGGTCGG CATCGTGCCC TACATCGCGC TGCAGCTCAA AGCCGTCGCC
GGCGGCTACG CGCTGATGAC CACGCCGCTG GGCGAGCCGC TGCTGGCCGC ACCGGCCTTG
TGGCGCGACA GCACGCTCTA CGTCGCGCTG GCGCTGGCCG GCTTCACGAT GGTGTTCGGC
ACCCGCCACC TCGACAGCAC CGAGCGCCAC GAGGGCATGG TGCTGGCGAT CGCCTTCGAG
TCGGTGGTCA AGCTGGTGGC GTTCCTGGCG GTGGGCGTGT TCGTCACCTG GGGCCTGTTC
GGCGGGCCGG CCGACCTGTT CGAGCGTGCC GCCGCGGTGC CCGAACTGCG CCAGCTGCTG
CGCCTCGAAC AGGGCCGCAG CTTCCCGTAT GCGCAGTGGT TCGCGCTGAC GATGCTGTCG
ATGCTGTCGG TGATCTTCCT GCCGCGCCAG TTCCAGGTGA TGGTGGTCGA GAACGTCGAC
GAGAGCCACC TCAAGCGCGC CGCCTGGGCC TTTCCGGCCT ACCTGCTGCT GATCAACCTG
TTCGTGCTGC CGATCGCGCT CGGCGGCCTG CTGTATTTCT CGGCCGGCGG CATGAACGCC
GACAGCTTCG TGCTCAGCCT GCCGCTGGCC GGCGGCCAGT CGGCGCTGGC GCTGATCGCC
TTCGTCGGCG GGCTGTCGGC GGCCACCGGC ATGGTGATCG TCGAGGCCAT CGCGGTCTCG
ACGATGGTCT GCAACGACCT TGTCATGCCG CTGCTGCTGC GCTGGCCGCG CTGGTCGCGC
GCCGGTGACC TGACCGGGCT GCTGCTGGCG ATCCGGCGCG CGGCGATCGT CGCCATCCTG
CTGCTCGGCT ACCTGTACTA CCACCTGGCC GGCGAGGCCT ACGCGCTGGT CAGCATCGGC
CTGATCAGCT TTGCCGCGGT GGCCCAGTTC GCGCCGGCGA TGCTCGGCGG CATGTACTGG
CAGGGCGGCA CGCGCAATGG CGCGCTGGCC GGGCTCGCGG CCGGCTTTGC GGTCTGGGCC
TACACGCTGA TGCTGCCGTC GCTGGCCAAG TCGGGCTGGT TGCCCGACAC CTTCCTGCGC
GACGGCATCG GCGGCATCGC GCTGCTGCGG CCCGAGCAGC TGTTCGGTCT GGGCGGGCTC
GACAACCTGA CGCACTCGCT GTTCTGGAGC CTGCTGGCCA ACGTCGGCGC CTACGTGGTG
ATGTCGCTGG CGCACCGGCC GACCGCGCGC GAGCTGAGCC AGGCGCTGCT GTTCGTCGAG
GTCTTCGGCG CCGCCGGCCA GGGCAGTGCC GCGACGGCAC CGGTGTTCTG GCGCGGCCGG
GCGCGGGTCG AGGACCTGCG CGCGCTGGCG GCGCGTTTCC TCGGCGAGGC CAAGGCGCGC
GAGCTGTTCG AGCGCCATGC GCGCGAGCTG GGCCTGGCGG ACGTCTCGCA GCTGCAGCCC
GACGGCCGGC TGGTGCACCG GGTCGAGACC CAGCTGGCCG GCGCGATCGG CAGCGCGTCG
GCGCGCGTGA TGGTGGCGTC CACCGTCGAG GAGGACACGC TCGGCCTCGA CGACGTCATG
CGCATCGTCG ACGAGGCCTC GCAGCTGCGC GCCTACTCGC GCCAGCTCGA GGAGCAGCAG
GCCTCGCTGC GCCGCGCCAC CGACGAGCTG CGCGCGGCCA ACGAGAAGCT GCAGGGGCTC
GACCGCCTGA AGGACGATTT CATGTCGTCG GTGACGCACG AGCTGCGCAC GCCGCTGACC
TCGATCCGCG CGCTCGCCGA GCTGATGCGC GACGAGGCCG ACATGCCCGA GGCGCAACGC
CAGCAGTTCA TGACCATCAT CGTCGGCGAG ACCGAGCGCC TGTCGCGGCT GGTCAACCAG
GTGCTCGACA TGGCCAAGAT CGAGTCCGGC CACGCCACCT GGATCGACGA CGACGTCGAC
CTCGTGGCGC TGGCCGGACA CGCCGCGCAG ACCCTGGCCG AGCTGATGCG CGAGCGCGGC
ATCACGCTGC AGCTCGACCT GCCGGCCGAG CCGCTGCCGA CGTTGCGCGC CGACCCCGAC
CGGCTGCTGC AGGTGCTGCT CAACCTGCTG TCCAACGCCG CCAAGTTCAC GCCGGCCGGC
GCTGGCCGCA TCGTGCTGAC GGTGCGTGGT GATGCGCAGG GCGCGACCGT CACGGTGGCC
GACAACGGCC CCGGCGTGCC GGCCCACGAG CAGGCGCTGG TGTTCGAGAA ATTTCGCCAG
GGTGGCGACG GCCTGAACCG GCCGCAGGGC ACCGGACTGG GCCTGCCGAT CTGCCGCCAG
ATCGTCGAGC ATTACGGCGG GCGGATCTGG CTGCGATCCG ATCCGGGTCA AGGTGCGACA
TTCGGCTACA CGCTACCGTG GACAGCCGCG GCGGCGGGAG GCATGGTGCC CACCATGAAC
ACGATGCAGG AGACAGCCCA TGAGCCACAA GATACTGATC GCCGATGA
 
Protein sequence
MLSPALVLGA SFAYLTLLFA VAWWGDRRAA QGRSVIGNAW VYALSMAVYC TAWTYFGSVG 
RAASGGVWFL PIYLGPMLAM VLAWSVVRKM IRIARTYRLT SIADFVASRY GKSPLLAGLV
TLITVVGIVP YIALQLKAVA GGYALMTTPL GEPLLAAPAL WRDSTLYVAL ALAGFTMVFG
TRHLDSTERH EGMVLAIAFE SVVKLVAFLA VGVFVTWGLF GGPADLFERA AAVPELRQLL
RLEQGRSFPY AQWFALTMLS MLSVIFLPRQ FQVMVVENVD ESHLKRAAWA FPAYLLLINL
FVLPIALGGL LYFSAGGMNA DSFVLSLPLA GGQSALALIA FVGGLSAATG MVIVEAIAVS
TMVCNDLVMP LLLRWPRWSR AGDLTGLLLA IRRAAIVAIL LLGYLYYHLA GEAYALVSIG
LISFAAVAQF APAMLGGMYW QGGTRNGALA GLAAGFAVWA YTLMLPSLAK SGWLPDTFLR
DGIGGIALLR PEQLFGLGGL DNLTHSLFWS LLANVGAYVV MSLAHRPTAR ELSQALLFVE
VFGAAGQGSA ATAPVFWRGR ARVEDLRALA ARFLGEAKAR ELFERHAREL GLADVSQLQP
DGRLVHRVET QLAGAIGSAS ARVMVASTVE EDTLGLDDVM RIVDEASQLR AYSRQLEEQQ
ASLRRATDEL RAANEKLQGL DRLKDDFMSS VTHELRTPLT SIRALAELMR DEADMPEAQR
QQFMTIIVGE TERLSRLVNQ VLDMAKIESG HATWIDDDVD LVALAGHAAQ TLAELMRERG
ITLQLDLPAE PLPTLRADPD RLLQVLLNLL SNAAKFTPAG AGRIVLTVRG DAQGATVTVA
DNGPGVPAHE QALVFEKFRQ GGDGLNRPQG TGLGLPICRQ IVEHYGGRIW LRSDPGQGAT
FGYTLPWTAA AAGGMVPTMN TMQETAHEPQ DTDRR