Gene Lcho_4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_4001 
Symbol 
ID6161404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp4482922 
End bp4485246 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content69% 
IMG OID641666781 
Producthistidine kinase 
Protein accessionYP_001793020 
Protein GI171060671 
COG category[T] Signal transduction mechanisms 
COG ID[COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value2.32996e-06 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCGCC GTTCCCTGCG CTGGACACTG CTGGTCGCCC TGATCGCCAC CGGTGCCGCC 
GTGCTGGTGC TGGCCTTCCT GCTGGCGGTG GCGACCAACA ACCGGGCCCT CTACGAGCGC
CATTTCGGCT GGCTGCTGTG GGTCAACATC GGCGTCGCCA CGGCGCTGGG CGTGGTCATC
GTGCTGGCCG TGCTGCACCT GGCCCGGCGC ATGCGGCACG GCAAGTTCGG CAGCAAGCTG
CTGTTCAAGC TGGCGGCGAT CTTCGCGTTT GTCGGCGTGT TTCCCGGGGC GGTGATCTAC
GGCGTCTCGT ACCAGTTCGT CTCGCGCAGC ATCGAGAGCT GGTTCGACGT CGAGGTCGAC
GGCGCGCTCG AAGCGGGGCT GAACCTCGGG CGCAGCACGC TGCAGATCCT CGTCAACGAC
CTGGCCGGCA AGACCCGCGT GGCCGCCGAG CGGGCCGCCG ACGCGCCCGA CCGCATGCAG
GTCCTGGCGC TCGAGCGCCT GCGCGAACAG CTCGCCGCAC AGAGCGTGGC GGTGCTGTCG
CCCAGCGGCA AGGTGCTGTA TTCGGCCGGC GGCTCGATCG CGGGGGCGCC GTTGCCGGAC
CGACCCACCA CTTCGATGCT GCGCCAGGCC CGCAGCGCCC GGGTGATCTC GCTACTCGAG
GGGCTGGATG AGGAACCGGC CGGCCGGGCC AATCCGGCCA TCGCATCCAG TGCCCGCGTC
AGGGCGATCG CCTGGGTGCC GTCGAGCGAC TACAGCCTCG GCAACGAGGA TCGGTTCCTG
CAGGTGACGC AGCTGATCCC GGCCGAACTG GCGGCCAACG CGCTGCGGGT GCAGTCGGCG
TTCAGCGAGT ACCAGCAGCG CTCGCTCGGC CGGGTCGGCC TGCGCAAGAT GTACATCGGC
ACGCTCACGC TGGCGCTCAT CCTCGGCGTG TTCAGCGCCC TGCTGATGGC GGCGGTGCTC
GGCCATCAGC TGGCCCGGCC GCTGGTGGTG CTGGCCGAGG GTGTGCGGCA GGTCGCGCGC
GGCGATCTCA CGCCCACCGA GGTGTTCAGC TCGCGCGACG AGCTCGGCGG CCTGACCCGC
TCGTTTGCCG ACATGACCCA GCAGCTGGCC GACGCGCGTG CGCTCGTCCA GACCACGCTG
GCCGAGGCCG AGAACGCCAA GCGGCACTTG CAGACGATTC TCGACAACCT CACCACCGGC
GTGATCCTGT TCACCGGGCG CGGTGGGATC GACACCGTCA ATCCGGGGGC GGCGCGCATC
CTGCGCGTGC CGATGCAGGC ATGGCAGGGT CATGCGCTGG CCCAGGTGCC GGGGCTCGAG
GCGTTCTCGC GCGCGATCGA TCAGCATTTC CTGCAGGCCA TGCAGGAGTC CGATGGCCCC
GAACACGGCC AGTGGCAGGA CACCTTCGAG CTCGGTCCGG TGCTGACGGC CGATCTCACG
CTGCTGGTGC GCGGCGCGCT GCTGCCCGAC GGCACCCGGC TGATCGTGTT CGACGACATC
ACCGAGGTGG TGTCGGCCCA GCGCAGCGTC GCCTGGGCGG AGGTGGCGCG GCGGGTGGCG
CACGAGATCA AGAACCCGCT CACGCCGATC CAGCTGTCGG CCGAGCGCCT GCAGCACAAG
CTCGAGGCCA AGCTCGCCGA ACCGGCCGAC CAGCAGATGC TGGCGCGTTC GGTGGCTACC
ATCGTCGCGC AGGTGCAAGC GCTCAAGACC ATCGTCAACG AGTTCCGCGA CTACGCCCGC
CTGCCGTCCG CCCAGCTGGC GCCGCTCGAC CTCAACGAGC TGGTGGCCGA GGTGCTGGGG
CTCTACGCGG TGCAGCAGGA GTCCGGCCGG CTCGAGGCTC GGCTGGGCGC CGCGTTGCCT
GCCTTGCTGG GTGACGCAAG CCAGTTGCGG CAGGTCATCC ACAACCTGGT CCAGAACGGC
CTCGACGCGG TGGCCGATCG CAGCGAGGGC CACGTGACCG TCAGCACCGA GGCGACTTTC
CGCGATGACG ACGGCAGTCT GCGCAGCGTG CAGCTCAAGG TGCTCGACAA TGGCCCCGGT
TTCAGCGAGG CATTGCTGAA GCGAGCCTTC GAGCCCTATG TCACGACCAA GTCTCGCGGT
ACCGGCTTGG GGCTGGCGGT GGTCAAGAAG ATCATCGACG AGCACGGAGC CCGCATCCGA
CTGAGCAATC TGGCCGATGC CGCGACAGCG TCAACAGGTG CCGGCGCAAA GAGCGGTGCA
CAAGTTTCGA TATCATTTTC GCGCTGGGCG GCGAGTGCGC CGCAGCCATC CGAACCCCGG
GCCGCCCGTG GCCGAATCCC GCCCCCCATC GTGAGCGTGT CCTGA
 
Protein sequence
MSRRSLRWTL LVALIATGAA VLVLAFLLAV ATNNRALYER HFGWLLWVNI GVATALGVVI 
VLAVLHLARR MRHGKFGSKL LFKLAAIFAF VGVFPGAVIY GVSYQFVSRS IESWFDVEVD
GALEAGLNLG RSTLQILVND LAGKTRVAAE RAADAPDRMQ VLALERLREQ LAAQSVAVLS
PSGKVLYSAG GSIAGAPLPD RPTTSMLRQA RSARVISLLE GLDEEPAGRA NPAIASSARV
RAIAWVPSSD YSLGNEDRFL QVTQLIPAEL AANALRVQSA FSEYQQRSLG RVGLRKMYIG
TLTLALILGV FSALLMAAVL GHQLARPLVV LAEGVRQVAR GDLTPTEVFS SRDELGGLTR
SFADMTQQLA DARALVQTTL AEAENAKRHL QTILDNLTTG VILFTGRGGI DTVNPGAARI
LRVPMQAWQG HALAQVPGLE AFSRAIDQHF LQAMQESDGP EHGQWQDTFE LGPVLTADLT
LLVRGALLPD GTRLIVFDDI TEVVSAQRSV AWAEVARRVA HEIKNPLTPI QLSAERLQHK
LEAKLAEPAD QQMLARSVAT IVAQVQALKT IVNEFRDYAR LPSAQLAPLD LNELVAEVLG
LYAVQQESGR LEARLGAALP ALLGDASQLR QVIHNLVQNG LDAVADRSEG HVTVSTEATF
RDDDGSLRSV QLKVLDNGPG FSEALLKRAF EPYVTTKSRG TGLGLAVVKK IIDEHGARIR
LSNLADAATA STGAGAKSGA QVSISFSRWA ASAPQPSEPR AARGRIPPPI VSVS