Gene Slin_5388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5388 
Symbol 
ID8729154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6547896 
End bp6549851 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content53% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003390154 
Protein GI284040224 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAC AACGCATACG CTGGATTGTT GCACTGATGG CCATTGGACT TTTGGGGCTG 
GTGGGGCTGC AATTGTACTG GATCAGTAGC GCCTTACAGT TACAGAAAGA ACAGTTTGCC
TATAAAGTGA CGGATGCCTT GCAGGAGGTG GTTCGCGCAC TGGAGCGGCA GGAAGTTTAT
TACCAGACCA ATCAGCGCGT TCGGGCCCGC GAACAGCAGG ACCGCCTGAT GGCCATTGCC
AAAAAAGAAG GTCGGGTGGT GCCTAAAGCA TCTGGTGAGC ACGTTGCCGT ACATTCGGAA
GCTACTCCAC GGGCTAAGGC CAAACGCCCT CTGTCATCAA CAGACCAGTT GGCGGCCCGG
TACGGTTTGC CGCCAGGTAT GACGGCCGCC GGTACTATGG TGGTTCAGTC GGATGTGCTG
CATCCAATCG TCCGGCCGCT CTCGGCCGAA CAGATGGTGG TAGTTGAAGA GTTTTTTCGG
CAGCAGGATG AACTAATGGC AGTTGGCGAC TGGCAGGCTC AACTAGCCCA GCAACACCAG
TTTGACCAAT GGGTCGAAAA TGTACTGAAC AGCGAGTTAA CCCGCATCAA TAACCAGATA
GGACACCAAG CCAAGCGTGA CTCGTCCGGA CGGGCGGCTA AAACCCAGCT TCGTCGGGCT
GCACAACAGC GGAAAACACA GTCGACAGAT GGAGTGGCCG CCTTAAAACC TGGTTTGCCA
ACCCGCCCGG CCAGTGTGAG TCAGAATCGG GCCGAGGAAC AATCGCACAG GATAAAAGAC
GTTTTGAAAG GTCTTCTGAT GTCGGATCGC CCGATCGAAG AACGCATCAA TCGGCTGGCA
CTCGATACGT TGCTACGGCA GTCGCTGTCC GAACGGGGCA TTGATATTCC GTTTGCCTAC
GCCGTCCGAA CGCGTCAGCA ACCTCAATTT CTGTTCACCT CTCTTGGAAG TGATGCGCAG
CAATTCGATA AAAGTGGCTA TAAAGCGGCC CTCTTTCCAA TTAACCTAAT GGAAAGGGGT
AACTATGTCT ATGTTTATTT TCCTACCCAG CAGCAGTTTA TTCTCAATCG GCTGGGGTTC
ACGTTTGGCG CGTCGGTTGT GCTGATTCTG GTTATTCTGG CTTGTTTTTA CATTGCGATC
AGCACAATCG TTCAGCAGAA AAAGCTGGCC GATATCAAGA ATGATTTCAT TAATAACATG
ACCCACGAGT TCAAAACGCC TATTTCGACG ATCTCGCTGG CCGTAGAAAT GGCGCAGGAG
CAGATGAGTG CGGGCCGGGG AGCCTGGAAC AAAGAGCTCG TCTCCGGCTC TTCCGGCGAT
GAACTGGCCG GTCGCCTGTC GAGGTACATG GGTATTATCC GGGACGAAAC CCGGCGGCTG
GGTTCGCACG TCGAAAAAGT GCTGCAAATG GCGCTGCTGG ATCGGGGTGA GATAAAACTT
AATCTGTCGC CCGTAAACGT GCACGACGTT ATCGAAAAAG TACTGAACAA CGTGGGGCTT
CAAATTGAGC AGCGCGGGGG CGAAGTCGAC CTGCATTTCG ATGCCGACCG TGAAGTGGTG
GAAGCGGATG AACTTCACCT GACAAACATC ATTTATAATC TCCTCGATAA TGCCCTGAAG
TACTCACCCG AAAGCCCGCA CATCACCATC AGCACGCGTA GTTTGCCCGA TACATCGGCG
GTGCCCGGTG CGGCTTCTAC GCTGCCGGGG GTAAGCATTA CGGTCGCTGA TCGGGGCGTG
GGCATGACGA AAGAGCAAAC CAACCGAATT TTTGAGAAAT TTTATCGGGT ACCTACCGGC
AACCGGCACG ATGTAAAAGG CTTTGGCCTG GGTCTGAGCT ATGTGAAAAA AATGGTCGAT
GAACACCACG GCCAGATTAT GGTCGACAGC CAACCCGGCA AAGGCAGTTC ATTTGAAGTC
ATTATACCGT ATACACAAGA TTTAGTAGTG AAGTAA
 
Protein sequence
MSKQRIRWIV ALMAIGLLGL VGLQLYWISS ALQLQKEQFA YKVTDALQEV VRALERQEVY 
YQTNQRVRAR EQQDRLMAIA KKEGRVVPKA SGEHVAVHSE ATPRAKAKRP LSSTDQLAAR
YGLPPGMTAA GTMVVQSDVL HPIVRPLSAE QMVVVEEFFR QQDELMAVGD WQAQLAQQHQ
FDQWVENVLN SELTRINNQI GHQAKRDSSG RAAKTQLRRA AQQRKTQSTD GVAALKPGLP
TRPASVSQNR AEEQSHRIKD VLKGLLMSDR PIEERINRLA LDTLLRQSLS ERGIDIPFAY
AVRTRQQPQF LFTSLGSDAQ QFDKSGYKAA LFPINLMERG NYVYVYFPTQ QQFILNRLGF
TFGASVVLIL VILACFYIAI STIVQQKKLA DIKNDFINNM THEFKTPIST ISLAVEMAQE
QMSAGRGAWN KELVSGSSGD ELAGRLSRYM GIIRDETRRL GSHVEKVLQM ALLDRGEIKL
NLSPVNVHDV IEKVLNNVGL QIEQRGGEVD LHFDADREVV EADELHLTNI IYNLLDNALK
YSPESPHITI STRSLPDTSA VPGAASTLPG VSITVADRGV GMTKEQTNRI FEKFYRVPTG
NRHDVKGFGL GLSYVKKMVD EHHGQIMVDS QPGKGSSFEV IIPYTQDLVV K