Gene Slin_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3100 
Symbol 
ID8726853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3759585 
End bp3761306 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content51% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003387910 
Protein GI284037980 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTGA AGACCTTGCT GCCATTTCTA CTTGCGCTTA CCCTGCTGAC CGTTGTGATC 
CTTATTGACC GTCACAACTT TAACCAGATG AGGAATTACA CGATTCAGGT AGACAGAAGC
CGGGACATTA TCAACCGGCT CGAACGTCTA TCCAACCACT TCAAAAGTGT CCAAATCTAC
AGCCCGGCGG CTGCAGCCAA GGCTCCGGTA AACTTTTACC AGCTATATAA ATTCGAAGCC
GCCCATTTAC GCCAGGAACT GACTCAGTTG CATCCTCTAC TGAAAGCCGA CTCAACTCAG
TACAAACGGC TCCTCACTGT CAATCGGCTG ATCGAGCGTC ATTGGAGCAC CCTGATGGTG
AACAACATTG GCGAACTTAT CCAGCAGGGA CAAGGGTGGC GTCTTAATGA CCTGTTTCGG
GTACACCAGC TCATTAACCA GGCTGTCGAC TATGAAAACG CCCTGCGTCA GTGCCGTCAG
CAGGAACTGA CCCGGTCAAC AACCATCAGC CAGGCAGCTT CGAATACGTT TTCGCTCGTA
GCGCTGGTTA TACTGTTGGT AACGTTTGGG GTTAACGTGC GGCTAAATCA CCGCCGAAAA
ATGCTACAAG GGTTTCTGGC CTCTATCCTG AACACCTCCC GCAACGGAAT CGTCAATTTA
CAGCCTGTCC GGAAGCAGGA CCAGGTAGTC GACTTTAAAG TAGATTATGC CAACGCTGCG
GCAGAAGACC TGCTGGGCAT ATCGCCGACC CAGATTAAGG GGCAGCATCT GCTCAGTTTA
CCCGGCTTTG AGAGCGGAAA GGCTGAACTC ATGAATCATT TTCTGAAGGG TATGAACACA
GACCAGGCAG AACCATTTGA GTGGTTGCTT CAAATAAACG GGGCTACTGT CTGGCTGTAC
GTTATATCGG GACAACACAA CGGCGGCTTA ACGGTCACTC TCCAGGATAT TACCTCCCTA
AAGCACTATC AGCAGGACCT TCAGACCAAA ATCGAACAAC TCAACCGGAG TAACGAAGAC
CTACAGCAGT TTGCGTCCAT TGCCAGTCAT GACTTGCAGG AGCCGCTCCG AAAAGTGCAG
TCATTTGGCG ATATTCTCAG AGATCAGTAT TCAGATCAGT TGGGTGAAGG TGTCTACTAT
TTGCTGCGAA TGCAGAATGC CGCAACCCGG ATGTCGGTAC TTATTAAAGA TTTGCTGAGT
TTCTCCCGGA TTTCAACGGG AGAGCCCAGC AAGATAGCGG TAAATCTGGA AGACATTGTT
CAAAAAGTAC TGTCTGACCT GGACTTACAG GTGGCAGAAA CCAGGGCTGC AATCGACGTG
GGCGTGCTGC CAACGTTGCC GGGGAACGCA TCGCAACTAA GACAGCTGTT TCAGAACCTG
CTCAGCAATG CACTCAAGTT TCATTCGGCA GACTCGACGC CCGTTATTCG AATAACCTGC
CAACCGGCCG GTGCCGACGA GTTGCCCGCC GGTATGCAGA GTGTACAGCC GACAACGGCA
TATCATTGTA TTGATGTGAT CGATAACGGC ATAGGTTTTG AGGAAAAATA CCTCGACCGC
ATCTTCCAGA TCTTCCAGCG GCTGCACGGC AAACAGGCGT TTGCGGGTAC GGGTATTGGA
CTGGCTATAT GCGCTAAAGT GGTGGCTAAT CATGGCGGGT TCATTACAGC CCGGAGTCAA
ACGGGGCAAG GAACCACCTT CAGCGTATAC CTACCGGTTT GA
 
Protein sequence
MNLKTLLPFL LALTLLTVVI LIDRHNFNQM RNYTIQVDRS RDIINRLERL SNHFKSVQIY 
SPAAAAKAPV NFYQLYKFEA AHLRQELTQL HPLLKADSTQ YKRLLTVNRL IERHWSTLMV
NNIGELIQQG QGWRLNDLFR VHQLINQAVD YENALRQCRQ QELTRSTTIS QAASNTFSLV
ALVILLVTFG VNVRLNHRRK MLQGFLASIL NTSRNGIVNL QPVRKQDQVV DFKVDYANAA
AEDLLGISPT QIKGQHLLSL PGFESGKAEL MNHFLKGMNT DQAEPFEWLL QINGATVWLY
VISGQHNGGL TVTLQDITSL KHYQQDLQTK IEQLNRSNED LQQFASIASH DLQEPLRKVQ
SFGDILRDQY SDQLGEGVYY LLRMQNAATR MSVLIKDLLS FSRISTGEPS KIAVNLEDIV
QKVLSDLDLQ VAETRAAIDV GVLPTLPGNA SQLRQLFQNL LSNALKFHSA DSTPVIRITC
QPAGADELPA GMQSVQPTTA YHCIDVIDNG IGFEEKYLDR IFQIFQRLHG KQAFAGTGIG
LAICAKVVAN HGGFITARSQ TGQGTTFSVY LPV