Gene Slin_2489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2489 
Symbol 
ID8726233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3011674 
End bp3013158 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content52% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003387307 
Protein GI284037377 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.190195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.237117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTCTC CATTCCTTAA CGCCAACCGT CGGTTTTACA TGAGCCTGTA TGGGAAGCTA 
GCCCTCACCT TACTGGCACT GCTAACGGCC CTGGCCTGCG TGTATGGATA CCTCACAGCG
TATTCGGCCA TCCGCTATTT CGATGCGACG CACCAGCGGC TGAATGTCGA TGTGGCGGCT
CATATTGCCA CGTTTACAAA GCCGTTTTTA ACACAGGGCG TCAATCAGGC GGGTGCCGAT
GATATTTTCT TCAACGCAAT GGTCACGAAT CCCAGTGCCG AGGTCTACCT GCTCGACTCG
ACCGGGAGGG TGCTGGTCTA CCATGCCCCG GCCACAAAGA TCAAGCGAAG CCAGGTATCT
CTGGAACCCA TCCGGGAATT TATCAGCCAA AAAGGAAAGG TTTACATTAA AGGCGATGAT
CCGCGAAGTG TAGCCGATCA GAAAATATTT TCGGTGGCCG AAGTTCGGAA TAACAGTCGA
ATGCAGGGCT ATGTATATGT TATTCTGGGG GGTGAGCAAT ATGGCTCGGT TATGGAAAGC
CTGCTGCAGA GCCATGTGTT GCTGTGGGGG TTGCTGACCC TGCTGATTAC CCTGACAGCT
GCCCTGCTCA TCGGGCTAAT TTCGTTCCAC CGCCTGACGA GGGGCATGGA GGCCATTACC
GTTGCGGTTG GCCAGTTTCG ACAGGGCGAT TATCTGGCTC GTGTGCAGGT GAAGGCCAGC
CGGGAATTGG CTCTGGTGGC CGATACCTTC AATGACATGG CCGATGAACT GTCTCGCACG
ATAACCAATC TGACCCAGTC GGAGCGGATA CGGCGCGAAT TAGTGGCTAG TATCTCGCAC
GATTTACGAA CGCCCATTAC CGCCATACAC GGCTATGCGG ATGCGCTGAC GACAAACACC
CTACCCGAAG ACACCCGGCA TCAGTACGCC GACGTTATTG CTCAGGGAAG CAAAAAACTT
ATAATAATGG TCGATGAACT GGCCGAGTTA GCCAAACTCG AAGCGCGGGA CACGCAACTG
CAACCCGAGC CATTCGCCAT TGCCGACCTC CTCAGCGAAG TGATTGCCCG GTTCACCCCC
CTCGCCGAAC GTCAGCAACT CATGTTAATG TGTATGAATT GCCAGTCATC CGTTTTCTGC
TACGCAGATG TAGGTCTAAT TGAACGGGTA CTACAAAACC TGCTGGAAAA CACCCTTAAA
AACACACCGG CCAATGGTAT TATTCAGGTA GAGTTAAGTC AACCGGAGTC TGGTTTATTA
ACCATTTCTG TTCAGAACCC GGTTAGTCAT CTGCCCGACT TCATTCAGGC CTATTTGCGC
TCGGATGTGT GTACACCCGA GCGCACTCCT GGTAGTGGTC TTGGATTAGC CATTGTCGAA
AAAATACTGA AGCTGCATGA TACAAGGCTT CGGGCAGCTC AGCCTGAAGC CAATTTTATT
CGTTTCAGCT TCGAACTGCC CGTTTACAAA GGTTCTGACA GATGA
 
Protein sequence
MRSPFLNANR RFYMSLYGKL ALTLLALLTA LACVYGYLTA YSAIRYFDAT HQRLNVDVAA 
HIATFTKPFL TQGVNQAGAD DIFFNAMVTN PSAEVYLLDS TGRVLVYHAP ATKIKRSQVS
LEPIREFISQ KGKVYIKGDD PRSVADQKIF SVAEVRNNSR MQGYVYVILG GEQYGSVMES
LLQSHVLLWG LLTLLITLTA ALLIGLISFH RLTRGMEAIT VAVGQFRQGD YLARVQVKAS
RELALVADTF NDMADELSRT ITNLTQSERI RRELVASISH DLRTPITAIH GYADALTTNT
LPEDTRHQYA DVIAQGSKKL IIMVDELAEL AKLEARDTQL QPEPFAIADL LSEVIARFTP
LAERQQLMLM CMNCQSSVFC YADVGLIERV LQNLLENTLK NTPANGIIQV ELSQPESGLL
TISVQNPVSH LPDFIQAYLR SDVCTPERTP GSGLGLAIVE KILKLHDTRL RAAQPEANFI
RFSFELPVYK GSDR