Gene Slin_5024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5024 
Symbol 
ID8728789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6121386 
End bp6122759 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content51% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003389800 
Protein GI284039870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACGT CATCACCCGA GGGCATTGGA ACCATTCTGC TGGCTGCCGG GGCGTTTCAA 
TTCGCCCGTC GTTTTCTTGA CTTGCCAACC CGTTTGCCCG GCTGGGAGCC GTTTTTCAAG
CGGATATGGG TCATTTTCGG CGTACTCATC GGCCTCTCGG CACTCTTTTC CATCGACTTG
TCGAATGGCT GGGTTGCGCT CGCGATGAGC CTGTTATTCG TTTTCATGCT GTGGCAGCTG
AAACATTTCC AACCGTCGCG CACCGTTATA CTGGCAGTAG CCCCTTTTCT GCTGCTGTGG
GTGGTTGGGA GTTATCTTGA ACACTATGCC CCAAAGTTCT ACAAGGCGAA CGAAGGTTTT
TTCGACAATT CGATTGGCTT TTCAACCATC TGGCTTGGTA CATTCCTCGT AATTGCCAAC
CGACAGAAGA AAGCACTGGT CAAAATTGAG CAGGAACGCG CCATCGAAGC GGAACAACGG
CGTATTATTG AAGCCAAAAA GCAGGAGCTT GAATACTTAG TTGCCGAACG AACCGCCAAA
ATTACCCTGC AAAAAGAAGA ACTGGAGCAG GCACTCGTTG AACTAAAAGC AACACAGGCC
CAACTGATCC AAAGTGAGAA GATGGCCTCG TTGGGCGAAC TGACCGCCGG TATTGCCCAC
GAAATTCAGA ACCCACTCAA CTTCGTCAAT AACTTCTCCG AAGTCAGTAT TGAACTGATC
GACGAACTGA CCGAAGAACA AGCCAAACCA GATCGCGACC CTGAGCTGGA AGCTGAACTG
CTGACGGATT TAAAGCAGAA CCTACAAAAG ATAAATCACC ACGGCGGGCG GGCCTCATCC
ATCGTGAAAG GCATGCTTCA GCACTCACGG GCTGGTGCCG GACAGCGGGA GCCAACGGAC
GTTAATGCAC TATGCGAGGA GTATCTGCGT TTGGCTTATC ACGGCTTGCG AGCCAAGGAT
AAAAGCTTCA ATGCGGTTTT CAGTACCGAA CTGGACCCCT CGCTGGGGTT AATTAGTCTG
GTACCGCAGG ACGTCAGCCG GGTGCTGCTT AACTTATTCA CAAACGCTTT TTATGCCGTT
CAGCAACGCC AGAAGCAGGA AAAAACACCC GGTTACCAGC CAGCAGTTAG CGTTAGCACC
CGCTGTGCCA ACAACGAAGC GGTTATCACC GTAGCGGATA ATGGGACGGG CATTGCCGAG
GATGTAAAGC AGAAGATTTT TCAGCCATTT TTTACCACCA AGCCCACGGG CGAAGGAACA
GGACTCGGCC TATCTCTGGC TTACGAAATC ATCACGAAAG GGCATGGCGG TACGCTGGAG
GTGGAAAGCG AAGTCGGAGA AGGATCTAAA TTCATTATTA CACTGCCCGG CTGA
 
Protein sequence
MNTSSPEGIG TILLAAGAFQ FARRFLDLPT RLPGWEPFFK RIWVIFGVLI GLSALFSIDL 
SNGWVALAMS LLFVFMLWQL KHFQPSRTVI LAVAPFLLLW VVGSYLEHYA PKFYKANEGF
FDNSIGFSTI WLGTFLVIAN RQKKALVKIE QERAIEAEQR RIIEAKKQEL EYLVAERTAK
ITLQKEELEQ ALVELKATQA QLIQSEKMAS LGELTAGIAH EIQNPLNFVN NFSEVSIELI
DELTEEQAKP DRDPELEAEL LTDLKQNLQK INHHGGRASS IVKGMLQHSR AGAGQREPTD
VNALCEEYLR LAYHGLRAKD KSFNAVFSTE LDPSLGLISL VPQDVSRVLL NLFTNAFYAV
QQRQKQEKTP GYQPAVSVST RCANNEAVIT VADNGTGIAE DVKQKIFQPF FTTKPTGEGT
GLGLSLAYEI ITKGHGGTLE VESEVGEGSK FIITLPG