Gene Slin_4824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4824 
Symbol 
ID8728588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5877879 
End bp5880953 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content53% 
IMG OID 
Producthistidine kinase 
Protein accessionYP_003389601 
Protein GI284039671 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0714049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.112759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAATC ACCCTTACTA CTCACGACTC ATCCGCCTAT GTCGGACCGG GTTCATGTCT 
CTGCTTTTTA TTAATGCCTT GCCGCTGGGT AGCTGGGGGC AGTCTACCCT AATCTTTGAC
CACCTGTCAA CCGCTCAGGG ACTTTCTCAA AGTACAGTTC GAAGTATTTG CCAGGACAGG
GAAGGGTTTA TGTGGTTTGG CACCCATGAC GGCCTAAACA AGTACGATGG TTATTCCTTT
ACCGTATTCA AAGCGGATCC GAACGACCCG CAGAACACCC TACATCATAA CATCATCACG
GATATCCATG AAGATCGGAA GGGACGATTG TGGGCGGCTA CCCTGGGGGG AGGCTTGCAT
CAAATTGATA AACGAACGGG TCAAGTAACC GCCTTTGAAC TGGGGCGGAA TCGGGAAAAT
GCCTGGAACA CGTTGTTTTC CATCCATGAG GATCAAACAG GGGGACTCTG GGTAGCCAGC
GGGGGGGGGC TGGCGCGCTT TGATCCCGCC ACCCGCCGGT TTACCCGGTA TGCCAAACCC
GCTTATCCGG TGGCGATTAC CCAGGATGCT TCCGGAAACT ACTGGGTGGG TGGTATACAG
GGCGTAAGCC GGTTCGATCC CCGCACGGGC ACCTTTACGG CGATTAACAT CCGGAATGGA
CAATTTAAAC AGCCCTTTAT CTCCTCTTTA TTGCTTGACA GCAAAGGAAT TCTATGGGCG
GGTAGTCTGG AAGGGGGCGT GTGGCGTCTG GATGCCGGAG GGACTCCTCT GCGGTTTACC
CGGTACAACC CCAGGGGGCT ACTCACAAAA TCGATCCGAT ATAATGGGAT TTATGCCAAT
CGACGGGGAG AGGTTTGGCT GGCAACCGGC GAGGGTTTAC AACGGGTTGA CCCAAACACC
GATCAGGTCA CTACCTATAC GGAAGATCGA TCGGTCCCGG GTAGCCTAAG TAATAACAGT
ATCCAGTCCC TGTATGAAGA CCGGACGGGA TCGTTTTGGG TTGGCACCAA CCATGGTGTT
AACAAAACGC CGGCCTATAC GAAAGCTTTT TCGGCTTATC AGATCATTCC GACATTAGCC
TCCACAAGTT TAAATCATAA CTACATCAAT ACGTTACTGG AGGACCATAC AGGCACGCTC
TGGCTGGGCA GTAGTGCCGG TAGTATGGAC GGAAGCTTTC AACATGACTT AGTCGCGGCA
AATCCAGTTC AGCTCCACAG TAACCAAAGG GGCCCCTTTA AGTCAGTTGC GTCCCTGGCC
AAGCAAAAAG TATGGACGCT GTACGAAGAT CGCCAAAAAC GGCTGTGGGC GGGTACCGAG
AAGGGTTTGT ATCAGTATCA ACGGGCCATG GGCCACTTCA AGCGATACCC CTTCCCCTTT
TCGGTTCGCT GTATTGTCCA GGATTCAGCA GGAATCTTAT GGGTGGCCAA TCATAGTGCC
GGAGACACGA CCGTCATTGC GGCTTTGGAT CTTACCCATT CCCGCTCGAC CTACTATTAT
CACCACCCCG GGAATACCGC TGGATTGAAC AATGCGTTTA TTTACCAGCT ACTGGCCAGC
CGCAGCGGGG ACATCTACGT TGCAACCGGG GGAGGGGGTA TCAATCGGCT CAATCCCCGA
TCAGGGCGAT TTATACATTA CCTGCCTGCC TATGAGTCCC GGGCTTCTCA CTTGAATGAC
AAAGAGATTC GATGCCTCTA CGAAGATCAC CAGGGGATGA TTTGGGCGGG CACAGGACTG
GGCGGTTTAA ACCGACTGGA TCCCCGGACC GGTAGGGTTC AGGTTTATAC TACCCATGAG
GGACTGCCCA GTAATCGAAT TCTTAGTATC ATTGATGATG ATCAGGGTAA TTTATGGCTA
GGGACGGCGC GGGGGCTTAG CCGCTTCGAC CGGATTACCC AGCACGTTCG CAATTACGAG
CAAAGGGATG GGTTGCCCGA TGACGAGTTC AATACGGGCG CCGTTTATAA ACGGCAGGGC
AGACTCTGGT TTGGTACTCG TAATGGATTT TTTGGGTTCA ATCCTGATAG CATTCAGGAC
AACACAACCC CTCCCTCGGT TTACATCACC GGTTTAACGG TGATGAACCA AAGGCGCCCC
CTGCCCCAAC GGCAACTGAC GCTGGCACAT GACGAAAACT TTTTAACCAT TGAGTTTGTG
GCGCTGAACT TTCATCGGCC CGAGAAGAAT CAATATGCGT TTCAACTAGT GGGCTTAGAT
AAACAGTGGG TGTTCAGCAA TGCCCGACGG TTTGCGAGCT ACACCAACCT GGCTCCCGGG
CACTACCGAT TTCGAGTAAA AGCGGCCAAT AATGATGGGG TATGGAATCA AACGGGTACC
TCTTTCGGGC TAACCATTGA GCCGCCTTGG TGGCAAACAA ACTGGTTTCG GCTTATGGCC
CTAATCAGTT TACTGTTGGG GATGGGGATA ACCATTCGGT TCTACACCCG GGTCAAACTG
CGCAGGCAAC GGCATGAGTT AAAGAAAGTG CTCCAGGCTC AGGAGGAAGA ACGGCAACGG
CTGGCAGCGG ATCTCCACGA CGATCTGGGG GCGACACTGT CGACTATTAA AGGACAGCTG
GAAACATTAC CGTCTTTAAG GCAAGAATTA GAGATGCCTA TCCGTCTGAT GGGAAAGGCC
ATTGGTGATC TGCGTTTTAT CTCCCATAAC CTGATGCCGC CCGAGTTCAG CCGGCTGGGC
CTGGCGGAAA TTCTGGGCGA AGCCATCAGA CAGCGGCAGG TCAGTTCAAC CCCTGTTTTT
CTCTTCGTTA CCTTTGGGCA ACAACGTCGG CTCGATTTGG AAACCGAACT CATCGTCTAT
CGCATCGCCG TTGAACTCAT CAACAATGCC CTTAAACATG CCCGGGCGCG ACACATCACA
GTACAGTTGA TTTTCCATCC CGAACAAGTA TGCCTGCTGG TGGAAGATGA TGGCCTCGGC
TACTTAGCCT CACACCGCCC AGCCGCCGGA GCCGGACTGC GCAATATCCG CTCCCGGGCT
GCCTATCTAA AAGGGGACCT AGTGGTAGAT TCCAACCCTA GAGGCACGAT GGTCACGTTA
ACTATCCACT ACTAA
 
Protein sequence
MDNHPYYSRL IRLCRTGFMS LLFINALPLG SWGQSTLIFD HLSTAQGLSQ STVRSICQDR 
EGFMWFGTHD GLNKYDGYSF TVFKADPNDP QNTLHHNIIT DIHEDRKGRL WAATLGGGLH
QIDKRTGQVT AFELGRNREN AWNTLFSIHE DQTGGLWVAS GGGLARFDPA TRRFTRYAKP
AYPVAITQDA SGNYWVGGIQ GVSRFDPRTG TFTAINIRNG QFKQPFISSL LLDSKGILWA
GSLEGGVWRL DAGGTPLRFT RYNPRGLLTK SIRYNGIYAN RRGEVWLATG EGLQRVDPNT
DQVTTYTEDR SVPGSLSNNS IQSLYEDRTG SFWVGTNHGV NKTPAYTKAF SAYQIIPTLA
STSLNHNYIN TLLEDHTGTL WLGSSAGSMD GSFQHDLVAA NPVQLHSNQR GPFKSVASLA
KQKVWTLYED RQKRLWAGTE KGLYQYQRAM GHFKRYPFPF SVRCIVQDSA GILWVANHSA
GDTTVIAALD LTHSRSTYYY HHPGNTAGLN NAFIYQLLAS RSGDIYVATG GGGINRLNPR
SGRFIHYLPA YESRASHLND KEIRCLYEDH QGMIWAGTGL GGLNRLDPRT GRVQVYTTHE
GLPSNRILSI IDDDQGNLWL GTARGLSRFD RITQHVRNYE QRDGLPDDEF NTGAVYKRQG
RLWFGTRNGF FGFNPDSIQD NTTPPSVYIT GLTVMNQRRP LPQRQLTLAH DENFLTIEFV
ALNFHRPEKN QYAFQLVGLD KQWVFSNARR FASYTNLAPG HYRFRVKAAN NDGVWNQTGT
SFGLTIEPPW WQTNWFRLMA LISLLLGMGI TIRFYTRVKL RRQRHELKKV LQAQEEERQR
LAADLHDDLG ATLSTIKGQL ETLPSLRQEL EMPIRLMGKA IGDLRFISHN LMPPEFSRLG
LAEILGEAIR QRQVSSTPVF LFVTFGQQRR LDLETELIVY RIAVELINNA LKHARARHIT
VQLIFHPEQV CLLVEDDGLG YLASHRPAAG AGLRNIRSRA AYLKGDLVVD SNPRGTMVTL
TIHY