Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4824 |
Symbol | |
ID | 8728588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5877879 |
End bp | 5880953 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | histidine kinase |
Protein accession | YP_003389601 |
Protein GI | 284039671 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0714049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.112759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAATC ACCCTTACTA CTCACGACTC ATCCGCCTAT GTCGGACCGG GTTCATGTCT CTGCTTTTTA TTAATGCCTT GCCGCTGGGT AGCTGGGGGC AGTCTACCCT AATCTTTGAC CACCTGTCAA CCGCTCAGGG ACTTTCTCAA AGTACAGTTC GAAGTATTTG CCAGGACAGG GAAGGGTTTA TGTGGTTTGG CACCCATGAC GGCCTAAACA AGTACGATGG TTATTCCTTT ACCGTATTCA AAGCGGATCC GAACGACCCG CAGAACACCC TACATCATAA CATCATCACG GATATCCATG AAGATCGGAA GGGACGATTG TGGGCGGCTA CCCTGGGGGG AGGCTTGCAT CAAATTGATA AACGAACGGG TCAAGTAACC GCCTTTGAAC TGGGGCGGAA TCGGGAAAAT GCCTGGAACA CGTTGTTTTC CATCCATGAG GATCAAACAG GGGGACTCTG GGTAGCCAGC GGGGGGGGGC TGGCGCGCTT TGATCCCGCC ACCCGCCGGT TTACCCGGTA TGCCAAACCC GCTTATCCGG TGGCGATTAC CCAGGATGCT TCCGGAAACT ACTGGGTGGG TGGTATACAG GGCGTAAGCC GGTTCGATCC CCGCACGGGC ACCTTTACGG CGATTAACAT CCGGAATGGA CAATTTAAAC AGCCCTTTAT CTCCTCTTTA TTGCTTGACA GCAAAGGAAT TCTATGGGCG GGTAGTCTGG AAGGGGGCGT GTGGCGTCTG GATGCCGGAG GGACTCCTCT GCGGTTTACC CGGTACAACC CCAGGGGGCT ACTCACAAAA TCGATCCGAT ATAATGGGAT TTATGCCAAT CGACGGGGAG AGGTTTGGCT GGCAACCGGC GAGGGTTTAC AACGGGTTGA CCCAAACACC GATCAGGTCA CTACCTATAC GGAAGATCGA TCGGTCCCGG GTAGCCTAAG TAATAACAGT ATCCAGTCCC TGTATGAAGA CCGGACGGGA TCGTTTTGGG TTGGCACCAA CCATGGTGTT AACAAAACGC CGGCCTATAC GAAAGCTTTT TCGGCTTATC AGATCATTCC GACATTAGCC TCCACAAGTT TAAATCATAA CTACATCAAT ACGTTACTGG AGGACCATAC AGGCACGCTC TGGCTGGGCA GTAGTGCCGG TAGTATGGAC GGAAGCTTTC AACATGACTT AGTCGCGGCA AATCCAGTTC AGCTCCACAG TAACCAAAGG GGCCCCTTTA AGTCAGTTGC GTCCCTGGCC AAGCAAAAAG TATGGACGCT GTACGAAGAT CGCCAAAAAC GGCTGTGGGC GGGTACCGAG AAGGGTTTGT ATCAGTATCA ACGGGCCATG GGCCACTTCA AGCGATACCC CTTCCCCTTT TCGGTTCGCT GTATTGTCCA GGATTCAGCA GGAATCTTAT GGGTGGCCAA TCATAGTGCC GGAGACACGA CCGTCATTGC GGCTTTGGAT CTTACCCATT CCCGCTCGAC CTACTATTAT CACCACCCCG GGAATACCGC TGGATTGAAC AATGCGTTTA TTTACCAGCT ACTGGCCAGC CGCAGCGGGG ACATCTACGT TGCAACCGGG GGAGGGGGTA TCAATCGGCT CAATCCCCGA TCAGGGCGAT TTATACATTA CCTGCCTGCC TATGAGTCCC GGGCTTCTCA CTTGAATGAC AAAGAGATTC GATGCCTCTA CGAAGATCAC CAGGGGATGA TTTGGGCGGG CACAGGACTG GGCGGTTTAA ACCGACTGGA TCCCCGGACC GGTAGGGTTC AGGTTTATAC TACCCATGAG GGACTGCCCA GTAATCGAAT TCTTAGTATC ATTGATGATG ATCAGGGTAA TTTATGGCTA GGGACGGCGC GGGGGCTTAG CCGCTTCGAC CGGATTACCC AGCACGTTCG CAATTACGAG CAAAGGGATG GGTTGCCCGA TGACGAGTTC AATACGGGCG CCGTTTATAA ACGGCAGGGC AGACTCTGGT TTGGTACTCG TAATGGATTT TTTGGGTTCA ATCCTGATAG CATTCAGGAC AACACAACCC CTCCCTCGGT TTACATCACC GGTTTAACGG TGATGAACCA AAGGCGCCCC CTGCCCCAAC GGCAACTGAC GCTGGCACAT GACGAAAACT TTTTAACCAT TGAGTTTGTG GCGCTGAACT TTCATCGGCC CGAGAAGAAT CAATATGCGT TTCAACTAGT GGGCTTAGAT AAACAGTGGG TGTTCAGCAA TGCCCGACGG TTTGCGAGCT ACACCAACCT GGCTCCCGGG CACTACCGAT TTCGAGTAAA AGCGGCCAAT AATGATGGGG TATGGAATCA AACGGGTACC TCTTTCGGGC TAACCATTGA GCCGCCTTGG TGGCAAACAA ACTGGTTTCG GCTTATGGCC CTAATCAGTT TACTGTTGGG GATGGGGATA ACCATTCGGT TCTACACCCG GGTCAAACTG CGCAGGCAAC GGCATGAGTT AAAGAAAGTG CTCCAGGCTC AGGAGGAAGA ACGGCAACGG CTGGCAGCGG ATCTCCACGA CGATCTGGGG GCGACACTGT CGACTATTAA AGGACAGCTG GAAACATTAC CGTCTTTAAG GCAAGAATTA GAGATGCCTA TCCGTCTGAT GGGAAAGGCC ATTGGTGATC TGCGTTTTAT CTCCCATAAC CTGATGCCGC CCGAGTTCAG CCGGCTGGGC CTGGCGGAAA TTCTGGGCGA AGCCATCAGA CAGCGGCAGG TCAGTTCAAC CCCTGTTTTT CTCTTCGTTA CCTTTGGGCA ACAACGTCGG CTCGATTTGG AAACCGAACT CATCGTCTAT CGCATCGCCG TTGAACTCAT CAACAATGCC CTTAAACATG CCCGGGCGCG ACACATCACA GTACAGTTGA TTTTCCATCC CGAACAAGTA TGCCTGCTGG TGGAAGATGA TGGCCTCGGC TACTTAGCCT CACACCGCCC AGCCGCCGGA GCCGGACTGC GCAATATCCG CTCCCGGGCT GCCTATCTAA AAGGGGACCT AGTGGTAGAT TCCAACCCTA GAGGCACGAT GGTCACGTTA ACTATCCACT ACTAA
|
Protein sequence | MDNHPYYSRL IRLCRTGFMS LLFINALPLG SWGQSTLIFD HLSTAQGLSQ STVRSICQDR EGFMWFGTHD GLNKYDGYSF TVFKADPNDP QNTLHHNIIT DIHEDRKGRL WAATLGGGLH QIDKRTGQVT AFELGRNREN AWNTLFSIHE DQTGGLWVAS GGGLARFDPA TRRFTRYAKP AYPVAITQDA SGNYWVGGIQ GVSRFDPRTG TFTAINIRNG QFKQPFISSL LLDSKGILWA GSLEGGVWRL DAGGTPLRFT RYNPRGLLTK SIRYNGIYAN RRGEVWLATG EGLQRVDPNT DQVTTYTEDR SVPGSLSNNS IQSLYEDRTG SFWVGTNHGV NKTPAYTKAF SAYQIIPTLA STSLNHNYIN TLLEDHTGTL WLGSSAGSMD GSFQHDLVAA NPVQLHSNQR GPFKSVASLA KQKVWTLYED RQKRLWAGTE KGLYQYQRAM GHFKRYPFPF SVRCIVQDSA GILWVANHSA GDTTVIAALD LTHSRSTYYY HHPGNTAGLN NAFIYQLLAS RSGDIYVATG GGGINRLNPR SGRFIHYLPA YESRASHLND KEIRCLYEDH QGMIWAGTGL GGLNRLDPRT GRVQVYTTHE GLPSNRILSI IDDDQGNLWL GTARGLSRFD RITQHVRNYE QRDGLPDDEF NTGAVYKRQG RLWFGTRNGF FGFNPDSIQD NTTPPSVYIT GLTVMNQRRP LPQRQLTLAH DENFLTIEFV ALNFHRPEKN QYAFQLVGLD KQWVFSNARR FASYTNLAPG HYRFRVKAAN NDGVWNQTGT SFGLTIEPPW WQTNWFRLMA LISLLLGMGI TIRFYTRVKL RRQRHELKKV LQAQEEERQR LAADLHDDLG ATLSTIKGQL ETLPSLRQEL EMPIRLMGKA IGDLRFISHN LMPPEFSRLG LAEILGEAIR QRQVSSTPVF LFVTFGQQRR LDLETELIVY RIAVELINNA LKHARARHIT VQLIFHPEQV CLLVEDDGLG YLASHRPAAG AGLRNIRSRA AYLKGDLVVD SNPRGTMVTL TIHY
|
| |