Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0139 |
Symbol | |
ID | 8723867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 168476 |
End bp | 171676 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003385008 |
Protein GI | 284035078 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGGCG TGGCCAATAG GGCTAGGGTG TACGTAAACA TGGAAAGTCT TTTAGAGCTT TGGTTTGAGA CGTCTGAGCA GGGAGTCGCT TTTTTGACGC CTGTTTACCT GGAATTGGAT CAAATTGCTA CATTTCATTG CCAGCGGGTC AACAAAACGC TGGCTCAACT GCTGGGTAAC TCCCCTGGCG AACTGATCGG GAAGGTCATT GATCCGTTCG TTCCCTGGAT ACCGCAGGCC GAGTTACTAA GTAAGCAGTT AACCGTTTTG CAAACCGGAG AGCCCTGGCA GGGCCGCTAC TACTACCCTG AAAAAAAACG CTGGGTACAG GTCAGCCTGA CCCGGCTTGC CGATCAGGTC GTGATAAGTT TTCTGGATGT GACCGCATAC CAGAAACCAG CAGATCAGCC CCCGGTTCAC CCGCCCGCCC GTCCGTATCT CTGGCAGGAC ACGAATCAAC AGTCTGGCAC ACTGGCTGAA AACAACCAGT TGTTGCAAAC GATCATCGAT ACCAGTCCGA CCAGTTTAGG CCTGTTGAGG CCCATTTGGC AGGAGGGAGC TATTGTTGAC TTTCGCGCTC TTATCAGTAA CCCGCAGAGC GTTAGTATAA CCGGGTTAGA TTCGGATACG CTGCTGACCC GGTCGATGCT CACGCTATTT CCTCAATTTT TGCCGAATGG CGTATTTGCC AAGATGGTCG ACGTGGTGCT TACGGGCGAG GCTCAGCGTT TTCAGATGAT GGATGAATTG GCCCCGGGGT CGTTCTGGGG TGATTTCTCG CTGGTTCGGG TTGGTGGTGA TATCCTGTTC AGCGTCAATG ATATTACCCG GATAAAACAG GTTGAAGAAG AACTGCGGAC GGCCAATCTG GAACTGGAGC AACGGGTAGC CCGGCGCACG GCCGAAGTCC GGCAACTGTC GGCGTTACAG GGGGCTATCC TAAAATACGT TGGCCTGGGA GTGGCTGCCA CAGATACTAA AGGCATTATT CAACTGGTAA ACCCGGCATT GGAAGCCATG ACTGGCTACC GGGCGGATGA GTTGGTAGGC CAGCGTACGA CTGGTTCGCT GCGGGAGCCG GTGCTGCACC AGCAACAGCT TGACCAGCTA ACGCTTGAAC TGGGTGAGGC TGCCGGGCAG GGCGAAGAAG TAGTAGCCCG GTATGTAGCC AGACACAATT TTTTGCGCCT TGAAAATACC TTGCTAACAA AAGAAGGGCG AGTTATTCCG GTTCTGTCGA CGGTGACCGG GCTCTACGAC GAGCAAAACG AATTGATGGG CTATGTGGAC ATCAATACGG ATATATCTTA CCGGAAAACC GTTGAAGAGG CTCTCATGCA GGCCGGCCAA CGCAGCCAGT TAGCCACAAA AGCCGGTAAA CTGGGCATAT GGGAATGGAA TTTGCTAACG GATGAGCTGA TTCTTGACGA GAATTTTTAT ACGCTGGTGG GTATTCCCAA GCGTACAGCC CTGGCCCGGA TGAGCGATGT GGAGCCGCTG GTACATCCGG GTGATCTGGC GTTTTTTACG GATAAGGTGC AGGCCATTAT TCAGAAGCAG CAGCCTTTTG AGATCGAGTT TCGGATCATC TCTCCAATTG ATGGGTCTAC ACGATACATG AAGGCGGACG GGCTGGTTCT CCAGAACGAA AGTGGGCTAA GTGATCGGAT GATTGGCGTG CTCCGGGATC GTACCGCTAA ACGACAGGCT GACCATGCTC TCCGGGTTAG TGAACAACGC TACCGGTCGC TGGTCGACCA CCTGAGTGCC GTTGTCTTCC AAACTGATGC GGCCGGAATG TGGACGTATC TTAATCCGGC CTGGGAGGTC ATAACCGGCT TCTCCGTTGA GGAGTCGCTC GGCCGCTTTT TTCTTGACTT TATTGTTGCT GACGATCAGC CCAAAAGCAC CTCCCAGTTT GATTACATCG TAGAAAGTCA TAAGGAGGTG CTCAAGCAGG TGATCCGTTA CATTCACAAA GATGGGGGTT ATCGATGGAT GGAGGTCTTT GCCCAGGTAA GCCGTAATCA GCAGCTGGAA ATAACGGGTG TTACGGGTAC ACTGACCGAT ATCACCGATC GCAAGCAAGC CGAGGAAGCC CTGATTGAAA GCGAACGCAG ATTCCGCGAA ATTGCCGAAA ATGTCGACGA GATGTTCTGG ATTCGGGATA TCAACTCGCC GGTGTTCCTC TACATGAACC CGGTATTTGA ACTATATAGC GGCCTCACTG TAGAGGCCCT GTACGAAGAT CCGCTGATTT TTGCCAGGAG TATTATAGAA GAAGACCGCG CGGCAGTAGT GGCGGCTTTC ATAAGTAATG AGCCAAAATC TACTTTTCTG TTCAGGATTA TTCATCCCGA TGGTAGCCTT CGCTGGATCA ATGCCCGAAT TTTTTTACTG ACTGATGAGG ATGGGGTGCC TGTGCGTCGG CTGGGGGTGG CTACTGATGT GACAACCGCC ATTGAAAAAG AGCAGATTCT GGAGGAGTCG CTGGCCAAAG AACGAGCCCT GAATGCGCTT AAAACACAGT TTATTACAAC CGCTTCCCAC GAGTTCCGCA CTCCTCTGGC TTCTATTATC TCAAGCGTCG AGTTAATAAA GTATTATGCC GACCTGGAAG ATCGATCCGA AGCAAACACA TTGATTAACA GGCATGTTCT CTCAATTTCA AAGCAGGTTA TGGCTCTGAC GGACCTGATA GCGGATACGT TGACCCTGAG TAAGCTGGAA GAAGGGAAAA TACAGATTCA GGTAGAGCCG ACTGATGTTG TAGCCCTCAC GGAGGAGTTG ATAGCCTTTA ACTTCAGTAA TCGGGAGGAT AAGCGACAAG TGGGGCTAGA CGTAACTGGT GCTCCGGTTC CGGTAAGCGT CGATAAGAAA CTGATGGCCC ATGTATTGAC GAATTTATTA TCCAACGCCT TTAAATTTTC TACCACCAGC CCAAAAGTAC AAATCCGGTT CAAGCGGGAG TCATTTCTTA TTTCGGTGAT CGATCAGGGC ATTGGCATTC CACGCAAAGA TTTACCGCAT CTATTCGGGA AATTTTTTAG AGCGAGCAAT GCGACTCATA TTAAAGGGAC TGGTTTAGGG CTATCTATTT GTCTTGAATA CATCACTTTA CAGAATGGAA GCATTGACAT AGCCAGTACG GAAGGGGTGG GGACAACCTT TACGATTGCC CTACCAATTC ATAAACACTA G
|
Protein sequence | MHGVANRARV YVNMESLLEL WFETSEQGVA FLTPVYLELD QIATFHCQRV NKTLAQLLGN SPGELIGKVI DPFVPWIPQA ELLSKQLTVL QTGEPWQGRY YYPEKKRWVQ VSLTRLADQV VISFLDVTAY QKPADQPPVH PPARPYLWQD TNQQSGTLAE NNQLLQTIID TSPTSLGLLR PIWQEGAIVD FRALISNPQS VSITGLDSDT LLTRSMLTLF PQFLPNGVFA KMVDVVLTGE AQRFQMMDEL APGSFWGDFS LVRVGGDILF SVNDITRIKQ VEEELRTANL ELEQRVARRT AEVRQLSALQ GAILKYVGLG VAATDTKGII QLVNPALEAM TGYRADELVG QRTTGSLREP VLHQQQLDQL TLELGEAAGQ GEEVVARYVA RHNFLRLENT LLTKEGRVIP VLSTVTGLYD EQNELMGYVD INTDISYRKT VEEALMQAGQ RSQLATKAGK LGIWEWNLLT DELILDENFY TLVGIPKRTA LARMSDVEPL VHPGDLAFFT DKVQAIIQKQ QPFEIEFRII SPIDGSTRYM KADGLVLQNE SGLSDRMIGV LRDRTAKRQA DHALRVSEQR YRSLVDHLSA VVFQTDAAGM WTYLNPAWEV ITGFSVEESL GRFFLDFIVA DDQPKSTSQF DYIVESHKEV LKQVIRYIHK DGGYRWMEVF AQVSRNQQLE ITGVTGTLTD ITDRKQAEEA LIESERRFRE IAENVDEMFW IRDINSPVFL YMNPVFELYS GLTVEALYED PLIFARSIIE EDRAAVVAAF ISNEPKSTFL FRIIHPDGSL RWINARIFLL TDEDGVPVRR LGVATDVTTA IEKEQILEES LAKERALNAL KTQFITTASH EFRTPLASII SSVELIKYYA DLEDRSEANT LINRHVLSIS KQVMALTDLI ADTLTLSKLE EGKIQIQVEP TDVVALTEEL IAFNFSNRED KRQVGLDVTG APVPVSVDKK LMAHVLTNLL SNAFKFSTTS PKVQIRFKRE SFLISVIDQG IGIPRKDLPH LFGKFFRASN ATHIKGTGLG LSICLEYITL QNGSIDIAST EGVGTTFTIA LPIHKH
|
| |