Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4599 |
Symbol | |
ID | 8728363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5579374 |
End bp | 5582463 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003389376 |
Protein GI | 284039446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.645634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACCAG TCTCTGCCTT CGCCAATGCA CTGCCTAGTC CGGATTTAAG CTCTCACCTC AGCAGTGCGG TACTGTATCA TGCTCAGCGA AATGCCGATG GGAAAATCAC GGATTTTAGT CTGGTTATGC TCAACTCCTC TACGCAGGAG ATGTGGGGGT ATCCGCAGCG GGAACTGACC GGGCGTTCGA TCAGTCAACT ATTTACGGTG GCCGACCGGC AGTTCTTACT TCAGCAATTC AGCGTGGTGG TAGAGCGTGG GCGGGCCGTT CGGTTCGAGA TGGAATACAG TCGGCCGGGT AATCAGTCGG TGCAGGTGTA CGATATGCTG GTCACTAAGC TGGAAGATGG CGTGCTGGTT AATTACGGCG AAATCGTACC CCCTGCCCAA ACAGAACGGG TGCCGCAGCA ACAGACCGAC TTGCTCCGGA AGCTCTTTCA CGAATCGCCC TGTAGTATGC AGTTGCTGGA GCCCGTTCGC GATGCCGAGG GTAAAATCGA GGATTTTATC TGCCGGATGG CCAATCAGGC CAGCCTGAAT CAGATGAGGG TGGTTGAAAG TCAGGTAGTT GGCAAGCGGA TGCTGCCCCT TTTTCCCAGC CTTCTGTCGT CGGGCGTGTT CGACCGTTTG GTGACCGTAG TCGAAACGGG CGAGACTCAG CAGCAGGAGC TTTATTACAA AGGTGATGAT GACCAGATCT GGATCGATGC CAAATACATC CGACATAACG ATGGGGTGAT CAGCATGAGT TTTGACCTGA CAGCTACCCG ACTGGCCGAG CAGAAGTATC AGCAGCAGGC CGAGTTGTAC AAGACCATTC TGGATACGGC GTTAACGTCC ATCACGGTTC TGGAAGCCGT GCGGGATGCC GACGACAAAA TCATCGACCT GCGCTATACC CTGGTCAATC AGGAGCGATT GCGGTTGGCA GGCAAGCCCG AATCTTTTTT TCTGGGTAAA CGCCTGACGG AGGTCTACCC GGGAATGGTT GAATCGGGCG TGTTTCACCG CTGGGTAGAG GTTATCGAAA CCCGGCAGTC ACAGAAGTTC GAAGTAAACT ACCACTATGA TGGTTTCGAC GACTGGAGCC TTTGCCTGGG GTCGCCCTTT GGCGATGGCA TTGTCGTTTC GTATACCGAT ATCACGAAGC AAAAAGAGGA TGAGTTGCAG GCCCGCCAAC AGGCCGAGTT ACTGAACAGC ATTCAGAATA CGTCGCAGAT AGGTATTTCG GCCTGCAAAT CCATTCGCGA TTCGGCGGGT ACAATTGTTG ACTTTCAGCC TATATTCCGG AATGCGACGG CCTCCCAGTT GCATCGTCAT GCACCGAACG AGCCAATTAA AGCTACCCTG CTTGAAGATA TGCCCAGCCT GAAGCCGTCA GGTGTGTTTG ATCGCTATGT ACGGGTGGTC GAAAGTGGCC AGCCCGATCA ATTTGAACAG CACTTTAGTA ATGGCGGTCT TGATGGCTGG TTTGAGTTCT CGGTGCAGCC CTGGGAGGAT GGATTCGTTC TGAATATCCT CAATACAACC AGCTTGCGTC GGGCAGAACG GGAGAAAGTA CAGCAGAGCA TCATACTTCA GCAGGTTATC GACAACTCCC AGGCCGGACT GGTGCTGGCC CAGCCTGAGC GCGACGAATC GGGTACGATC GTCGACTTCC GGTATGTGCT GACAAATGAG TACAACGCCC GCACAACGGG CTGGACGGTG GCCGAGATGG CCGATGCGCT GGTGAGCAGC CTGTTTCCCG GATGGCAGGA TTCGGACTTG TTCCGTCGGT ACGTCGAGGT TGTTGAGAGT GGGCAGGCAC AGCGACTGAC CTTCCCGTAC GAAGCTTATC AAATGAATGG CTGGTTCGAT GGTTCGTTTA ATTGCGTAGA CGGCTATCTG CTCTATACGT ATACCGATGT GACCGCGCTT AAAGAAGCCG AACTGGCTCA GCAGCAGTAC GCCAGCTTAC TTGAGCAGGT CATGAATATG ACACCTGCCG CCATTGTACT GAATGAGAGT ATCCGGGATG AGACCGGCCA AATTGTTGAT CTGCGCATGG TCAGGCTGAA TCATATGGCT ACCAAACTGA TGAAAAACCC AATCGATAAG GTTCAGTTTC GGCGCGTCTC CAAGTACATT CCCGGCTCGC TGGATACGCC CCTGTTCGCG CAATGCAAGC AGGTGATCGA AACGGGTAAC CCTGCCCGTC TGGAAGTTCC CTGGGACGAC CGTTGGTATG ATTTCTCCCT GGCCCGTTTT GGCGATGGTG TTTTGCTTAC CGTACAGGAT ATCACCCCTA TGCGCGAATA CCGGCAGAAA CTGGAACTGG CAAACCTCGA ACTCAAACGG TCCAACGAAA ATCTACAGTC GTTCGCCTTC GTTTCCTCGC ATGATTTGCA GGAGCCCCTT CGCAAAATAA TATCGTTTGC AGACATTCTG AGTACCCAGT ATGCCGGGCA GTTCGATGCA CCGGCTACCG ATATTGTGAA GCGAATCAAC ACGTCGGCCA ACCGGATGCG TCTGCTTATT CAGGATCTGC TGGCCTACTC GCAGGTCGAC ACACGGGAGG ACTCGTTCGA GCCGGTCAAC CTAACCCGGT TGATTGGAGA ACTACAGGAA CATGAACTCT GGATGACCAT CCAGCAGAGT AACGCCGAAA TCCACCTTGG CGAGTTGCCA ATTCTCATGG CCGACCGGCT GCAAATGCGA CAGCTGTTCC AGAACTTGTT ATCCAACGCT ATCAAATTTT GCCCGAAGGG TGTAACGCCC GACATCACCG TGAGCAGCCG ACTGGTTAAG CACTCCGACG TATCGGTGAA GTTGATGTCG GCCAAACTGG AAAAGAACCG GGCGATGGAA ACATGGTTCG CTGAAATTTC GGTAAGCGAC AATGGGATTG GCTTCGATGA GAAGTATCTG GATCGCATCT TCCAGGTTTT CCAGCGGCTG CACGGTCGAA GTCAGTACAG CGGGTCAGGT ATTGGACTGG CCATTTGCTA CAAGATCGCT GAACGGCACA ACGGAGCCAT TACCGCCAGC AGCCAGCCCG GCCAGGGCAG TACCTTCCGG GTCTACCTGC CTATCCGGAA GGAAAGGTGA
|
Protein sequence | MIPVSAFANA LPSPDLSSHL SSAVLYHAQR NADGKITDFS LVMLNSSTQE MWGYPQRELT GRSISQLFTV ADRQFLLQQF SVVVERGRAV RFEMEYSRPG NQSVQVYDML VTKLEDGVLV NYGEIVPPAQ TERVPQQQTD LLRKLFHESP CSMQLLEPVR DAEGKIEDFI CRMANQASLN QMRVVESQVV GKRMLPLFPS LLSSGVFDRL VTVVETGETQ QQELYYKGDD DQIWIDAKYI RHNDGVISMS FDLTATRLAE QKYQQQAELY KTILDTALTS ITVLEAVRDA DDKIIDLRYT LVNQERLRLA GKPESFFLGK RLTEVYPGMV ESGVFHRWVE VIETRQSQKF EVNYHYDGFD DWSLCLGSPF GDGIVVSYTD ITKQKEDELQ ARQQAELLNS IQNTSQIGIS ACKSIRDSAG TIVDFQPIFR NATASQLHRH APNEPIKATL LEDMPSLKPS GVFDRYVRVV ESGQPDQFEQ HFSNGGLDGW FEFSVQPWED GFVLNILNTT SLRRAEREKV QQSIILQQVI DNSQAGLVLA QPERDESGTI VDFRYVLTNE YNARTTGWTV AEMADALVSS LFPGWQDSDL FRRYVEVVES GQAQRLTFPY EAYQMNGWFD GSFNCVDGYL LYTYTDVTAL KEAELAQQQY ASLLEQVMNM TPAAIVLNES IRDETGQIVD LRMVRLNHMA TKLMKNPIDK VQFRRVSKYI PGSLDTPLFA QCKQVIETGN PARLEVPWDD RWYDFSLARF GDGVLLTVQD ITPMREYRQK LELANLELKR SNENLQSFAF VSSHDLQEPL RKIISFADIL STQYAGQFDA PATDIVKRIN TSANRMRLLI QDLLAYSQVD TREDSFEPVN LTRLIGELQE HELWMTIQQS NAEIHLGELP ILMADRLQMR QLFQNLLSNA IKFCPKGVTP DITVSSRLVK HSDVSVKLMS AKLEKNRAME TWFAEISVSD NGIGFDEKYL DRIFQVFQRL HGRSQYSGSG IGLAICYKIA ERHNGAITAS SQPGQGSTFR VYLPIRKER
|
| |