Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1658 |
Symbol | |
ID | 8725393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1990631 |
End bp | 1993750 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003386504 |
Protein GI | 284036574 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.450453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTTC TTCAACAAAC GGCCCAACGC ATTTTTGACA CCCATGGTCC CAGCTTAAAC GATGTTTGGG TAATCTTGCC CACCCGCCGG GCTGTGTCTA CCTTCCTGGA TGAGCTGGCC GCCCTTTCTG ACAGGCCCTT TCTGGCCCCC CACGCACTTG CGGTCGATGA TTTTATCACC CAGGCAGCGG GCGTTCAACT GATTGATTCG GTTAGTTTAC TTTTCGAGTT ATACGACGTT TTTAAAGAAA TTGACCCGCT GGTCGAGTTT GAGCAGTTTA TCGGCTGGGC ATCCATACTG CTCTCCGATT TCGACCGTAT TGATCAATAC CTCGTCAATC CCCACGAGTT ATTCAGCTAC CTGACAGCCG CCAAAGCACT CGAACGCTGG CAGGTCGACA TGCCTTCGTC CGCCAAACCG ATTGTCGAGA CGCCCGGCAC GACCCGATAC TTCAAACTGT TCGAGAATAT ACATACAGCC TACCACGCCC TTCACCAGCG CCTGAACGAG CAGCGACTGG CCTATCGGGG AATGGCCTAC CGGCTACTGG CGCAGCAGGT TGAACCCTTG ATTCGGGATA ATCTGGCCTA TGAGCGGGTG TATTTTGTGG GATTCAATGC CCTGAGTAAA GCAGAAGAGC ACATTATCCG GGTGCTGGTC GATGCCAAAA AAGCTGAACT GATCTGGGAT GCCGACCTGT ATTACATGAA TGACCGGCGG CAGGAAGCCG GCGAATTTCT GCGACGATAT AAGGACAATG GCTGGCTTTT TTCGAGACAA AACCATGCCG ATCTGGCGCA ACTGTCTAAT AACCTGCTAG GTTCTGAAAA AAATATCCGC GTCGTTGGCG TACCCAATGC CAGCATGCAA ACCAAGGTAG CGGGTAAGAT TTACAGCGAA TGGCAACGGG CCGATAGTGC AGTGCCACGC GATGCTGCAA CGCCCTCGCC AAAGACCGCT ATCGTACTGG CTGACGAAAC ACTTCTGGTG CCCGTGTTAT ACGCGCTGGA TGAAAACGTG ACCGACCTGA ACGTTACTAT GGGTTTGTCG TTGCGTTCGT CGCTGTTGTT TACCCTGGTC GACACGTTGT TTGAGATGCA GCGGACGGTC CATGAGTTCC GTACCAAAGA CGGGCGCGAC CTCAAGATCC CCAAGTTTCA CCATCGCCAC GTCGTAAAAC TCCTTAACCA CCCCTTCCTG AAGCAGTACG AGCGTATACG GGGGCTGATG TGGCCGGGCG ATGTACTGTC GACAGGGGAA ATTTTACCGC CCGAACCGCT CTTCCAGTGG ATCGCCAAGG AGATTGTGAA AAATCAGCGG GTATACCTGA CCGAGCGGGA CATGCTGGAA CTGGGGCAGG ATGACCCACT CGTGCGGGTG CTGTTCAGGC GTTGGCCCAA TGAAGAGCCC ATGAAGGCCA TTCGTACGTT TTATGACCTG ATCGAGTTAC TGCGCGACGT GTATCGTACC AGTCAGGATG CCATCGAGAT CGAGTACCTA TACCTTTTCT TTACCCTGCT CAAACAGCTG GAAGCAACGC TGGACAGGCA GGGCGAAGGA GCACAAGGGG CCGGGCGCGG AGTACCGGTG CCCAAGGCGC GAAATAGAGG GCAGGGGGGA ACCGCCATCC TGGATACGGG CGCCCCGGTT CCCATGCTAC ACGTCCACGA ACCGTCGGCC GCCGTTACCG TGCGCAGCCT GAAACAGTTT TTGTACGAAC TGATTCGGCA AACGAGTATT CCCTTTACCA GCGAAGGGAA GAGCCAGTTG CAGATCATGG GTATGCTCGA AACACGGGCT CTGGATTTTG ACCGGGTTAT TATTCTGTCA GTGAACGAAG GGATTCTGCC GCAGTCCAGA AAGTTAAACT CGCTTATTCC GTTCGATATA GCCGCCGATG AAAACATAAA GCTGCCTACT TATAGCGAAC AGGAGGCCGT GATGGCGTAC CACTTTTACC GGTTGCTGCA ACGGGCCAGC GAAGTGGTGC TCTTGTACAC AACCTCGACA GATGCGTACG GGAATAGCAA AGGTGAGCCA AGCCGCTTTA TCCGCCAGCT GGAACACGAG CTGGTGCCCC GCTCTAACGG ACTCGTTCGG ATAAGCTACC CAACGGTTCG TTTCGGTCGG ACAAGCGAGA AAAAGGAAAC CAGTCTGACC GAGCTGAGTG TGCCCAAAAC GGAATCGGTG CGGGACGGTC TGATCAATCT GCTCATAACG AAAGGGTTGT ATCCATCTTA CCTGAATCAG TTCGTGAGCT GTTCCATGCG GTTTTACTTC AGCCGGATTG TAAATATTAG TGAGGAAGAA GACATCGAAG AGAAAATGGG AGCGGCTGAG TTCGGAAGCT GGCTGCACAA AGTGATGGAG CGGCTAGACC TTGAGTACCG CCTGAAGGCG CTGCCCATCG ACGAGTCGAT CATTAAAATG CTGCTCGAAG AAGAGTTTGC CAGTACCAAC AAAGGCCGGG TTATCGAGTC GGGCATGAAC CTGCTGCTCT ACGACCTGGC ACAGAAACTC ATGCTCGACT TTCAGCGTCA GCAGAATGCG CTTCCTGGTT TGACCGTTAT CGGAACGGAG CAAACCCTCG AAACGTATTT GACTGTATCC ATTGAAGGGC GTGGGGCCGT TCGGGTGCGG ATAGCGGGTA AAGTAGACCG TATCGAACGT CTGGGTGATC AAATTCGAAT TGTCGATTAT AAAACGGGCA AAGTCGACCT GTCCGAAAAA ACGCCCAAAG ACCTGAGTGA TCGATTACTG AACGATGGGG GCGACGATGC GGGTAAGATG CGGCAGTTGT GGCTGTACCG GTATCTGGCC CTTAAAAACA TTAGCGAGTA TGGTGGTTTG CCCCGCGACC GGGCTAAACG GGATATTTTT AATGCGGAGG GTATGCCTGT CGAAGCTGGC TTTTATTCGT TCCGGGATGT GAATGGGGGC TTTAAAACAA ACCCTGTTCG CTTCGGAGAC AATGATAGCC CTGGTCAGTA CATCGAAGAT TCGGAGGATT TACTTCGCCA ATTGATACAA CAACTGCTCG ACCCTGAACA ACCGTTCAGG AAAACGGACC AGATTGAGAC CTGCCAGTTT TGTGATTATA AGGGTATTTG CGGGCGATAA
|
Protein sequence | MTFLQQTAQR IFDTHGPSLN DVWVILPTRR AVSTFLDELA ALSDRPFLAP HALAVDDFIT QAAGVQLIDS VSLLFELYDV FKEIDPLVEF EQFIGWASIL LSDFDRIDQY LVNPHELFSY LTAAKALERW QVDMPSSAKP IVETPGTTRY FKLFENIHTA YHALHQRLNE QRLAYRGMAY RLLAQQVEPL IRDNLAYERV YFVGFNALSK AEEHIIRVLV DAKKAELIWD ADLYYMNDRR QEAGEFLRRY KDNGWLFSRQ NHADLAQLSN NLLGSEKNIR VVGVPNASMQ TKVAGKIYSE WQRADSAVPR DAATPSPKTA IVLADETLLV PVLYALDENV TDLNVTMGLS LRSSLLFTLV DTLFEMQRTV HEFRTKDGRD LKIPKFHHRH VVKLLNHPFL KQYERIRGLM WPGDVLSTGE ILPPEPLFQW IAKEIVKNQR VYLTERDMLE LGQDDPLVRV LFRRWPNEEP MKAIRTFYDL IELLRDVYRT SQDAIEIEYL YLFFTLLKQL EATLDRQGEG AQGAGRGVPV PKARNRGQGG TAILDTGAPV PMLHVHEPSA AVTVRSLKQF LYELIRQTSI PFTSEGKSQL QIMGMLETRA LDFDRVIILS VNEGILPQSR KLNSLIPFDI AADENIKLPT YSEQEAVMAY HFYRLLQRAS EVVLLYTTST DAYGNSKGEP SRFIRQLEHE LVPRSNGLVR ISYPTVRFGR TSEKKETSLT ELSVPKTESV RDGLINLLIT KGLYPSYLNQ FVSCSMRFYF SRIVNISEEE DIEEKMGAAE FGSWLHKVME RLDLEYRLKA LPIDESIIKM LLEEEFASTN KGRVIESGMN LLLYDLAQKL MLDFQRQQNA LPGLTVIGTE QTLETYLTVS IEGRGAVRVR IAGKVDRIER LGDQIRIVDY KTGKVDLSEK TPKDLSDRLL NDGGDDAGKM RQLWLYRYLA LKNISEYGGL PRDRAKRDIF NAEGMPVEAG FYSFRDVNGG FKTNPVRFGD NDSPGQYIED SEDLLRQLIQ QLLDPEQPFR KTDQIETCQF CDYKGICGR
|
| |