Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1764 |
Symbol | |
ID | 8725501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 2124493 |
End bp | 2127732 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003386608 |
Protein GI | 284036678 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00516909 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGATG GAACGATTCT GCGCTCCGAT TTTTACTCGG AATACCCGGT GTTGACCACC GTTCGTCCCA GCCGGACCAC TATATATACC GTTCAGTCCT TCCAAACGGG CTGCGGGCCT GTTCCGGTCA AAACAGCGAG CAGCGTTGTT ATAACCGTCA GTTCGGGCAT CACCATCGAC TCGGTCAGCC AGGGACCCAT TTGCGAAGGG CAAACCCTGC GGTTCAAGTT TTTACACAAC CTCTCGCTGG GTGCCAACAA CCAATTTATG GTTCGTTTCC GGCACAGTTC CGGTGTAGTG TCCGATGCCA TTGCTGCTCA GCAACAGGGC AACTACCTAA CGGTAACAGT TCCGTCGTTC ACATTACCCC AAAACCAGAC TTTCCGTAAT CAGTCCTTTA CTATGGAGGT AAGCAGTACA CAACCGGCTT ACACCTCTGA ATTGAAAGGG GGTCTGAGCA TCCTGACCTA CCCTACTATG CGCTGGTCGG ATAATAATGT CTACACGATC GACAAGCCAC AGCAACAGGT AAACTGGTAC TGGTTTGGCG ACGGAGGTGG TCCCTATCAG TTGGAGATGG AAACCGGCCA AACCGCTGGC ACAACTATCT GGGATAATGT ATCATCGGTT TCGGTACCGG GTAATATTTC CCAGAACTTC CGGGTGAAGT CGGTCCGGAA TGCCTGTTTC GTCACGAACA ATCCTTCACA GGTATCGCTG ACGGTCCGGA ATACAGGAGG GTACTTTATT TATGTCAAAC CCTACAAAGG CGTTGCCTGT CAGGGCGATA GTATAGAGCT TGGCTTTGAA ACAACCGGCG AGTTTGCACC CGGCAATCAA TTCCGTATTC AGGCGCGGGG CGGCAGTTCG TGCTGCAGCT ATCCGGATAC CTGGGCTACA ACCACCAAAT CGGGTACCAT TAAATTCAAA CTACCGACCG ATTTCGGCTG GTACAGCACG GGTGGCAGTG AAATGGCTTT CCGTATTGCC TCCACCAATC CGGTTGTGTT CAGTGAAGAC CGGTATTTGA GCATTCACCG TCCGGTTTAT AGTATAAACA TCAGCGGGCT GGCGGAAGAA CTGCTGCAGC CCGGCACCGT TACCCGAACA ATTAGTTACT ATGGCGGCAC GCCGGTAACG ATCAACTATA CCCTCGGTGG AGCCAACTAT AATCTCGTCT CATCGGGTTG GTACAGTACA GATATCAGTT ACCCGGTTAG CGGAACGACT ACGTTTACGG TAAACTCAAT TGCCAATGCC TGCGGTCCGG TACCAGTCAA CCAGTCGACT ACCCACCGGG TTCTCCCCTA CATCTTAAAA ACTCCTCCCA TTGCCGGTCA ATCGTACGAG CCCGTTTCCT TTTGCGCCGG CAGTACGCTC ACCCTGCCGT ATCTGATGGT TGGGCAACCC GACCCGGCCA TCACTGTTTC GGTACAGTAC CGGCCAGCCA GCACCACCGA ATTTCGTACC CTGGCAACGT CCATCCGGAC AAATCCGGTG GTCGTCACGC TCCCCGATAC GCTTCAGGCG GGCGATTATG TGATTCGGCT GGTATCCAAC CTCGCCATTG CATCGGCAAA TCAGACGATT CGGGTGCGCC GGAAAGCAAC AGCGCTGCTA ACAACAGAAA GTGGCACTTC TTCGCTCGAT ATGTATCCCG GCAGCTCGAC CGCTTTTCGG GTGAACTTTA CGGGTTCACC CGACTGGACG GTATTGCTTA CCAGCGGGCT GCGGCAAGTG TTCTCATCCA GTCCGGGCAC ATTGTACGTG AACCCGAAAT CGAAGACGGT CTATGCCATT CAGGCAGTTA CAAATACCTG CGGATACGGA ACTACTGCGG GCGAGATTTC CGTACGCATT AAACCAACCT TATCACTCAG TACGAACAGC AATTCTTTCT GCACGGGTGC TAAAATACCC GTCACCTACA GTGCTCAGGG CGACTTTGAA CCCGGTAACC GAATCCGGAT CGGCTTAGTG GACGGAGCCA CAGTGCGCTG GCTCGATTCC ACCGCCACTA CCCAGGGGAC TTTTCAGGTT AGCTTACCAG GCAGTTTAAC GGCGGGAGGC TCCTTCACAT TAAAACTGGC TTCGACCAAT CCGGTGCAGG AGACCCAGAT GAGTTTTCTG CTGGCATCAC CCCCCGTCGT TAAACTGGGG GGCAATGCGA TTATCAACCC CCAGCAAAGT GCCATCATTC GCTTAACATC AAATCAGGTG GCAAGCGGGT ACGGCATCCC CATCCGATAT GCATTAGTTA CGGGGGAATC GGGGGAATTT TACCCCGGGT CGTCCGGCTT TGACCTAACC GTCAGACCAA CCCAAACTAC TACCTATCGA CTGGCATCTG TATCGAACTT TTGCGGAACA GGTCAGTTTT CGGGGGCAGC GACGATAACC GTCAACCCAC CCACCGATCG CCAGATAACG ACTCTGGAGG TCAACGGCTT CTCAACAATC TGTTCGAATG ACACCGTACG GGTAACGTTC GACACAAAGG GGACCTTCTC AGCGACGAAT CGCTTTACGG TACAACTGTC CGATTCGACC GGCACGCAGT TCAGCGACCT GACCACGTTT GGCACATCAA GCCCATTAAA AGCGCTGGTT CCTGCCAATA TGCCCCGTGG CTCGTTTTAT CGGGTTCGGG TTGTTGCGTC GGACGCTGGT GTAAGCAGTT CAACCAACAT CGCCCCTCTC CTGCTGCGGT TTGCGGCAAC AGCCGCATTT GAATCAGCGA CAATTGGTTT CACACCAGGA AAACCCGTGA AGCTGAAAAT AAATCTAACG GGCGACGCCC CCTGGACAAT CCGGATCGGG AATGAGTTCA ATCCAGTTGG CACGTTATAC GCGAGCAGTA CACCTTACTC GATTGATTTA TCGCCAACGG CGGCCTCAAC TATTTACAAA TTGTATCAGG TAACCAACGG ATGCGGGTAT GGGAAAATAG TTGAACCTTC TGTGGTTCAG ATTAGTGTGC TGACAGCCAC TGACCCGGCT TTGGAAAAAC AGTTTGTGGT GTACCCGAAT CCGACCAACG GCTGGGTAAC TATTCGTCAG GACGGGGCAA CAACCCCCTA TCGTGTTCGG GTAACGGACC CGAAAGGAAA TGTATTTTAT CAGAAAAGTA CGGCAAAAGA GATTGACGGA GAAGACCTGT CCTTACTCCC AACCGGCGTA TATTTGCTGA CTATTGACAC AGACAAATCA AGTCTGGTTT TTCGGATTCT GAAAAACTAA
|
Protein sequence | MSDGTILRSD FYSEYPVLTT VRPSRTTIYT VQSFQTGCGP VPVKTASSVV ITVSSGITID SVSQGPICEG QTLRFKFLHN LSLGANNQFM VRFRHSSGVV SDAIAAQQQG NYLTVTVPSF TLPQNQTFRN QSFTMEVSST QPAYTSELKG GLSILTYPTM RWSDNNVYTI DKPQQQVNWY WFGDGGGPYQ LEMETGQTAG TTIWDNVSSV SVPGNISQNF RVKSVRNACF VTNNPSQVSL TVRNTGGYFI YVKPYKGVAC QGDSIELGFE TTGEFAPGNQ FRIQARGGSS CCSYPDTWAT TTKSGTIKFK LPTDFGWYST GGSEMAFRIA STNPVVFSED RYLSIHRPVY SINISGLAEE LLQPGTVTRT ISYYGGTPVT INYTLGGANY NLVSSGWYST DISYPVSGTT TFTVNSIANA CGPVPVNQST THRVLPYILK TPPIAGQSYE PVSFCAGSTL TLPYLMVGQP DPAITVSVQY RPASTTEFRT LATSIRTNPV VVTLPDTLQA GDYVIRLVSN LAIASANQTI RVRRKATALL TTESGTSSLD MYPGSSTAFR VNFTGSPDWT VLLTSGLRQV FSSSPGTLYV NPKSKTVYAI QAVTNTCGYG TTAGEISVRI KPTLSLSTNS NSFCTGAKIP VTYSAQGDFE PGNRIRIGLV DGATVRWLDS TATTQGTFQV SLPGSLTAGG SFTLKLASTN PVQETQMSFL LASPPVVKLG GNAIINPQQS AIIRLTSNQV ASGYGIPIRY ALVTGESGEF YPGSSGFDLT VRPTQTTTYR LASVSNFCGT GQFSGAATIT VNPPTDRQIT TLEVNGFSTI CSNDTVRVTF DTKGTFSATN RFTVQLSDST GTQFSDLTTF GTSSPLKALV PANMPRGSFY RVRVVASDAG VSSSTNIAPL LLRFAATAAF ESATIGFTPG KPVKLKINLT GDAPWTIRIG NEFNPVGTLY ASSTPYSIDL SPTAASTIYK LYQVTNGCGY GKIVEPSVVQ ISVLTATDPA LEKQFVVYPN PTNGWVTIRQ DGATTPYRVR VTDPKGNVFY QKSTAKEIDG EDLSLLPTGV YLLTIDTDKS SLVFRILKN
|
| |