Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_2123 |
Symbol | |
ID | 8725861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 2573359 |
End bp | 2576580 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | YD repeat protein |
Protein accession | YP_003386956 |
Protein GI | 284037026 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAAC TCTACCAGTG GCTGGCCATA GGCTGTCTGC TCACTACTTT CTTGCCTAGT CTGGCACAAA CGCCCGGCAC TAACTACACA ATCAGCCGAA CCTATAAACA GGCTAACATT AGCGAAAACT TGACTGGATC GGGCTTTACA GCAACAGCTC AACAGGCCAC TAGTCAAGTC TCCTACTTCG ATGGATTGGG GAAGCCTATT CAGCAGGTGA TGGCTTTCGG GGCGGGGTCC AAAGCTGACA TCGTTACGTT AATTGAATAT GATGCTCTAC AGCGACCCGT TCAGACATTT TTACCCACTC CAATAAACGC AAATGGTGGA CTGCTTCAAG CGAATGGTGA TCTCAAAATA AAAGCAAAGA CATATTATAC GGACGTCGCA AAAGTTTCAG CTCCTATTAA TACTACTGCT CCTATAGAGG TCTTAGCAAC TCAAACTTTC TACGAAAACA CACCGCTTAA CCGGGTCACG AGCCAGAAAG CCCCGGGAGC CACTGGCAAT TCCACTCAAT TCCATGGGGT GAACGCCATA AATGATGTTA AATACTTTCG AAGCAGTGCT GCAGGGCTAG CAAGTATTCA ATTGGTTGGT ACTTATGAGG CAGGTGAGTT GACTTATGTG CGTACCACTG ATGAAGCCGG TGGTGCCGTG ACTCAGTATC TGGACCGTCT CAATCGGATA GTCCTTAAGC GCGTACTCAC TAGCAAAAAT GGCCTGGATG TTAATTTAGA CACATACTAT GTTTATGATG AGAAAAGTCA ATTGCGGGCG GTCCTTCAAC CAAATTATCA GAACGAAACA GACTTAAACC GGAATGCCTT TCTCTATCGG TATGATGAAT ATGGACGTTT AGTGGAAAAG AAGTTGCCTG GCAGCAACGC GTCGCAAATG GAGTATTACA TTACTACAGA TCTGCCGAAA AGTAGTACAG ATGGACGTGG CCAGAAGTTT TATTACCTCT ACGATAATCT TAATCGCCAG ACCGAGATGG GTCTGTGCAA AAATGGCAAC TGTGATACCC CAGAGCCCCT GTTGAAGACC TATTACGATA ATTATGGGTT TACCCCTTTC CGAAATTACG AAGCTGAGCC AGGCTTAACC GGAGTTGCCT TTGCGAATAC TCCTATGGTC AACCGTACTA ACTTGAAGAC CGGTCAGGCT GCCAGGGTGC TGCTACCTAA TGGCGATTAT GGACAGTGGT TGCAAACGGT GATCTATTAC GATGACAAGC AAAGAGTAAT TCAGACGTTG CGTCAGTTGT ACGGATTCAG CAGTAATGCA TTTGAGCGCG TGAGCCTGCA GTTAGCATTC GATGGCAAGC CCGAGCAGGA GTGGATTACT CAGGAGACTG GCAGTGTGAG CTACAAGCTA ACCAAGACTT TCACGTACGA CCATGCCAAC CGGCTGAGTA AAATTAACCA TATACTCTAT GAAGGAGGTG TTCAGAAAAA ATCATACACC CACATGGAGC AACTTTACAA TGAAGTAGGT CAACTGGCGA CCAAATCCCT GCATACAGGT GTGCAAATTC TAGGCTATAA GTACACTCCA CGGGGCTGGC TAGGCAATAA TCAAACAAGT ACAGGTCAGC CTTTCACGTT AGGTCTAAGT TACAAAGCTA ATGGTAACAT TGATAGCCTG TCGTGGATAA CCAAAAGTTA CAGTGGAGGA ATGGGGTTAA CCTACGACAA GTCGAGCAGA TTAATTGGAG CAGTAGGTAG TGGTAATTTT GGAGGCTATA ATGAGTCACC AATCAATTAC GATAGCAATG GTAACTTAGA AAGCCTGACT CGTAAGTACA ACAATACAGT CATTGACCAG TTGAGCTACC AGTATCACGG CAACCAACTT CATAGGGTAA ACGACGACGC GCAAGATAAT CAGAGCCAAG CGGTTAAAGG ATTTATTAAC GGAACTAACA TCGATGATGA GCTAATCTAC GACGGCAACG GAAATTTGGT AAGGGATTTC AATCGGGGCG TTGGAAGTGC CACTACAGAT GGCATTTATT ACAACGTACA GAACCTGCCT CGCACCGTGA TCCGTAATGG GCGTACGGTA CTTTATACGT ACGATGCTAG CGGAATAAAA CTTAAAAGTG AAGCGCCAGA TAATGTCAAT ACGTACTACG CAGGGATGTT CGAGTACAAA GCAGAAAACA GCTTACTTCG TATCGGTTTA GAGGAAGGGC AACTCGTAAA GAAAGATACT AATTACTTAG CGCATTACTA TTTAAGGGAT CATCTAGGAA ATGTCCGCTC AGTACTGGAT GAGGTCGGCA CTGTAATACA GGAGACAGAA TACTATGCTT TTGGTTTACC AATTCAGCGT AGTGGAAGTG ATAAAAATAA ATATCTTTAC AACGGCAAGG AAAAACAGCC TGAGACTGAG TGGCTAGATT ATGGAGCCCG TATGTATGAT CCGAGTATTG GACGATGGAT GGTGATTGAT CCGTTGACGG AGATTTTTCC AAGTACTTCA TATTATAGTT ATGCGGTAAA CAATCCTACT TTATTTACAG ATAAGTATGG TCTTTATGCC GAGTCTTCAG AAAACATAGC AGTTTGTCCA ACTTGCCCAA GTGGAGAAGA ATATAGCAAA TACAGGGACA GTAAATCACT TTATACCTAT GACAAAGGAA CAGGTGTTAT CCTAAATGGG GATGGAAAAG GTGCAACAGT AACTGCAAAA AGAATACAAC CTACAGAAGC TCCTACTTTT GGTTGGCCTT GGCAAGCAGA TTTGCCTATT GGCCTATCAG AATTGCAATT AGGTAATAAA ATTGAACAAG TAGCAAAATG GGACGGTACT TTCCGTTATC CAAATTCTGT ACTAAGTCAA AAAGAAGTAG ATGCCAAAGC TATTTTTAGA CAACCACTTA TTCAAAAACG ACCTATAAAT ATTCTTAGAA ATGTTGCCTT ACCAAGGGAT CTGGCACTCA AAGTAGCAAA AGGGTTGAAG GTAGCTGGAG GAATAACGAT GGCGATGGGA GTTGTAGACA ATATTAGCAA AGGCTATGCA GGTAATATTA CGTGGGAACA TTCAGCGACT TCAATAGGTA TAGGTGCGTT TGGGTTAGTC GCAGGTACAT TTGGAGCACC TGTTGTTCTT GGCGCAATTG CGGTAGGAGT TGTTTATTCT GTTTACGAAG ATGACCTCTG GCATGATTAT GACAAAACCA ATGCAACTAA TTTTGTGGAA AAGAAAGAAT GA
|
Protein sequence | MNQLYQWLAI GCLLTTFLPS LAQTPGTNYT ISRTYKQANI SENLTGSGFT ATAQQATSQV SYFDGLGKPI QQVMAFGAGS KADIVTLIEY DALQRPVQTF LPTPINANGG LLQANGDLKI KAKTYYTDVA KVSAPINTTA PIEVLATQTF YENTPLNRVT SQKAPGATGN STQFHGVNAI NDVKYFRSSA AGLASIQLVG TYEAGELTYV RTTDEAGGAV TQYLDRLNRI VLKRVLTSKN GLDVNLDTYY VYDEKSQLRA VLQPNYQNET DLNRNAFLYR YDEYGRLVEK KLPGSNASQM EYYITTDLPK SSTDGRGQKF YYLYDNLNRQ TEMGLCKNGN CDTPEPLLKT YYDNYGFTPF RNYEAEPGLT GVAFANTPMV NRTNLKTGQA ARVLLPNGDY GQWLQTVIYY DDKQRVIQTL RQLYGFSSNA FERVSLQLAF DGKPEQEWIT QETGSVSYKL TKTFTYDHAN RLSKINHILY EGGVQKKSYT HMEQLYNEVG QLATKSLHTG VQILGYKYTP RGWLGNNQTS TGQPFTLGLS YKANGNIDSL SWITKSYSGG MGLTYDKSSR LIGAVGSGNF GGYNESPINY DSNGNLESLT RKYNNTVIDQ LSYQYHGNQL HRVNDDAQDN QSQAVKGFIN GTNIDDELIY DGNGNLVRDF NRGVGSATTD GIYYNVQNLP RTVIRNGRTV LYTYDASGIK LKSEAPDNVN TYYAGMFEYK AENSLLRIGL EEGQLVKKDT NYLAHYYLRD HLGNVRSVLD EVGTVIQETE YYAFGLPIQR SGSDKNKYLY NGKEKQPETE WLDYGARMYD PSIGRWMVID PLTEIFPSTS YYSYAVNNPT LFTDKYGLYA ESSENIAVCP TCPSGEEYSK YRDSKSLYTY DKGTGVILNG DGKGATVTAK RIQPTEAPTF GWPWQADLPI GLSELQLGNK IEQVAKWDGT FRYPNSVLSQ KEVDAKAIFR QPLIQKRPIN ILRNVALPRD LALKVAKGLK VAGGITMAMG VVDNISKGYA GNITWEHSAT SIGIGAFGLV AGTFGAPVVL GAIAVGVVYS VYEDDLWHDY DKTNATNFVE KKE
|
| |