Gene Information |
Locus tag | Slin_0543 |
Symbol | |
ID | 8724271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 664815 |
End bp | 667889 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | DNA polymerase I |
Protein accession | YP_003385406 |
Protein GI | 284035476 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
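The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to check them, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


if __name__ == "__main__":
    length = gene_length(664815, 667889)
    print(length, "bp")                  # Gene Length field
    print(protein_length(length), "aa")  # Protein Length field
```

With the values listed for Slin_0543, this gives 3075 bp and 1024 aa, which matches the Gene Length and Protein Length fields.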
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.72991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.283494 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAC CAACCAAAAA ACTGTTTTTA TTAGACGCGC TGGCACTTAT TTACCGCGCT CACTTCGCCT TTAGCAAATC GCCCCGTATC TCATCGAGGG GTATCAATAC GTCGGCGGTG TTCGGGTTTA TGAACGCCAT GATCGAGGTG TTAACGAAAG AGAAACCGAC GCACATCGGT GTCGCTTTCG ACTCGGCGAA AAAGACATTC CGGCACGAGT CCTTTCCGAT GTACAAGGCC AATCGGCAAT CGCAGCCCGA AGATATCAGC GTGGCTATGC CATACATCAA GCAGATTGTG GAAGCGATGC ATATCCCCAT GCTGATTCTG GACGGATACG AAGCCGACGA TATCATTGGC ACTATCGCAA AAAAAGCCGC TCTGGCTGAT TTTGAGGTGT ATATGATGAC GCCTGACAAA GATTACGGGC AGTTGGTGGA AGAACACATT CATATTTACA AGCCCGCCTT CATGGGCAAA CCCGCCGAAA AGCTGGGCGT AACGGAAGTG CTCGAACGCT GGCAGATCGA GCGTATCGAG CAGGTAACGG ATATGCTCGG GCTGATGGGC GATTCGGTCG ATAACATCCC CGGTATTCCG GGCATCGGCG AAAAAACGGC GCAGAAGCTC ATCGCTGACT TCGGTTCGGT CGAGAACCTC ATTGCCCGCG CCGATGAACT GAAAGGGAAA CTGAAAGAAA ACGTTGTCAA TTTTGCCCAG CAGGGACTTA TGTCGAAAGA ACTGGCCACC ATTCACCTGG ATGTGCCGGT GCCATTCGAT GAAGAACACC TGCGCCATAC CGAATACGAT AAACCCCGGT TGGCCGCCTT GCTGGACGAA CTCGAATTTC GGCAGATGAA GACCCGCCTG CTGGGTGGCA GTTACGACGA GAAACCGTTG CCAACGGCTT TCCAGGCTCC CGGTTCAGCA CAGATGAACC TGTTCGACTC ACCTGGTGGC GACAGCCCTG CCTTCCTGCC GTTTCCCAAT ATGGGGTCAA ACAATCCATC GGGGGCAAGT GATCTGCCAT TTGATTTTGG TAGCGAGACG ACTCCAACTG CTGCACCCGT TGAGAAGCCC AAAGGCAAGC GCACCGCTGT TAAAGTACCC GTTGCGTCGG GTTCTAAAGC GACTCCCAAA GGCGTGACGG ATACCATTAC GGCGGACGAT AAACTGGGCG CGGAGGTGAG TGAAACGGAC GCCCCCGCTT ACCTCGACGT GTATCCCGAT TACGAAATCG ACGAAAATCA GCCCGAACGA CGCAAGACTA TTTTGTCGGT CAAGCACGAC TACCGGCTGG TCGATACTGC CGAACTGCGG GCTAGTCTGG TACACTACTT AAGCCAGCAG GAGAGCCTTT GCTTCGACTC CGAAACGACT GCTATCGACC CCGTTGAAGC CGATCTTGTT GGGTTGTCCT TCGCGTATCG TGCGGGCGAA GCCTTCTACG TACCCGTTCC CGCCGACCGG GCCGAAGCGC AGGCGATTGT TGATCAGTTC AAACCCGTTT TCGAGAATCC GACCATTGAG AAGGTTGGAC AGAACCTGAA GTACGACCTG CTGATGCTGA AAAAGTATGG CGTGGAAGTA CAGGGTAAGC TGTTCGATAC CATGATTGCC CATTACCTGA TTGAGCCCGA AATGCGGCAC AACATGGACA TGATGGCCAT GACCTACCTG AACTATAGCC CGGTTGAGAT TGAAGCCTTG ATCGGCAAGA AAGGGAAAGG GCAGTTGACC ATGCGCGACG TGGACATTCA GAAAGTGGTG GATTATGCGG GCGAGGATGC CGATATTACC 
CTGCAACTGA AACACGCATT CGCCCCCCGG CTCGAAAAAG ATAACCTGCA CAAACTCTTC GATCAGGTCG AAATGCCGCT CGTTCAAGTG CTCACGGATC TGGAACTGGA GGGTATTAAA ATTGACACCA ACGCGCTATC TGAATTGTCG GCCACGCTGG AGGTCGATAT GCGGCAGGTG CAGCAGGAAA TTTTTGAGAT TGCCGGTGAG TCGTTCAACA TCGGCTCGCC GAAGCAATTG GGCGAAGTGC TGTTCGATAA ACTCAAGCTC GACAAGAACG CCAAAAAGAC CAAAACCGGG CAGTACGCCA CGGGCGAGGA AATCCTGTCG AAGCTCGAAG CCGAGCACGA AATAGCCCGC AAAATTCTCG ATTACCGCGA GTTGATCAAA CTTAAGAACA CCTACGTCGA TGCGCTGCCG TTGTTGATCA GCAAGCGTAC CGGTCGGATT CATACGTCGT TCAATCAGGC GGTAGCGGCC ACTGGTCGGC TGTCATCGAC CAATCCTAAC TTGCAAAACA TCCCGATTCG GACGCCACGC GGGCAGGAAA TCCGGAAAGC GTTCGTACCG CGCGGACCGG AGTTTGTGAT CATGTCAGCC GACTATTCGC AGATCGAACT ACGAATTATG GCCGCTTTCA GTGGTGATCA GACTATGCTC GAAGCCTTCA ACAACGGCGT CGATATTCAT ACCCAAACAG CCAGCAAGGT ATTCCATGTG GGGCTCGACG AAGTAACCAG CGACATGCGT CGGAAGGCCA AAACCATCAA TTTTGGTATC ATTTACGGCA TATCGTCCTT TGGCCTGGCG CAACGGCTCA AGATTCCGCG CAAAGAGGCA GGGCAGATCA TTGAAGAGTA TTTCGCGGGT TTCCCGGCGG TAAAAGACTA CATCGACCAG TGCATCGAAA AAGCACGCGG CTTTGGCTAT GCCGAAACCA TACTGGGTCG TCGGCGGTAC CTGCGCGACA TCAACTCCCG CAACCAGACC GACCGTATGT TTGCCGAGCG TAACGCCGTG AACGCTCCCA TTCAGGGCAG TGCTGCCGAC ATGCTCAAGA TTGCCATGAT CCAGATTCAC GAGTTCATGC AGGCCGAGCG GTTGAAGTCC AAAATGATCC TGACCGTACA CGACGAACTC GTCTTCGACG CCCACCGCGA CGAAATCGAC TTGTTGCGCG TGCGTGTAGA CGAGATCATG AAGAACGCCA TCCCGATGGG TGTAAAGATG GAAACTGGCA TCGGCACGGG CGAGAACTGG TTGTTGGCGC ACTAA
|
Protein sequence | MAKPTKKLFL LDALALIYRA HFAFSKSPRI SSRGINTSAV FGFMNAMIEV LTKEKPTHIG VAFDSAKKTF RHESFPMYKA NRQSQPEDIS VAMPYIKQIV EAMHIPMLIL DGYEADDIIG TIAKKAALAD FEVYMMTPDK DYGQLVEEHI HIYKPAFMGK PAEKLGVTEV LERWQIERIE QVTDMLGLMG DSVDNIPGIP GIGEKTAQKL IADFGSVENL IARADELKGK LKENVVNFAQ QGLMSKELAT IHLDVPVPFD EEHLRHTEYD KPRLAALLDE LEFRQMKTRL LGGSYDEKPL PTAFQAPGSA QMNLFDSPGG DSPAFLPFPN MGSNNPSGAS DLPFDFGSET TPTAAPVEKP KGKRTAVKVP VASGSKATPK GVTDTITADD KLGAEVSETD APAYLDVYPD YEIDENQPER RKTILSVKHD YRLVDTAELR ASLVHYLSQQ ESLCFDSETT AIDPVEADLV GLSFAYRAGE AFYVPVPADR AEAQAIVDQF KPVFENPTIE KVGQNLKYDL LMLKKYGVEV QGKLFDTMIA HYLIEPEMRH NMDMMAMTYL NYSPVEIEAL IGKKGKGQLT MRDVDIQKVV DYAGEDADIT LQLKHAFAPR LEKDNLHKLF DQVEMPLVQV LTDLELEGIK IDTNALSELS ATLEVDMRQV QQEIFEIAGE SFNIGSPKQL GEVLFDKLKL DKNAKKTKTG QYATGEEILS KLEAEHEIAR KILDYRELIK LKNTYVDALP LLISKRTGRI HTSFNQAVAA TGRLSSTNPN LQNIPIRTPR GQEIRKAFVP RGPEFVIMSA DYSQIELRIM AAFSGDQTML EAFNNGVDIH TQTASKVFHV GLDEVTSDMR RKAKTINFGI IYGISSFGLA QRLKIPRKEA GQIIEEYFAG FPAVKDYIDQ CIEKARGFGY AETILGRRRY LRDINSRNQT DRMFAERNAV NAPIQGSAAD MLKIAMIQIH EFMQAERLKS KMILTVHDEL VFDAHRDEID LLRVRVDEIM KNAIPMGVKM ETGIGTGENW LLAH
|
| |
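The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that display formatting and compute a GC fraction, assuming the blocks carry no meaning beyond layout. The helper names are hypothetical, and the database's own method for deriving the GC content field is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two display blocks of the gene sequence, used only as a short demo.
    demo = "ATGGCCAAAC CAACCAAAAA"
    print(clean_sequence(demo))
    print(f"{gc_fraction(demo):.2f}")
```

Passing the full Gene sequence field to `gc_fraction` should give a value that can be compared against the GC content field in the Gene Information table.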