Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_5894 |
Symbol | |
ID | 8729672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 7136893 |
End bp | 7139982 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | DNA repair ATPase-like protein |
Protein accession | YP_003390656 |
Protein GI | 284040726 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0510725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00862936 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATACCCA TTAAACTGTC AATTCAAGGA CTCTATTCAT ATCAGGACTT ACAGGAAATT GATTTCCAGC AATTAATTGG TTCGAGTGTT TTTGGCATCT TTGGCAAGGT GGGCAGTGGC AAAACGTCGC TGCTGGAAGC TATAAGCTTT GCGTTATACG GCGAAACGGA ACGGCTGAAC AGTCGCGACA ATCGTCAGTA TAATATGATG AATTTAAAGT CGAAGCACCT GATCATCGAC TTTCAGTTTC AGGCCGGGCC GGAGCAGCAG CTATACAAGT TTGTTTACGA AGCGAAACGT CATCCCAAAA AGCATCATGA AATAGGGCCT GGCGAACGCC GGATGTTTAT TCGCCAGGGC GATGAATGGC AACCCATTGG GAATGAGAAA GAAGATATAG CCGTTCTGTC GAAGCAGATT CTGGGTCTGG ACTACGATAA TTTCAAGCGG ACCATCATCA TTCCCCAAAA CCAGTTTCGG GAGTTTCTGG AGCTTAGTCC CACCGAACGG ACGAAGATGA TGAACCAGTT GTTCAAGCTC GACCAGTATG ACCTGGCTGC CCGAGTGAGC AAACTGAGTA AAGCGAATGA CGACCAACTG GCCGAACTTC GCGGGTTGTT ATCCCCTCTG GAAGCTGTAA CACCCGAAAC CATTGAGCAG GCAAAGACCG ACATTACTGT TATTTCTGAG TCTCTGAGCC GGAAGGAAGC CGAAATCAAT GACTTGTCTC CTGCCGAAAA ACGATTACTC GAAAACCAGC AACGTAGCCA AACACTGGTA TCGACTCAAC AGGAACTCAC CCAGGTTCTT AACCGGGCAC CCCTCTATCA GCAACGGGAA CAGGCACTGG CCATTTACGA AACCTGTCTG CTCGTCTTCC AGGCCGACTT TGCTAACCTC GATAAGCTTA ATTCCCGCAA AACCAGACTG ATAGAACAGG AACAGGCGGC CCAGCGGCAA CTCCTGTTCG TAACGAAGCG GTTGGTAGGC TTACGTAGTT TATACGAAGC GGCCAAACAG GCTTACGAAA CGCGCGACGA ACTACAGCAG AAAATTGACG AACTGGATAC GGTACAGCAA ATCCGAACGC TGCAACAAAC CATCGGCCAG CAAACCCGTA ATCGCGATAC GCTGGCCAGT CAGCTCCACC ACCAAGCGAC TTTAATTGAA CGGCACAAGG CAACCCGCGC CAAACACCGG CTCATTCTGG ATAACGGCAT CGGTCGGACG TCCGATCTAG AGCGGCTTTA TAAAGTTAAG AACTGGTTTA CGGCTTATAA ACCTCTGAAG AAACAAGCCG ATGATCTTCA ACTGGCCGTC GACAACTATG ACCGGGCGAT TGAAAAACTG AAACAGCAGA AGAACGATGC GCTGTTGGGT TTTCCACCCG ATTGGACTGA CTTAACGCTC AAAACCTTAC CGGATGCTAT TGAGGAGGCT CTTGAGAAAT TTAAAGAAGT TCGGGAGGAC CGCGAGGACA AACACCGGAA ACTACTCGTT CAGGATGAGT TACGAAAATA TGCCGATGCA CTGAATGAAG GACAACCCTG CCCACTTTGC GGATCAACGC ACCATCCTAA CCGACACAAG GGCGATGAGG AAAGCATTGA TGTAAAAGGG AGCGAAGCCG GTTTACAGAA AGTAATTCAG CGAATAGAGG TCACCAGTAA GCTACAGCTT GCCATTAAGG AGCTGACGAC AAAACTGCGG AGCGAACACG ATAACGGCAA ACGGCTCATG CAGGAGCGTA CCGAAATCGT CCGGCAGTTG ACCGAGCACG AAGATAATTT TATGTGGCCG GAATTCTCGA AAGAGCAGGA AGGTGAGGTG ATAAATGCCA TCCAAAAAGA GAGCGACGCG CAGAAGCAAA TCCTGGACGC GCAAAAGGCG ATTCAGGATC TCGAAAAACT TATAGGAGAA GCCGAAGCGG CCCATAACGA TCTAAACAAA AAAGTGGTCG ACGCGGGTAA TGCCATTGCC GGATTGAGCG GGCAATTGAA AACAGCGGCC GAATCGCTGG AGCATTTCCG GCTGGAAGAA GTGAAAAACT GGGGACTTGA CCAGATTGCT GACCTACGCG AATCACTCAC CAAAACCTAC CGCCAGGCAA AAATTCATTT CGACGATACG GCAAAGCAGC AGGGTGAAGC CGAAAAAGAA CAGGCAACGG CTGAAGAACA AATTCAGCAA TTCCATAATC AATTGGCCGA AATAGTCGCT GAACAGGAAG GTCTGGAAGT CACCATCGCG CAGAATCTGG CCAACCAAAA CCTGACCCGC AAGCAGGTGA AACAGATTCT ACAATCTAAC CTAAATATCG GTCAGGAACG ACAGCAGATC AACGAATACA ACGAAAAGCG AAGCAGTCTG CAAGGACAAG TCGATACGCT ACAGACTGAG CTAGCCGAAC AGCCGTTCGA CCCGGTCGAA TTAGCGACGG TTCAACAACA GTTAGCAACC TTACAGACCG AGAAGGATGC GCTGAACAAA GAACATGGCC GTGCTACCAG TGTGCTAGCT GCCCTCGAAA GTCAGTGGCA GCAAAAGCTG GAACACCAGA AGCGGCACGA TGAACTTGAC CTCCGAAAGC AGGACCTGAA GAAGATGGAC GAGATGTTCA GAGCGCAGGG CTTCGTGAAT TACGTGTCGT CGGTTTATCT GAAAAACCTC TGTGAATCAG CCAATGAACG ATTCTTTAAG CTGACCAATA ATCAGCTGCG GCTCGAACTG GACGACAAAA ACAATTTCCT CGTGCGCGAC TTCCTCAACA GCGGGGAGGT ACGGAGTGTA AAAACGCTCT CGGGCGGTCA GACCTTCCAG GCAGCTCTGT CGCTGGCGCT GGCGTTATCG GATAACATTC AACATCTGAC GAAAGCAAAG CAAAATCTGT TTTTCCTGGA TGAAGGCTTT GGTACGCTCG ACAAAGATTC CCTGCAAACG GTCTTCAAAA CGCTGAAAGC CCTCCGCGCC GAAAATCGGG TCGTTGGCAT TATCTCGCAC GTGGAAGAGC TTCAGCAGGA AGTGGAACAT TTTATTCGGG CAGAATCGAC CGAGAATGGT AGCCGGATTA TTCGGAGCTG GGAGAGTTGA
|
Protein sequence | MIPIKLSIQG LYSYQDLQEI DFQQLIGSSV FGIFGKVGSG KTSLLEAISF ALYGETERLN SRDNRQYNMM NLKSKHLIID FQFQAGPEQQ LYKFVYEAKR HPKKHHEIGP GERRMFIRQG DEWQPIGNEK EDIAVLSKQI LGLDYDNFKR TIIIPQNQFR EFLELSPTER TKMMNQLFKL DQYDLAARVS KLSKANDDQL AELRGLLSPL EAVTPETIEQ AKTDITVISE SLSRKEAEIN DLSPAEKRLL ENQQRSQTLV STQQELTQVL NRAPLYQQRE QALAIYETCL LVFQADFANL DKLNSRKTRL IEQEQAAQRQ LLFVTKRLVG LRSLYEAAKQ AYETRDELQQ KIDELDTVQQ IRTLQQTIGQ QTRNRDTLAS QLHHQATLIE RHKATRAKHR LILDNGIGRT SDLERLYKVK NWFTAYKPLK KQADDLQLAV DNYDRAIEKL KQQKNDALLG FPPDWTDLTL KTLPDAIEEA LEKFKEVRED REDKHRKLLV QDELRKYADA LNEGQPCPLC GSTHHPNRHK GDEESIDVKG SEAGLQKVIQ RIEVTSKLQL AIKELTTKLR SEHDNGKRLM QERTEIVRQL TEHEDNFMWP EFSKEQEGEV INAIQKESDA QKQILDAQKA IQDLEKLIGE AEAAHNDLNK KVVDAGNAIA GLSGQLKTAA ESLEHFRLEE VKNWGLDQIA DLRESLTKTY RQAKIHFDDT AKQQGEAEKE QATAEEQIQQ FHNQLAEIVA EQEGLEVTIA QNLANQNLTR KQVKQILQSN LNIGQERQQI NEYNEKRSSL QGQVDTLQTE LAEQPFDPVE LATVQQQLAT LQTEKDALNK EHGRATSVLA ALESQWQQKL EHQKRHDELD LRKQDLKKMD EMFRAQGFVN YVSSVYLKNL CESANERFFK LTNNQLRLEL DDKNNFLVRD FLNSGEVRSV KTLSGGQTFQ AALSLALALS DNIQHLTKAK QNLFFLDEGF GTLDKDSLQT VFKTLKALRA ENRVVGIISH VEELQQEVEH FIRAESTENG SRIIRSWES
|
| |