Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4111 |
Symbol | |
ID | 8727870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4950224 |
End bp | 4953193 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | Rhs family protein-like protein |
Protein accession | YP_003388897 |
Protein GI | 284038967 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.217652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0101653 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCAC CCGCGTTTCG GTCCGGTTTA CGACGGTTAC TTTACCCGGC CATTAACGGT GCGTCGCCTG CTGGTGAAGA AAACCCGGAA CAGCTTGCCC TACTGCTGAA TCGACTGGCC TTAACGCCCG ACGATTGCCA GGTGCTGCCC TGGGAACGAC GGACGTTCAA TGGTCGCTTT GTCGATGTGG CATCGGGAGA AGTGCTGGTT GAGGAAGTCG ACGTAGAGAT TCCGGGGCCT GTTCCCTTCC GATGGAAACG GTTTTACCGC TCCCATAAAT CAACAACCGG CTCTTTGGGT GGCGGGTGGC GTCACGCCTA CGATTTTGCC CTGCTGGAAG ACCAGGTCAG CGGTCGGGTG GTGGTCCGGT TGCCCGATAA TCGGGGTATA CTGTTTCCAC AACTGGCCAA AGGTGAGCAA TATCTGAACC GAGCCGAAAA GCTTCAGCTT AACCACGACG ATCACGGCTA CCGGCTCCGG GGAGCCGACG GGCTGGGCTA CCGTTTTGCC CGAAATCCGG GCGACAGTCT TTATCGGCTG ATGGCGGTCG ATAGCCCTCA CTTTCCCCAC CGTCTTCAGT TCAGTTATAC CCGTGCGGGA CATTTACGGC GCATAAGCGA TGATTTTCAG CGCGTGATCG ATGTAACTAC CGATCCGCAG GGAACTATCA TACGGCTGAC CATCACGGCC CCCGACCAGT TACTTAAAGA GTTTACGTTG GTAGCCTACG GGTACGACGA AACCCATAAC CTGACCGATG TGCATGTGGC GGGTAGCCGT ACGGCTCAAT ACAGTTATCG CCAGAATCGA CTCGTCCGGC TAACGGACCG TTTCAACCAA AATACATTTT TCACCTACGA AAAAATCGAC AATACTTACC GGTGCAACAC GCTGAAATGG AGCGGAAGCC CGATTTCACT CCGCTTTGAT TACCTGTCCG ATGAAGGGCG AACGCTGGTT ACGGACTCCG CCGACCGGGT CAGGCAATAC ACGCACGAAG CCGGAGCCGT TCAGCGGTTT ATCAGCGAAG GGGGGCAACA ACGTGTTTGG CTCTTTAACG AACACGGCGA ACTGGAGAGT GAGCAGGATA CGCTTGGTAA CACCACGTTC TATACCTACG ACCCAAAAGG CAATAGGATA GAGATAGATT GGCCAGACGG TGGACAGATG CGGATGCGCT ATAACGACGA CGGCCAGTTA CTGGAACTGG TAGACCGGGT CAATGGCGTG TGGCTGTGGT CGTATACCGA AACCGGGCAA CTCCTATCCT GCCTGAATCC GGTTGGTGCC GAAACGACCT TTAGCTATAA CCGGGACGGC TGGCTTCGCG AACGCAGAAA CTTAAGAGAA GGCTGGACCC GCTGGGCTTA CGATCCGTAC GGTTACCCGG TAGAGGTAAC AACGGAAACC GGCCGAAAAA CGACATTATC GTTCAACGCC CTGGGGCAAT TAATAAACTC GAACCCCGAC CCGAGCACGG CAACCAAACC CGTACCCGCA AACGAGCCGA AAGCCGTTTC TGAATACCAG CCGGTTTACA CAAACGACGG TAAACTCATT GAGCTTCGGA GCAAAGATAA ACTCAGTTGG CGGTTCGTCC GGGATACCGC CGGGAGAGTG CGTGACTATT GCCGACCCGA TGGCCGTTCA ACCCGTTTTC ACTACGATGC TGCCGGGCGT ATGACCGAAG TACTGTTCTC CGACGGAAGC TGGCACCATT ATACCTATCG CCCGGATGGC TGGCTGATGG AAGCCAGCAC ACCCACCACC CTGGTTCAGT TCGAGCGCGA TCCGCTGGGG AAAATCATTA CCGAAACAGC GGGTAATACC GTGGTGGAAA CGGTGTACGA CAAGGCCGGT AACCGAATAA ATCTGCAATC GTCAGCCCAG GCTACTGCGG CCTACACTTA TGACAACCGC AGCCTACTGA CCCGGCAACA ACACGGACCG TGGCAACTCG AATTTACCCA CGACCGCCAG GGGCGTTTGG CAGAGTGCCT TATGCCCGGC AGTTTGCGCA GTCGCTGGCA GTATGACCAG GGGCCATTGC CAACCAGTCA TCAGCTCTTT TGGGGAAGCC GTCTGCAGGC CGCCCGTTCG CAAACCTACC AATGGCAGCA GAACCAAGTC ACCCGGCTAC AGGATTCGCG TTTCGGCACG GCCACGCTGC TTTATGATTC AGCCAATGAA CCGATAGAGG CCGTCGGCTC TGCCGGTGCC GGATGGGTCG ACCGCTGGCT TCCGCAGCGG TCGCGCTATC AGCAGGTGTT ACTGAAGCCA GCCGCCAGAG CCAGTGAAGT CGGCTGGCAG CTCATTCTGG CGGGACCGGC CCGATTTTAT TACGATCCTG AGGGGTATCT GCGCGAAAAA CAGATTGCCG GAAAGGTGTG GCAGTTCGTG TGGCATGAAT CAGGCTGTTT ACATCAGGTC ATCTGTCCGG ATGGCAGCGT GGTTGCGTTC GAATACGATG CCCTCGGGCG TCGCATCCAG AAAACTGTCA ACGATTACAA GGTCTGCTGG GCGTGGGACG GCAACCGACT GCTGCACGAA TGGCATGAGC GCACGGGCAG CGAACCGGTT CAACTTACCT GGTATACGGC GCGGGGAGCC GAAGCCACCA TGCTTCAGGT GGGTAAAAAC AGCTATAGCG TGGTCTGTAA CTACCTGGGA CAACCCCTGT CCATGCATGA TGAGCAGGGC GATCCGGTAT GGGAATGGCG CTGGTGTCTG TTCGGCAAAA AGCGCAGCCT AACCGGCCCC CAACGCTGGC ATACCTTTCT GGGATATGGG CAGTTTGACG ATCAGGAAGC CGGGTTAGTG TACAACAACT TCAGGTATTT CGACAGTGAA ACCGGGCTGC CTATCAGTCC AGAATATTCC AGCCCCGCCG GCTGGGTGCG ATCTGGATGG GAGCCTCCTC ATGCACCAGA ATCGTTTCTT TCCGCTGGCC GATATATTCA AGTGTACTGA
|
Protein sequence | MESPAFRSGL RRLLYPAING ASPAGEENPE QLALLLNRLA LTPDDCQVLP WERRTFNGRF VDVASGEVLV EEVDVEIPGP VPFRWKRFYR SHKSTTGSLG GGWRHAYDFA LLEDQVSGRV VVRLPDNRGI LFPQLAKGEQ YLNRAEKLQL NHDDHGYRLR GADGLGYRFA RNPGDSLYRL MAVDSPHFPH RLQFSYTRAG HLRRISDDFQ RVIDVTTDPQ GTIIRLTITA PDQLLKEFTL VAYGYDETHN LTDVHVAGSR TAQYSYRQNR LVRLTDRFNQ NTFFTYEKID NTYRCNTLKW SGSPISLRFD YLSDEGRTLV TDSADRVRQY THEAGAVQRF ISEGGQQRVW LFNEHGELES EQDTLGNTTF YTYDPKGNRI EIDWPDGGQM RMRYNDDGQL LELVDRVNGV WLWSYTETGQ LLSCLNPVGA ETTFSYNRDG WLRERRNLRE GWTRWAYDPY GYPVEVTTET GRKTTLSFNA LGQLINSNPD PSTATKPVPA NEPKAVSEYQ PVYTNDGKLI ELRSKDKLSW RFVRDTAGRV RDYCRPDGRS TRFHYDAAGR MTEVLFSDGS WHHYTYRPDG WLMEASTPTT LVQFERDPLG KIITETAGNT VVETVYDKAG NRINLQSSAQ ATAAYTYDNR SLLTRQQHGP WQLEFTHDRQ GRLAECLMPG SLRSRWQYDQ GPLPTSHQLF WGSRLQAARS QTYQWQQNQV TRLQDSRFGT ATLLYDSANE PIEAVGSAGA GWVDRWLPQR SRYQQVLLKP AARASEVGWQ LILAGPARFY YDPEGYLREK QIAGKVWQFV WHESGCLHQV ICPDGSVVAF EYDALGRRIQ KTVNDYKVCW AWDGNRLLHE WHERTGSEPV QLTWYTARGA EATMLQVGKN SYSVVCNYLG QPLSMHDEQG DPVWEWRWCL FGKKRSLTGP QRWHTFLGYG QFDDQEAGLV YNNFRYFDSE TGLPISPEYS SPAGWVRSGW EPPHAPESFL SAGRYIQVY
|
| |