Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1570 |
Symbol | |
ID | 8725304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 1891543 |
End bp | 1894608 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | glycosyl hydrolase BNR repeat-containing protein |
Protein accession | YP_003386418 |
Protein GI | 284036488 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.371729 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTATT TCGCCTTTTA CTTTCTGGCA ATTAGTTGCA GTCTGGTACT CAGCAACCCG CTTTGGGCTC AGCCAGCAAC GAAGCCCGAT ACGGCCATCA ACCCCGTTTT TAAAGGCATG GCCTGGCGCA ACATCGGCCC AACCCGGGGT GGCCGGTCGC TGGGGTCTGC GGGGTCGCCG AGCCGCAAAC AGGAATATTA TTTCGGAGCC GTTGGCGGTG GCTTATGGAA AACTACCGAT GGCGGCCAAA GCTGGGCACC CGTTACGGAT GGTCAGTTGA CAAGTTCGTC TGTAGGGGCC GTTGCGGTTG CTGAATCCAA TCCCGATGTG GTCTATATCG GTACAGGCGA AACACAGCTT CGCGGTAACA TCATGCAGGG CGATGGAGTG TACAAATCGA CCAATGCCGG GAAAACCTGG ACCAACATTG GGTTGAGAAA CACCCAGGCC ATTGCCCGTG TCAGGATTCA CCCCACCAAC CCTGATATTG TTTACGTGGC TGCTCTGGGA CACCCCTACG GCCCTAACGA AGAGCGGGGC ATTTTCCGGA CCACCGATGG CGGTAAATCC TGGAAGAAAG TCCTCTACAA AAGCGATAAG GCCGGTGGTA TCGACCTGAT CATCGACCGA ACCAATCCCA ATGTACTCTA TGCCTCTCTC TGGCAGGTGT ACCGCAAACC CTGGAAAATG TGGGGGGGCG GGGGTGACTC CGGCCTGTTC AAATCGACGG ACGGGGGCGA AACCTGGACC GAGCTGACCC GTAAACCAGG CATGCCTAAA GGAACCGTCG GTAAAATTGG CGTGACCGTT TCGCCCGTCG ACCCCAACCG GGTGTGGGCC ATCGTCGAGG CCGAAGACGG GGGCGTATAC CGCTCCGACG ACGCCGGGAT GACCTGGAAA CACGTCAACG ACGAGCGCAA GCTTCGCCAG CGGGCGTTTT ATTACTCCCG AATTTATGCT GATCCCCTCG ACAAAAACGG CGTTTACTGC CTGAATGTCG ACTTTTTCAA ATCGTCGGAT GGTGGGGTAA AGTTCAATAA ATCGTTAAAA GTACCGCACG GCGATAACCA CGACTTGTGG ATCGACCCGG CCGATTCGAC CCGAATGATT ACGTCCAATG ACGGCGGTGC GGCTGTTTCG GTCAACGGCG GTAAAACCTG GACGGATGAA AACTTCCCGA CCGCGCAACT TTATCACATT ACAGCTACCA ACGATTTCCC CTACCATGTA GCCGGTGCCC AACAGGACAA CACCACCGTA GCCGTAGCCA GCGAAGGCTG GGGGAATCAG ATGGCCCGCA GTAATTCCAT CAAAAAGAGC GAATGGACTT ACGAAGTGGG TGGTGGCGAA AGCGGCTACA TTGCCCAGGA CCCCAAGAAC CCGAACATCT TTTACGCGGG CAGCCAGGGT GCATTACTTA CCCGCTACGA CCGCACGACG GGCCAAACCC GCGATGTGCA GGTGTACCCG CGTTTCTTCT CCGGCGAACC CGCCAGTGCT CTGCCCGAAC GCTGGCAGTG GACGTACCCG ATTGTTTTTT CGCCCAAAGA CCCGAACCGG CTTTACGTTT GTTCACAGCA CGTATGGGTA TCGACCAACG AGGGACAAAG CTGGGATAAA ATCAGCCCCG ACCTTACCCT GGCCGACACG GCTACGCTGG GGAAAAGCGG TGGTGTCATT ACGATGGACA TGAACGGCCC GGAGATTTAC GCGACCGTTT TTGCACTGGC TCCCTCCTAC CACGACGTGA ACACCATCTG GGCGGGCTCC GACGACGGGC TGATTCACAT CACCCGCGAC CACGGCAAGA GCTGGCAGAA AATCACCCCG CCGGATATGC CCAAACATAC CCGCGTGAGT ATTATCGAAG CGTCGCGGCA CAAACCCGGC ACGGCCTATG TAGCGGCCAA ACGCTACCAG ATGGACGACC GCACGCCCTA TCTCTGGAAA ACGGATGATT ACGGGAAGAC CTGGAAAAAG ATCATTACTG GATTACGTGC GGACGATTAC GCCCATGCCA TTCGCGAAGA CATCACGCGT CCCGGCCTGC TCTACGCCGG TATGGAACAT GGCGTTTGGG TTTCCTTCAA CGACGGCGAG AACTGGCAGC CCATGCAACT GAAACTGCCC GATACCCAAA TCTCAGACAT TCAGGTAACG GAGAAAGACA TTGTCATTGG TACGCACGGC CGGTCGATCT ACGTGCTGGA CGATGTAGCT CCCGTTCGGG AGTTTACGCC CGATCTGGCC AAGAAGGCTG TTCACCTCTT CAAGCCCTAC TATGCTGTTC GTCGGGTACA GCCAGCCGTC TTCCAGTACT ATCTGGCGAA GAAAGCGGAC AGTGTGAAAA TTGAGATTCT TAATGCCGCC GGAACGCTGA TTCAGTCGTT TACGGGCAAC AAACCCTCTT ACCCTAAAGA TGATGAGGAT GACGACGATT CAGGGAAACC CAAAATCAAA CTACCCACCA CGGCCGCTGG TCTGAACCGC TACGAGTGGG ACCTGCGCTA CCCCGGTGCC ACTTATTTCA AAGGGATGAT CATGTGGGGA GCCCGGCCTA CGTCCGGACC ACTGGCGTTG CCCGGTCAGT ATCAAGTGCG GTTAACCGTA GGCGATCAAA CGTTCACTCA ACCCTTCGAG ATTAAGCTTG ACCCACGACT GAAGGGCGTC TCCCAAGCCG ATGTGCAGGA GCAATTCAAG ATGGCGATGA AACTGCGGGA CGAGACGAGC AAAGCCAACG ACGCGGTGAT TCAGATTCGG GCGGTGAAGG AGAAGCTGGC CAAACAACCC GACAGCCCTA CCAATAAAAA GCTGAAAGAG CAGTTGAACA TCATCGAAGA AAACCTCTAT CAGATTCGGA ATCAAAGTGG TCAGGACCCG CTGAACTTCC CGATCAAGCT CAACAACCGG CTGGCGGCTC TCTGGCGCAG CATTGAATCC GGCGATGCCA AACCGACCAA CGGCTCTTAC AAAGTTTACG AGGAACTCAC CGCCGACCTA AACAAGCAAC TAGCCGAACT GGATACGCTG CTCAAAACGA AAACGGCAAA AAATATTGGT ATGTGA
|
Protein sequence | MKYFAFYFLA ISCSLVLSNP LWAQPATKPD TAINPVFKGM AWRNIGPTRG GRSLGSAGSP SRKQEYYFGA VGGGLWKTTD GGQSWAPVTD GQLTSSSVGA VAVAESNPDV VYIGTGETQL RGNIMQGDGV YKSTNAGKTW TNIGLRNTQA IARVRIHPTN PDIVYVAALG HPYGPNEERG IFRTTDGGKS WKKVLYKSDK AGGIDLIIDR TNPNVLYASL WQVYRKPWKM WGGGGDSGLF KSTDGGETWT ELTRKPGMPK GTVGKIGVTV SPVDPNRVWA IVEAEDGGVY RSDDAGMTWK HVNDERKLRQ RAFYYSRIYA DPLDKNGVYC LNVDFFKSSD GGVKFNKSLK VPHGDNHDLW IDPADSTRMI TSNDGGAAVS VNGGKTWTDE NFPTAQLYHI TATNDFPYHV AGAQQDNTTV AVASEGWGNQ MARSNSIKKS EWTYEVGGGE SGYIAQDPKN PNIFYAGSQG ALLTRYDRTT GQTRDVQVYP RFFSGEPASA LPERWQWTYP IVFSPKDPNR LYVCSQHVWV STNEGQSWDK ISPDLTLADT ATLGKSGGVI TMDMNGPEIY ATVFALAPSY HDVNTIWAGS DDGLIHITRD HGKSWQKITP PDMPKHTRVS IIEASRHKPG TAYVAAKRYQ MDDRTPYLWK TDDYGKTWKK IITGLRADDY AHAIREDITR PGLLYAGMEH GVWVSFNDGE NWQPMQLKLP DTQISDIQVT EKDIVIGTHG RSIYVLDDVA PVREFTPDLA KKAVHLFKPY YAVRRVQPAV FQYYLAKKAD SVKIEILNAA GTLIQSFTGN KPSYPKDDED DDDSGKPKIK LPTTAAGLNR YEWDLRYPGA TYFKGMIMWG ARPTSGPLAL PGQYQVRLTV GDQTFTQPFE IKLDPRLKGV SQADVQEQFK MAMKLRDETS KANDAVIQIR AVKEKLAKQP DSPTNKKLKE QLNIIEENLY QIRNQSGQDP LNFPIKLNNR LAALWRSIES GDAKPTNGSY KVYEELTADL NKQLAELDTL LKTKTAKNIG M
|
| |