Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4594 |
Symbol | |
ID | 8728358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5569096 |
End bp | 5571537 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003389372 |
Protein GI | 284039442 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAACCA ACTACATCAA AATCGCCTGG CGGAACCTGC GGAAACAGCA GGGATTCGCA TTTATCAACA TTTTCGGGCT GGCTATTGGC CTGGCATGTT GCATGCTCAT TATGCTGTAT GTGCTGGACG AACTGAGCTT CGACCGCTAC AACGCCAATG CCGACCGTAT CTATCGCATA CAGTCCGACA TCAAGTTTGG GGGGAATGAT ATGCACTTCG CCACAACACC CGACCCACTG GGACCTACCC TTAAGAAAGA CTATCCACAG GTAGAGCAAT TTGTCCGGCT GCACCAGCGT GGTACCTGGC TGGTGAAGAA AACCGGCGAA ACGACCAACC TTCGGGAAGG TGACGTCGTG TTTGCGGATT CTACGGTATT CGATGTTTTC ACGCTGCCCT TTGTTGCGGG CGATGCCAAA CGGGCACTGA CGCAGCCGAA CACAGTTGTC ATCAGCAAAT CGGCCGCGAA ACGGCACTTT GGCAACCAGA ACGCCCTAGG GCAAACGCTG GTCTTTGCAA ACACAGAAAA TTACAGAGTA ACGGGAATCA TGCGCGACAT GCCCCGGAAC GCCCATTTTC GCACTGACTT CTTCGTGACG ATGCTCAGCG ATAAGTACCC CTGGGGACAA TGGCTTAGTA ATAACCATCA CACGTATATA CGGCTGAAGG CAGGACGGTC CGGCGCTCCG GCTGATCCGG CTGTATTTTC TCAGAATTTC GGGGCCGTTA TCGAGAAATA CGTGGGGCCG CAATTGCTGC AAATGGTGGG CACCACAATG GAGCAGTTCC GGAAATCCAA CAACCAGATG AACTTTTGGC TCATTCCCCT CACCGATATT CATCTACGTT CCAAACAACA AATCGAACTG GCCCCCAACG GCGACATTCA ATACGTTTAT ATCTTCTCGG CCGTAGCACT GTTCATTCTC ATTATAGCCT GTATCAACTT CATGAATCTG GCCACTGCCC GCTCATCGAA CCGGGCTAAA GAGGTTGGCG TCCGGAAAGT GATGGGATCG GAACGGCAAC AACTCGTGGG TCAGTTCATG ACCGAATCGA TACTGACGAC CGTGCTGGCT ATGGCGCTGG CCATTGGTAT TGTGGCCGTT GCCCTGCCGG GATTTAATAG CATTGCGGCC AAAGAAATCA GTCTGTTGCA ATTGGTATCG CCATCGTTGT TGCCAATAAT TATCATCCTG CCGATTGTGG TAGGCTTGCT GGCTGGCAGT TACCCGGCCT TTTTTCTGTC TTCCTTTCAA CCCATTTCGG TATTGAAAGG CCGCATAAAC ATGAGTTTCC GGACGGTCAG TCTACGAAGC GGACTGGTGG TGTTCCAGTT CATGATGTCG GTCGTACTCA TCATCGGGAC AATCATTGTC TATCGACAGC TTACGTACAT CCAAACGACC AGCGTGGGTT TCAACCGCGA TCACGTTCTA ACGGTCAATG ATCTGTATGC GGTGGGCAAA CAGGCCGAAA CGTTCAAGCA GGAGGTACTG CGCTTACCGG GCGTGGTAAG CGGCAGCCTG TCCGGTTACC TGCCCACCCC CTCAGACCGG AACGACAACG TGTTTTTCCC CGAAGGGCAG ACAAACATGA ACAAAGGGGT CAGCATGCAG AACTGGGGCG TCGATTACGA CTATGTGAAA ACACTAGGTA TTCAACTGGT AGCGGGGCGT AATTTCTCAC AGCAGTTCGG TTCCGACTCC TCAGGGATAC TGCTCAACGA AGTCGCTGTC AAAATTCTGG GATTCAAAGA CCCCATTGGC AAACGCATCT GGGGATTCAA TGATGCCGAG GGGAAAACCC GAAAAACATA CACCGTCGTC GGGGTTGTTA AAAATTTCCA TTACGAATCG CTCCGGCGCA ACATTGGTGC GTTGGCACTG GTGCTGAGTG CCAACGCAGG GGCGGCTTCC TTCCGGGTAA TCAGCACGAA CCTACCGGTA TTGATGCAGC AGATTGAAGC GAAATGGAAA GCACTGGCAC CCGGCCAGCC GTTCAGTTTC AAATTCATGG ATGATAGCTT CGACGAGATG TACCGCGCCG AACAGCGCAT TGGCACCATT GCCCTAACGT TTGCGGGATT GGCTATCCTG ATCGCGTGTC TGGGTCTGTT CGGACTGGCC GCGTTCATCG CGGAACAGCG TACCAAAGAG ATCGGCGTCC GCAAAGTACT GGGCGCGAGT GTCCCCAGCC TCATCGGTCT GCTCTCCAGG GACTTTCTGA AACTGGTTCT GATTGCTATT GTCATAGCCT CGCCCATTGC CTGGTATGCC ATGAATAACT GGCTAAAAGA CTTCGCTTAT AAAATCGACA TTGAGTGGTG GATGTTTGCC CTGGCGGGTC TGCTGGCCGT AGGCATTGCT CTGTTGACGG TCAGTTTCCA GAGTGTAAAA GCTGCGTTGA TGAACCCAGT GAAGAGTTTA CGGAGTGAGT AA
|
Protein sequence | MLTNYIKIAW RNLRKQQGFA FINIFGLAIG LACCMLIMLY VLDELSFDRY NANADRIYRI QSDIKFGGND MHFATTPDPL GPTLKKDYPQ VEQFVRLHQR GTWLVKKTGE TTNLREGDVV FADSTVFDVF TLPFVAGDAK RALTQPNTVV ISKSAAKRHF GNQNALGQTL VFANTENYRV TGIMRDMPRN AHFRTDFFVT MLSDKYPWGQ WLSNNHHTYI RLKAGRSGAP ADPAVFSQNF GAVIEKYVGP QLLQMVGTTM EQFRKSNNQM NFWLIPLTDI HLRSKQQIEL APNGDIQYVY IFSAVALFIL IIACINFMNL ATARSSNRAK EVGVRKVMGS ERQQLVGQFM TESILTTVLA MALAIGIVAV ALPGFNSIAA KEISLLQLVS PSLLPIIIIL PIVVGLLAGS YPAFFLSSFQ PISVLKGRIN MSFRTVSLRS GLVVFQFMMS VVLIIGTIIV YRQLTYIQTT SVGFNRDHVL TVNDLYAVGK QAETFKQEVL RLPGVVSGSL SGYLPTPSDR NDNVFFPEGQ TNMNKGVSMQ NWGVDYDYVK TLGIQLVAGR NFSQQFGSDS SGILLNEVAV KILGFKDPIG KRIWGFNDAE GKTRKTYTVV GVVKNFHYES LRRNIGALAL VLSANAGAAS FRVISTNLPV LMQQIEAKWK ALAPGQPFSF KFMDDSFDEM YRAEQRIGTI ALTFAGLAIL IACLGLFGLA AFIAEQRTKE IGVRKVLGAS VPSLIGLLSR DFLKLVLIAI VIASPIAWYA MNNWLKDFAY KIDIEWWMFA LAGLLAVGIA LLTVSFQSVK AALMNPVKSL RSE
|
| |