Gene Information |
Locus tag | Slin_4641 |
Symbol | |
ID | 8728405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5649493 |
End bp | 5652003 |
Gene Length | 2511 bp |
Protein Length | 836 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003389418 |
Protein GI | 284039488 |
COG category | |
COG ID | |
TIGRFAM ID | |
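The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Slin_4641 record above.
length_bp = gene_length(5649493, 5652003)
print(length_bp)                  # 2511
print(protein_length(length_bp))  # 836
```

Both results agree with the Gene Length and Protein Length fields of this record.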
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0388606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATCGGAA GCTATATCAA AACATCGAGT CGCAACTTAA TGCGTAACAA GCTGTTCTCG TCCATCAATA TTGTTGGCCT TGCCATTAGT ATGTCTGTTG GGCTATTGCT GATCGCGTTC ATGCTCGATC TGTATTCGTA CGACAGATTT CACCAGAATG GGGAGCGGAT TTACCGCATC ACCAGCATAC AAACCTCCAA TCAGGAAGAA CGTCAGTCCG GTCAGTCAAA TCCGGACCGA GCGCGGTCGG GAGCCAAGTT TGCCACCACT TCTTTAAAGA TCGGAAAGCT AATCCGGCAG AAGGTGACCG GCGTTGACCG GTACGCTGGT GCGGACGTGA CTATTCTACA CAACGACTTT TCGCAGGACG CGCAGGTTGG CTCTTCCGTT GTTCCCATCA AGGGTTTCTG GGCGGAGCCG TCTGTATTCA GAATTTTCAC CTTTCCGATG CTGGAAGGCA ATCCCGAAAC AGCGCTGAAA GATCCGTACT CGATCGTTCT TACAGAGACG GCAGCCAAAA AGCTGTTCGG CAATGAATCA GCACTCGGCA AGGCGATCAA ATTCGATACG CTCTCGTATC AGGTGACCGG TGTCATGAAG GACGTTCCCT TCTTTTCGCA TATTCATTTC GAAGCCCTGG TATCACTGTC GACGGCCGAG CAGCTCAACC GGAACAACTT CGAGAAATGG GCAAGTATGC CGTCGAACTC CGTATACCTC CTACTGCCGG AAACTGCCAA TATGGCCTCA ATCCAGTCGC AGCTCGACGC CGTTGCCAGG GAGGAAAATC GCGCCGACGA AAACACGAAG ACCCAGCTTG AGCTAATGCC TTTATATAGT GTCGTGGTCG GCGAAAGCCT CCGTCAAGCC GAAGGGGGGC CTGGCGTTGG GGGGCCACAC ATGCCACCAA CGGTGCTTTG GATACTCGGC GGGCTTGCCC TCATTGTAAT CCTGTCGGCG TGTTTTAACT ACACCAACCT GTCGATGGCC CGCGCCATGC GCCGATTCAA GGAAGTAGGG CTTCGCAAAG CGATTGGTGC TGATAAACGT CAGGTATGGC AGCAGTTTCT GGTTGAAGCC GTCATGATAT CCCTGGCGGC CCTTGTTCTA TCCTACTTTA TCTTTCTCCT GTTGCGACCA CAGCTGATTA ACCTGGCTCC GGAGTTGCAG CGCACAGTGA AGCTCGAACT TAGTCCGGCT ATGGTCATCG CCTTCGTCGT CTTCTCCATT ACCGTAGGAG TTATTGCCGG TATCATGCCC GCTCTGTTCT TTTCGAAAGT CAGCGCGATC AATGCACTCA GGAACGTATC CACCCGGAGC CTCGTCGGCG GAGTATCACC ATCGTTCCTG TCAATAAACG TGTTCAAACA CGCAACACTC CGGCAGGCGC TGGTGGTCAT TCAATACACG CTTACGCTAA TTTTTATCAC AACAACCGCC ATTGGCTATG TGCAGTATAA GAACATCCTG AAATTCGACC TGGGATTCAA TACCCAGAAC ATTCTGAATA TCAACATGCA GGGTAATAAA CCCGATGCAT TTCTGAAAGA CCTTGGCGAG ATGCCGGAGG TAACGGCGCT GTCGCGGTCG CTCATTATCA CCAGCGTCGG CAATGCATGG GGCGGCTACA TGAAATATAC AGATTCGCGC GACTCGGCGC TGGTGCTGAC GAACAACGTC GACGAAAACT ACCTGGCGCT GCACGAATAC AAACTTATTG CCGGGGGTAA TTTTAAAACA AGGCCCACAA CAGCCGAAGC CGTCAGCGAA GTGATCGTTA ACCAGCAAGT TTTAAAACGA 
TTTAACATTG CTGACAACGA CCCCCAAAAA GCGATTGGGC AGGAGATCAC ATTCAGCAAT TTCAGCGGAA CACGCCGGAT GACCATTGTG GGGGTTATGA AAGACTTTCA CTATGGCAAG GTTGACAATC TCGTCGGGCC GGTAGCTTTC ATGAGCTGGA CACCCGGCGA CAGGGCCATT ATCAATGCCA AAATACAAAG TACTGACCTG CTGGCAACCA TGGCCAGGAT TGAGTCGGCC TGGAAAAAGA TCGACCGTGT TCATCCTTTT CAGGCCAAGT TCTATGACCA GGAAATCCAG GACGCTTACA GTGAGTTTTC TGCGATTATC AAGATCATTG GCTTCCTTTC CTTCCTGGCC ATTTCGATTG CTTCGATGGG TCTGTTCGGC ATGGTGGCCT ACACAACCGA AACCAGACTG AAAGAAATCA GCATCCGCAA GGTAATGGGA GCAAGCTCCG TCAACCTTAT TTTCTTGTTG AGCCGTGGTT TTCTCCTGCT ACTGTCGATT TCGGCACTTA TCGCACTCCC CATCAGCTAT CTATTCTTCA AAAACGCTGT GCTCACCCAC TTCCCGTATC ACACCCCCGT TCAGATCGCC GAGCTATTCG TGGGCTTGCT GGTAGTATTG CTGATCGCCT TCATTATGAT CGGCTCGCAG ACGGTAAAGG CCGCAAAGGC GAATCCGGTA GACGTCCTGA AGAGTCAGTA A
Protein sequence | MIGSYIKTSS RNLMRNKLFS SINIVGLAIS MSVGLLLIAF MLDLYSYDRF HQNGERIYRI TSIQTSNQEE RQSGQSNPDR ARSGAKFATT SLKIGKLIRQ KVTGVDRYAG ADVTILHNDF SQDAQVGSSV VPIKGFWAEP SVFRIFTFPM LEGNPETALK DPYSIVLTET AAKKLFGNES ALGKAIKFDT LSYQVTGVMK DVPFFSHIHF EALVSLSTAE QLNRNNFEKW ASMPSNSVYL LLPETANMAS IQSQLDAVAR EENRADENTK TQLELMPLYS VVVGESLRQA EGGPGVGGPH MPPTVLWILG GLALIVILSA CFNYTNLSMA RAMRRFKEVG LRKAIGADKR QVWQQFLVEA VMISLAALVL SYFIFLLLRP QLINLAPELQ RTVKLELSPA MVIAFVVFSI TVGVIAGIMP ALFFSKVSAI NALRNVSTRS LVGGVSPSFL SINVFKHATL RQALVVIQYT LTLIFITTTA IGYVQYKNIL KFDLGFNTQN ILNINMQGNK PDAFLKDLGE MPEVTALSRS LIITSVGNAW GGYMKYTDSR DSALVLTNNV DENYLALHEY KLIAGGNFKT RPTTAEAVSE VIVNQQVLKR FNIADNDPQK AIGQEITFSN FSGTRRMTIV GVMKDFHYGK VDNLVGPVAF MSWTPGDRAI INAKIQSTDL LATMARIESA WKKIDRVHPF QAKFYDQEIQ DAYSEFSAII KIIGFLSFLA ISIASMGLFG MVAYTTETRL KEISIRKVMG ASSVNLIFLL SRGFLLLLSI SALIALPISY LFFKNAVLTH FPYHTPVQIA ELFVGLLVVL LIAFIMIGSQ TVKAAKANPV DVLKSQ
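The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method: it assumes the sequence is given in the 10-base blocks shown in this record, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence in this record.
print(translate("ATGATCGGAA GCTATATCAA AACATCGAGT"))  # MIGSYIKTSS
print(translate("GACGTCCTGA AGAGTCAGTA A"))           # DVLKSQ
```

The two outputs match the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the whole protein to be compared in the same way.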