Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4520 |
Symbol | |
ID | 8728284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5478561 |
End bp | 5481281 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003389299 |
Protein GI | 284039369 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.815518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAT TATTCGGAAT GATGCCGAAT ATCAACGTAT CAATGATTAC ATCGAATCAA ATCCCGAAAA TTGGGAAAAG GATAAATTTT ATGACGCCCT CTAAGATTAT GCGTTCTGCT CTATTTCTTT CCATTCTACT CAGTCTTATT TCAGTTAACA TACCCAACGA ACCTTCCGCT GTTATCCGCA TTAACCTGCT CGGTTACCGA CCTGATAGCC CGAAGGTCGC CGTTTGGGGA AGCCTGACCG ACGGGCAAAT TAATACGTTC GAACTGGTCG ATGAACAGAC GAACGGGGTG CAGCAACAGC TACCGGCCGG GCGGGCGTTT GGCGGGTATG GACCGTTTAA ACAATCCTAT CGGCTCGACT TTTCGGCGGT TCAGAAACCC GGTCGGTATT ACCTGCGCAC AGCCGACGGT GCCCGATCGC CGGGGTTTCG GATTGGCGAC GATGTGTATG CGGGCGCGGC CGATTTTTGT CTGCGCTACA TGCGCCAGCA ACGAAGCGGC TTCAATCCCT TTCTGAAAGA CTCCTGCCAT ACGCACGATG GCTATACCAT GTACGGTCCC ATGCCCGACA GTACGCATAT CGACGCGTCG GGTGGCTGGC ACGATGCCTC AGATTACCTG CAATACGTAA CCACCTCAGC CAATGCCACC TATCACCTGC TGGCCGCCCT GCGCGACTTC CCCACTGCTT TCGGCGATCA GCACCAGTTA AATGGCCTGG CGGGTGCCAA TGGCTTGCCC GATGTGCTGG ACGAAGCCCG TTGGGGACTC GACTGGCTCC TGAAAATGCA CCCCACCGAT ACCTGGCTCT TCAACCAGCT CGGCGACGAC CGCGACCATT CGAGTATGAG AATTCCAAAA CTCGACAGCA TGTACGGCAA AGGGGCTGAA CGGCCGGTTT ATTTCGCAAC TGGCCAGCCA CAGGGGCTGT TCAAATACAA AAACCGCGCA ACGGGCGTGG CCTCTACGGC TGGTAAAGTG AGCAGCGCAC TGGCACTGGG CTACCAACTA ACCCGTAAAA AAGACCCTGC CTATGCCGGT AAACTCTGGC AACGGGCGCA GTCGGCGTAT AAACTTGGGC TCGAAAAGCC GGGCAATTGC CAGACCGCGC CGGGTCGGGC TCCTTATTTT TATGAGGAAG ACAACTACGC CGACGATATG GAACTGGCTA CCGTTGAACA ACTAAATGCA ACGGGAGCTA CCGGAACACA AGCAGCCAGA TTACAAACTA CCGCACTGGA TTTTGCCCGG ATGGAGCCTG TTACGCCCTG GATTCTGAAC GATACGGCCC GACATTATCA ATGGTATCCG TTTGTCAACG TCGGTCATGC GGAGCTGGCC AAGCAGTTAC CGGCCCGCGA TCGGAAAATC GTTACTGATT ACTACAAGTC GGGCATAGAA ATTATCTGGA AACGGGCGAG CCAGAACGCG TTTTACCGGG GTGTGCCGTT TACCTGGTGT AGCAATAACC TCACAACTTC TTTCGCCACG CAGTGCTTCT GGTATCGGCA GCTCACCGGC GACACGCAGT TTGCCGCGCT GGAACAGGCC AATTTCGACT GGCTGTTTGG CTGTAATCCG TGGGGCACAA GTTTCGTTTA CGGCCTTCCG GCCAACGCCG ACACCCCATC GGACCCGCAT TCGTCCTTCA CCCATTTGAA AAATTACCCC ATCGACGGTG GACTGGTGGA TGGTCCTGTT CGAGGCAGCA TTTACAGCAG GCTGATCGGC ATTACCCTGC ACGAACCCGA CGAGTACGCA CCGTTCCAAA GCAACGTAGC CGTATACCAC GACGATTATG GCGATTACAG CACGAACGAG CCAACCATGG ATGGCACGGC CTCGCTGGTT TACCTGCTGG CCGCCAAACA TGCCGAGAGC TATAAACCAG CCAAGGCAAA TACCTCCCGG AAACCTGAAA AACCGGCTAA GACACTAAAA CCATCGACGA AAACCGACCT CAAATCAACC TATTTCAAGG GAGCCAAAAT AAGGGGTGAT ACCAGCGCCC GCCGATTGGC GCTGGTGTTT ACCGGCGATG AGTTTGCCGA TGGCGGCTCC ACCATAGCCC GAACCCTGCA AAAGCACCAG GTTCGGGCTT CGTTTTTTCT GACCGGTCGG TTTCTGCGCA ACCCCGCTTT TACCGCGCTC ACAAAACAAC TTGCGCGGGA AGGCCACTAC CTCGGCCCCC ACTCCGACCA GCATTTACTT TACTGCGACT GGACAAAGCG CGATAGCCTG CTCTTAACCC GGCAGCAGTT TGTGGATGAT TTACGGGCTA ATTACGCGGC TCTTTCGGGG GTATTGGGCG GGATGAATCA AACGCCATAT CTTGAAGATA TGGCGTTTGA TTCATCCCGC CCAATACCAA AAACCAGTAA ACTATTTCTA CCCCCTTACG AGTGGTATAA CGACAGCATT TCCGTCTGGG CCAACGCAGA AGGCGTTCAA CTCATCAACT ACACGCCCGG TACACTAAGC CACGCCGACT ACACCACCCC TCAGGATAAA AACTACCGCA ACAGTGCCAC CATACTGCAA TCCATACAAA CCTACGAGCA GAAAAAACCG GCTGGCCTGA ACGGCTTTAT CTTACTGATG CACATTGGCG TAGCCCCAAA TCGAACGGAT AAACTGTACG ATCATCTGGA TGAATTAATC GCTGAACTTC GGCAAAAAGG GTATGCCTTT GTGCGAGTCG ATGCCTTGTA A
|
Protein sequence | MTTLFGMMPN INVSMITSNQ IPKIGKRINF MTPSKIMRSA LFLSILLSLI SVNIPNEPSA VIRINLLGYR PDSPKVAVWG SLTDGQINTF ELVDEQTNGV QQQLPAGRAF GGYGPFKQSY RLDFSAVQKP GRYYLRTADG ARSPGFRIGD DVYAGAADFC LRYMRQQRSG FNPFLKDSCH THDGYTMYGP MPDSTHIDAS GGWHDASDYL QYVTTSANAT YHLLAALRDF PTAFGDQHQL NGLAGANGLP DVLDEARWGL DWLLKMHPTD TWLFNQLGDD RDHSSMRIPK LDSMYGKGAE RPVYFATGQP QGLFKYKNRA TGVASTAGKV SSALALGYQL TRKKDPAYAG KLWQRAQSAY KLGLEKPGNC QTAPGRAPYF YEEDNYADDM ELATVEQLNA TGATGTQAAR LQTTALDFAR MEPVTPWILN DTARHYQWYP FVNVGHAELA KQLPARDRKI VTDYYKSGIE IIWKRASQNA FYRGVPFTWC SNNLTTSFAT QCFWYRQLTG DTQFAALEQA NFDWLFGCNP WGTSFVYGLP ANADTPSDPH SSFTHLKNYP IDGGLVDGPV RGSIYSRLIG ITLHEPDEYA PFQSNVAVYH DDYGDYSTNE PTMDGTASLV YLLAAKHAES YKPAKANTSR KPEKPAKTLK PSTKTDLKST YFKGAKIRGD TSARRLALVF TGDEFADGGS TIARTLQKHQ VRASFFLTGR FLRNPAFTAL TKQLAREGHY LGPHSDQHLL YCDWTKRDSL LLTRQQFVDD LRANYAALSG VLGGMNQTPY LEDMAFDSSR PIPKTSKLFL PPYEWYNDSI SVWANAEGVQ LINYTPGTLS HADYTTPQDK NYRNSATILQ SIQTYEQKKP AGLNGFILLM HIGVAPNRTD KLYDHLDELI AELRQKGYAF VRVDAL
|
| |