Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_6398 |
Symbol | |
ID | 8730182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 7751979 |
End bp | 7754306 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | capsular exopolysaccharide family |
Protein accession | YP_003391155 |
Protein GI | 284041225 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAA ATGCTTACTC ATACATGCCC TACCAGGTCT ATGCTCCTGA GAACCAGGTC AACCTTAGGG TAATATTACT TCGTTACCTG AAGCACTGGA AGTGGTTTGC TTTATCACTG ATATTGGCTG GCGCAGCTGC TTATGTATAT CTGCTTTATC AAACACCAAT CTATAAGGTT CAGGCAAGTT TGTTAATCAA AGATGACAAG AAAGGACTGG ATGGTGAGAA CATCCTGAAA GAAATGGATA TTCTTAAACC GAAGAAAACG GTTGAGAACG AAATTGAAAT TGTGCAGTCC TACACGTTAA TGGACAAAGT GGTTGATCAT CTTAACCTCA ACGTTCAATA TTTTAAGCCA ACGTCTACTA TCGATAAGGA GGTTTATGGT AATTTACCTA TTCGGCTGGT TGTAGAACGG CCTGCTTCAG CCTTATACGA CGATAAGCTT GATCTAAGCT TCGTTGATGC GCAGCACATC AAGCTGAACG ATCGGACGTA CCCCGTCAAC CAATCGGTCA ATACCCCATA TGGACGGCTT CGCTTTTTTA CCCGTCAGCC TTTAACTGCC TCGTTTGAAC CGATGAAAGT TAAAGTGTCG CCCCGTACAG AAACGGTCAA TAACTTACTG AAAAACTTAA CCGTTGAAAC AACTAGTAAG GCATCGACAG TATTATTGAT AACTCTCGAT GAGGCTGTAC CCGAAAAAGG AGAAGCGCTG CTCAAGCAGC TAATCGAGGA ATACAACCAG GCGGCCGTTG TCGATAAAAA CCTGGTTGCG GCCAGCACGC TCGATTTTAT CGAAGATCGT CTTCGGTTGA TCTCCGGTGA ACTGACAACG GTAGAGAAAG ATGTAGAATC GTATAAAACA AGCCAGGGTA TAACAGATCT GAGTACACAG GCGCAATCGT TTCTGCAAAC GGTACAGGCC AACGATGCTC AACTTAATCA GGTAAATATT CAACTGGGTT CACTGAATGA CATTGAACGA TACGTGACCA GTAAAGGCGC TACACGGGGT GCGGCTCCTG CAACGCTTAG CCTGAGCGAT CCGGTGCTGT TAGGGTTGGT AACGAAAGTG TCTGAGCTCG AGTCGCAGCA TGATCTGTTG GCCCGCACGA CGTCGGACCG AAACCCGCTG CTGCAATCGC TGGATAGCCA GATTAAAGCC ACAAAAGAGA GCATAAGTGA GAATATTCAG ACGATAAAGA CTCAATTGAT CAGCACTCGC AATCAGCTTA CATCGACCAA TAAGCGGCTC GAAGGTATGG TGCGCACTGT TCCCCACAAA GAGCGGGCGT TGCTGAACAT TACCCGTCAG CAGGCGATCA AAAACAACTT GTATACGTAT TTGCTGCAAA AACGCGAAGA GACGGCTTTG TCCTATGCGT CTACGTTGCC TGATAGCCGC ATTGTTGATA TGCCCCGGCA TGATGAGAAA CCGATCAAGC CGGTTCGCGG CATGACGTTT GCTCTATTTG GCCTCTTTGG TTTATTGTTT CCTATTGGCG TAATGGCTAC CCGGGATGCG CTGAATAACC GCGTTCGTCG TCGTTCTGAT GTTGAAGAAG CTTCGCAAGT CCCCATCCTG GGTGAAGTGG TGAAGTCAGA TGGTTCAAAG GCGCTGGTCG TCGTATCGAA TAGTCGTTCG GTCATAGCGG AACAAATACG GGCACTTCGA ACAAACCTGC AATTTTTGCG GAGTAGTCAA ACGGGTTGCC AGGTTGTCCT GTTTACATCG AGTATCAGTG GCGAAGGAAA GTCATTTATG TCGCTTAACC TGGGAGCAAG TTTAGCACTG GTCGATCGTC CTACGGTTAT TTTGGAAATG GACCTTCGCA AACCTAAGCT TCATAGCTCA TTGGGTATGC GTAATCCAGT TGGGATTAGT AATTACCTGA TTGGCGAGGC CACCCTGGAT GAGGTACTAC AGCCTATTGA AGGATTTCCC AACTACTTCC TGATAAGCAG TGGTCCTCTG CCCCCAAATC CATCTGAATT ACTGAATGGC CCTCATCTGG CTCGTTTGTT CACCGAGTTA CGCCAACGGT TCGATTATGT TATTGTCGAT TCGCCACCCA TCGGTCTGGT AACGGATGCG CAGGTGATCG CTCCCCTGGC AGACGCGACC CTCTACATGG TACGTCATGA CATTACGCCG AAAACCTACC TCAAAATGGT CGATACGCTC TATAAAGAGC ATCGGTTCCA GAACCTGAAT GTCATCCTGA ATGCTGTTGA TGACGGTGAA TCCTATTACT ACAGTTATAG CTACGGCGGT TACTACCAGG AAGACAAGCC ACAACGCCCT AAGCTCAAAG CTGAATAG
|
Protein sequence | MSENAYSYMP YQVYAPENQV NLRVILLRYL KHWKWFALSL ILAGAAAYVY LLYQTPIYKV QASLLIKDDK KGLDGENILK EMDILKPKKT VENEIEIVQS YTLMDKVVDH LNLNVQYFKP TSTIDKEVYG NLPIRLVVER PASALYDDKL DLSFVDAQHI KLNDRTYPVN QSVNTPYGRL RFFTRQPLTA SFEPMKVKVS PRTETVNNLL KNLTVETTSK ASTVLLITLD EAVPEKGEAL LKQLIEEYNQ AAVVDKNLVA ASTLDFIEDR LRLISGELTT VEKDVESYKT SQGITDLSTQ AQSFLQTVQA NDAQLNQVNI QLGSLNDIER YVTSKGATRG AAPATLSLSD PVLLGLVTKV SELESQHDLL ARTTSDRNPL LQSLDSQIKA TKESISENIQ TIKTQLISTR NQLTSTNKRL EGMVRTVPHK ERALLNITRQ QAIKNNLYTY LLQKREETAL SYASTLPDSR IVDMPRHDEK PIKPVRGMTF ALFGLFGLLF PIGVMATRDA LNNRVRRRSD VEEASQVPIL GEVVKSDGSK ALVVVSNSRS VIAEQIRALR TNLQFLRSSQ TGCQVVLFTS SISGEGKSFM SLNLGASLAL VDRPTVILEM DLRKPKLHSS LGMRNPVGIS NYLIGEATLD EVLQPIEGFP NYFLISSGPL PPNPSELLNG PHLARLFTEL RQRFDYVIVD SPPIGLVTDA QVIAPLADAT LYMVRHDITP KTYLKMVDTL YKEHRFQNLN VILNAVDDGE SYYYSYSYGG YYQEDKPQRP KLKAE
|
| |