Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_5096 |
Symbol | |
ID | 8728862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 6231575 |
End bp | 6234544 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003389870 |
Protein GI | 284039940 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATACT TTTCCGTAGC CATTGGTGCG GCATTTATTT TATTGCTTCC CCTTTTTGTT CTAGCTCAGG TTCAGGTGTC TTTTCCGACG ACACGGGCGG TTTTGCAACG AAACAATTCA AATCAGGCAA CTATTCGTAT AACAGGGTAT TATACCGCAA CAGTCGGGCG TGTTGAAGCC CGTTTACAGG CAAGGGATGG TATAGGCTCC TCAACCGATT GGGTAACACT TCAAAATAAT CCATCGGGAG GGGTATTCAG CGGTGATATA ACGGGCTCAG GTGGCTGGTA CAATCTCGAA GTACGGGGTA TGAACGGCGA CCAGCAAGTG GGTAATTCAA CAACTGTAGA GCGCGTTGGC ATTGGCGAAG TATTTGTCGT AGCCGGACAG TCTAATGCAC AGGGTATCCA TCAGGATGCA CCCAATCCAC TGAATGACCT GGTAAACTGC GTTAACTACC GTTACCCAGA CCAAGGCTTT CCGAACGAAC CACCCACCCC CGTATTCACC CAACTCGATA ATTCATCGGG TTTTACAATA GCGCCCAGAG GAATGGGTAG CTGGGCATGG GGGCAGTTGG GCGATATTCT GGCGAAAAGG TTACGTGTTC CTATTCTATT TTTTAACGCA GCTTTTACAG GAACGTTTGT ACGTAACTGG CGTGATAGTG CGCCCGAAGG TGGAGTGGCT TATGGCCCTG GCGGGGCCTA CCCTGCCCGT CAGCCGTACA TTAACTTAAA ACTTGCCCTT CAGTTTTATG CCAATTCCCT CGGCGTGCGC GCTGTACTTT GGCAGCAGGG CGAATCCGAT AACCTTTACA ATACGTCAAA AGATCAGTAT GTCAACGATC TTCAGTACGT TATCAATCAG TCTCGGCAGG AATACAACAG TAACACGTCG TGGGTAGTAG CCCGTGTGAG TTACGGCGAC TTTACCGGTG GAGTTGACCC AGCTATCATT GACGCTCAGA ATCAGGTTAT CAGCACAACC GCCAATGTAT TTGCCGGGCC TAATACAGAT GTGATTCAAA TACCACGCCA ACGGCCGCCA CGCAATGATC CGGAGGGTGT TCACTTCGAT TATAATGGCC TTGTCGATTT GGCAAACGCC TGGAACGCGA GTTTAAACGA TTCTTTTTTT CAGCGTTCTA CACCTATATC ACCAGTTGCC TCGCCAACAA TCTCTATCGC TTGTGCTTCT AACAACAACC TTTCTCTTAC CGTAAATGGT AACTACGCCA GTGTGCAATG GGAGTCGGGA GAATCAGGTA ATAGTATTAC GAAGGGAGCA GGTGTATATC GTGCTAAGGT CAAAGATTCA CGGGGAAATA CGCTTTTCAC CAATCAGGTG CGGGTATCTG ATGCACCAAT TGCGGCAACG AGCGATAATA GGCCGCCCTC TGTTTGTATA GGCAGTAGCC TGGCACTAAC AACGAATTAT GACAATGTAA CCTGGCTAAA CCAGCAGAAT AACACAACGG TAGCAACGTC ACGCAACTTC TCGACTGTTT CGGCCGGTGC CTACTACGTT CGCTATCGGG ATGTGAGCGG CTGTGAGTTT ACATCGAATG TATTGAACGT AACGGTAAAC CCATTACCCG ACACGCCTAC AATTACCAAC GACAAACCAA CGGTATTCTG CCAAGGGGAT AACACGACGC TTCGTGCCAA TGTTGATAAC ATCCAGTACA ACTGGAGCGA TGGCCAAAAG AATAAGGTGG TGAACGTTGG CAACTCGGGT TCTTACTTCC TGACGGTAAC GGATCGGAAT GGTTGTACAT CAGCGCAGTC AAACACAATT GCAGTTACTG CGAACCCCGT ACCCGCTAAA CCTACCATCG CCACGAACGG CCCAACCACC TTCTGCGCAG ACAGAACCAT TACCCTAACT GCTCCCCAAA ATGTCGCTTA TCAGTGGACA AGCGGTCAAA CAACCCAGAG CATCACCCTT AGCCAGTCCG GTAATTTTGC GGTTAAGACC AGTAACCAGT TCGGATGCAC ATCTGAGCAG TCAGATGTGT TGACGATTCA GGTCAATCCT CTCCCACAAA CCCCATCTAT TACAGCTGGC GGTGCAACAA CGTTTTGCGA AGGCAATCGT GTTACGTTAA GTGCAAGCAG CAACAACACG ATTGTATGGT CCAGTGGCCA GCGCAGCAAC AGCATTACTG TTAGTACGTC TGGCAATTTT ACCGTTCAGG CACTTGACCA AAATGGCTGT TTATCGCCCT TTTCACCGGT TATAGCCGTG AAAGTAAACC CTCTGCCTGC AACGCCAACC ATACTTGCGG CTCCTTCTCC TATCATTTGT GAAGGAGATA GAGCCACCTT ACGGGTTGAC GGTCCATATA CTGTTTTTTG GAGCACCGGC GATTCTACCC AGCGCATTAT GACCGGTTCA GCGGGCAATT ACTCCGCCAA AATCCGGGAT GTTAATGGCT GTGTTTCTGC TCAGGCAGGA GCCATAACGG TTGAATTAAG ACCACTTCCC CCTTCTCCTA CCATTAATGT CATTGGTACC TACACCCTTC AGGCGATAAG CTCAACGAAT GGCACCGTAT TCCGCTGGCG GGTGGGTACT GATTCGCTAG CGGCACAAAC GGCCATTATT AAAGCAAATC AATCTGGTTC CTATACGGCG CGCGCGTCAA TCGTCTACTC ACAAGCACTA ACCTGCTTCT CGTTACCATC GGCTCCATTC GCTTTTACGG TCGATGTAAG CAATAAGGGA TTAAGTGTTT ACCCGAATCC TAATCCGGCT AAAATTATCA CAATAGAAAC ACTGGCTAAC CTGACAAATG CCGTTATCAC CATTTATACC ATCAATGGTC AGATAGTCTT CACTACACCG GTTCCCTCCC TGGATGAGCG AAAACAATTG GTTTTAACCA GTTTGACCTC AGGCTCTTAC ATTTTACGTG TACAATCGGC TGATTTTGAC GTTTCAAAGC GAATTATACT CGGATTGTAA
|
Protein sequence | MRYFSVAIGA AFILLLPLFV LAQVQVSFPT TRAVLQRNNS NQATIRITGY YTATVGRVEA RLQARDGIGS STDWVTLQNN PSGGVFSGDI TGSGGWYNLE VRGMNGDQQV GNSTTVERVG IGEVFVVAGQ SNAQGIHQDA PNPLNDLVNC VNYRYPDQGF PNEPPTPVFT QLDNSSGFTI APRGMGSWAW GQLGDILAKR LRVPILFFNA AFTGTFVRNW RDSAPEGGVA YGPGGAYPAR QPYINLKLAL QFYANSLGVR AVLWQQGESD NLYNTSKDQY VNDLQYVINQ SRQEYNSNTS WVVARVSYGD FTGGVDPAII DAQNQVISTT ANVFAGPNTD VIQIPRQRPP RNDPEGVHFD YNGLVDLANA WNASLNDSFF QRSTPISPVA SPTISIACAS NNNLSLTVNG NYASVQWESG ESGNSITKGA GVYRAKVKDS RGNTLFTNQV RVSDAPIAAT SDNRPPSVCI GSSLALTTNY DNVTWLNQQN NTTVATSRNF STVSAGAYYV RYRDVSGCEF TSNVLNVTVN PLPDTPTITN DKPTVFCQGD NTTLRANVDN IQYNWSDGQK NKVVNVGNSG SYFLTVTDRN GCTSAQSNTI AVTANPVPAK PTIATNGPTT FCADRTITLT APQNVAYQWT SGQTTQSITL SQSGNFAVKT SNQFGCTSEQ SDVLTIQVNP LPQTPSITAG GATTFCEGNR VTLSASSNNT IVWSSGQRSN SITVSTSGNF TVQALDQNGC LSPFSPVIAV KVNPLPATPT ILAAPSPIIC EGDRATLRVD GPYTVFWSTG DSTQRIMTGS AGNYSAKIRD VNGCVSAQAG AITVELRPLP PSPTINVIGT YTLQAISSTN GTVFRWRVGT DSLAAQTAII KANQSGSYTA RASIVYSQAL TCFSLPSAPF AFTVDVSNKG LSVYPNPNPA KIITIETLAN LTNAVITIYT INGQIVFTTP VPSLDERKQL VLTSLTSGSY ILRVQSADFD VSKRIILGL
|
| |