Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3034 |
Symbol | |
ID | 8726786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 3675477 |
End bp | 3678866 |
Gene Length | 3390 bp |
Protein Length | 1129 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | ASPIC/UnbV domain protein |
Protein accession | YP_003387844 |
Protein GI | 284037914 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.721984 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGCC AATTCCCTCA AAGTACCAAT AGAGACCTGA TTGATCGTTT ACGGTATTCT GTTTGCTTCC CGTTTCTGAT TATATGCCTG TGTTGCCCGG TAGTGTATGC GCAGAATCCG CTCTTCCAAC TCCTTGCTCC CAAGCAGACC CATATTGATT TTAAGAACGA TATTGATGAG AACGAAAGCC TGAACGTGCT TTCCTACGAA TATTTCTACA ACGGTGGGGG CGTGGCCGTT GGCGATATTA ACAATGACGG TTTACTGGAT TTGTTCTTCA CTGCTAATCT GAAAGCCAAT AAATTATACC TTAATCTGGG TAAATTAACC TTTAAAGACA TTACCAGTGA GGCCGGTGCT CAGTTAGGCG GTAGAGCGGG CGGCTGGAAA ACGGGCGTTA GTATGGCCGA TGTCAATGGC GACGGCTGGC TGGATATTTA CGTTTGCTAC TCAGGCAAAG GAGACGAAAG CAAGCGCAGA AATCAGCTGT TTATCAACCA GGGGGCTGGC CCTAAAGGCA TGGTTCGGTT TGTGGAACAG GCAAAGGAAT ACGGTGTCGA CGATAACGGC TATAATACGC AGGCTACGTT CTTCGACTAT GATCGGGATG GCGACCTCGA CCTGTTTCTA CTGCATCACA ATGTCAAGAA ATACGATAAC ATGGAACTGG CCAAACTACA CGGTGAGACC GACCTGCTAG CTGGCAATAA ACTGCTCGAA AATAGAAACG GTCACTTTGT CGATGTGTCC CAGAAAGAAG GGATCCATCA ATATCCGTTA ACGTTTGGTC TGGGAATGGC CGTAGCAGAT GTTAATAAAG ATGGGTGGCC GGACATCTAC GTGACGAACG ATTATAACGA GCCAGATTAC CTCTACATAA ATCAGAAAGG GATTGACCTA GCCGCACGGC GGTCGGGTGA GCCGGCTTTT AAAGACGAAA CCCAGTCCTA CTTCCGGCAT CTGGCGCAGT TCTCGATGGG CGTAGATATT GCCGATTATA ACAATGACGG TCTGCCCGAC ATCATGTCGC TGGATATGCT ACCGGAAGAT AACCGGCGCC AGAAACTGTT GCAGCTTCAG GAAAACTACG AGTCGTTTGA GTTGATGCAG CAGCAGAAAC TACAGCGGCA GTATATGCGG AACATGCTGC AACTCAACAA CGGCGATGGT ACATTCAGTG AAATAGCGCA AACGGCGGGC GTGTCGAATA CCGATTGGAG TTGGTCGCCT TTGCTGGCCG ATTTTGACAA CGACGGTTAT AAGGACCTGT TTATTACCAA CGGTTACCTG CGTGATTATA CGAACAAAGA CTTCCTTAAA TACTGGGGCG ACTATAAAAT CAAAAAAGCA ATTGACCGGG AGCCCGTGCA ACTGATGGAT CTGGTAAAAG CCATGCCGTC AACCAAAATT GCTAATTACA TCTTCAGCAA CAATCACGAT TTAACCTTCA CAAATAAACA GCGGGAATGG GGTTTTCAGA CCCCCTCAAT CTCGAACGGG GCCGTCTACG CCGACCTTGA CAACGACGGT GATCTGGAAC TGGTAGTCAA CAACATTAAC GAGCCGGCTT TCGTGTATCA GAACATGAGC CGGGAACAAT CTGCCAATGG GTTTCTTCAG GTAAAACTGG TGCCTTCCGG GAAAAACAGG ACCGCTATCG GCGCTAAGGT CACGCTGTTT GCTAATGGCA ATTTACAATA TCAGGAGGTA AATCCGGTGC GGGGTTATTT ATCAAGCCAA CCGCTCACGC TTCATTTCGG CGTTGGAACG GCCAGTAGGG CTGACTCCAT CAACATTATC TGGCCGGATC AATCTGTACA AAAATTAACC GGTGTTCCTG TCAACCAGCA ACTAGTTGTG CAACAGCAGG CGACTCCCGC TAGCGTTAGT CAGGCTGCTG TGCAGAAAAT GGCCTCACCC GTGTTCACGA AAGTAGACCC GGTGCTGGCC CATACGCACG AGGGGTTTCT GGAAAATGAT TTCAAACGTC AGCCGTTGAT GCTCTGGATG TATTCGCACA CGGGGCCGAT ACTGGCGAAG GGCGACATAA ACAAAGACGG TCTGGACGAT GTGTTCATCA GTGGCGACCA GAACAAGCCG GGCAGCATCT GGAAGCAGCA GACAAACGGG ACGTTCAAGC AGGTAGAAGG ACTAGTTATT GGCGACGAAT CCATATCATC CGTTTCGGCC GCTGTCTTTT TCGACGCCAA CGGCGACGGG TATGACGATC TCTACGTAGC GAAAGGAGGC TACTCGTTAT TTGAAATCAA TACAACCTCC TTTCAGGATG AGTTGTACAT GAATGACGGA AAAGGCAACC TGACATTGGC AAAAACAGCA CTGCCCAACC TCGCTGCCAG CAGCAAAGCC TGTGTTCGGC CGTACGACTA CGATCAGGAT GGCGATATCG ACTTGTTCGT GGGTGGCCGG GTTGTTCCGG GGCGCTACCC CGAAGCACCG GTTTCCTATC TGCTCCAAAA CACCGGCACA GGCCAGTTTA GGGCGGTCCC CATTCCATTC GCTAAAATAG GAATGGTGAC GGACGTAAAA TGGGCCGATA TGAATGGTGA TGGGCGGGCA GACCTGGTCC TGTGTGGTGA GTTCATGCCC ATCAAAATCT ACCTGAATAC AAAAGATGGG TTTGTCGATA AAACGAAGCA GTATTTCCCG GAGGAGGAGA AAGGCTTCTG GCTGTCGCTA ACGATAGCTG ATGTAAATGG AGATGGACAG AATGATATTC TGGCCGGAAA TCTGGGTACT AATTCACAGA TTAAGTATTC ATCTAAAGAG CCTGTCGAGC TTGTCTACGC CGATTTTGAT AACAATGGCT CTATCGATCC CTTCGTTAGT TTCTACGTTC AGGGAACATC GTATCCGTTT GTAAGCCGCG ATGAACTGAA TGATCAGATC TACGCGATGC GCAAGAAATT TGCGTTTTAC AAAGACTACG CAAATGCGAC TATTAGCACT ATCCTACCTG CCGATGACCT GGCCAACGCG CCGAAACTGA CGGCTACGGA GTGCCATTCT GTCTGTTTTC TGTCGAAAAA AGGACAATTT GAAAAACAGA TTCTGCCAAT AGAAGCCCAG TTCGCTCCGG TCACAAACAT GCTATGTGAG GATTTTGACC GGGATGGTCA ACTGGACGTA CTGCTGTTGG GAAACAAATC GGACAATCGG TTAAAACTGG GTAGCATGGA CGCGAACTAC GGCTGTTTAC TGAAAGGAGA TGGTAAGGGA GGTTTTACGT ATGTGAGCCA GCCTGCTTCG GGTTTATCCG TGATTGGTGA TGTAAAATCG GTTGTTGATA TCGGAATCAA TCAGAACAGG TGTTTACTGA TCGGAGCCTT CAATCAGCCG TTGCAGGTGT ACAAAAAGCA AAAACTATGA
|
Protein sequence | MNSQFPQSTN RDLIDRLRYS VCFPFLIICL CCPVVYAQNP LFQLLAPKQT HIDFKNDIDE NESLNVLSYE YFYNGGGVAV GDINNDGLLD LFFTANLKAN KLYLNLGKLT FKDITSEAGA QLGGRAGGWK TGVSMADVNG DGWLDIYVCY SGKGDESKRR NQLFINQGAG PKGMVRFVEQ AKEYGVDDNG YNTQATFFDY DRDGDLDLFL LHHNVKKYDN MELAKLHGET DLLAGNKLLE NRNGHFVDVS QKEGIHQYPL TFGLGMAVAD VNKDGWPDIY VTNDYNEPDY LYINQKGIDL AARRSGEPAF KDETQSYFRH LAQFSMGVDI ADYNNDGLPD IMSLDMLPED NRRQKLLQLQ ENYESFELMQ QQKLQRQYMR NMLQLNNGDG TFSEIAQTAG VSNTDWSWSP LLADFDNDGY KDLFITNGYL RDYTNKDFLK YWGDYKIKKA IDREPVQLMD LVKAMPSTKI ANYIFSNNHD LTFTNKQREW GFQTPSISNG AVYADLDNDG DLELVVNNIN EPAFVYQNMS REQSANGFLQ VKLVPSGKNR TAIGAKVTLF ANGNLQYQEV NPVRGYLSSQ PLTLHFGVGT ASRADSINII WPDQSVQKLT GVPVNQQLVV QQQATPASVS QAAVQKMASP VFTKVDPVLA HTHEGFLEND FKRQPLMLWM YSHTGPILAK GDINKDGLDD VFISGDQNKP GSIWKQQTNG TFKQVEGLVI GDESISSVSA AVFFDANGDG YDDLYVAKGG YSLFEINTTS FQDELYMNDG KGNLTLAKTA LPNLAASSKA CVRPYDYDQD GDIDLFVGGR VVPGRYPEAP VSYLLQNTGT GQFRAVPIPF AKIGMVTDVK WADMNGDGRA DLVLCGEFMP IKIYLNTKDG FVDKTKQYFP EEEKGFWLSL TIADVNGDGQ NDILAGNLGT NSQIKYSSKE PVELVYADFD NNGSIDPFVS FYVQGTSYPF VSRDELNDQI YAMRKKFAFY KDYANATIST ILPADDLANA PKLTATECHS VCFLSKKGQF EKQILPIEAQ FAPVTNMLCE DFDRDGQLDV LLLGNKSDNR LKLGSMDANY GCLLKGDGKG GFTYVSQPAS GLSVIGDVKS VVDIGINQNR CLLIGAFNQP LQVYKKQKL
|
| |