Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4180 |
Symbol | |
ID | 8727939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5032765 |
End bp | 5035737 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | conserved repeat domain protein |
Protein accession | YP_003388965 |
Protein GI | 284039035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.245505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTTC CTTTCCGATG GATTGGGGTT ATTGTCTTAC AACTTTTCTC TGGTCTGGCG TTCTCACAAC TCACCATCAC CAGCCCTGTA CCAAGAATGG TCTTCCAGCG CAATTTAGCA AATGAAGCCA GTGTTAGTAT AACCGGAATT GCCTCTTCTT CAGCAACAAC CATAGAAGCT CGTTTTGTCC CCATGGCCGT TGGTCAGGGA AATGTTACGG ACTGGAAGCC GTTGAAATTT CTACCTCAGT CGACGGCCTT TCACGGGCAG GTGACCGTAT CGGCAGGCTG GTACCGACTG GATGTGCGAT CGCGGTCAGG GACAACTATC ACCGCCCAAA CCCAGGTCAA CCGCGTCGGT GTGGGCGAGG TGTTTATTAT TGCCGGGCAG TCGAATGCCG AAGGTGGGTT TCAACGGCCG CCCAGTTCGG TAGATGACCG GGTCATGTGC GTCGACTTCC GACAGGATAG CCTCAGTGAG CAGTTGATCC CCTTACAATT TAGCCACATC AGTTACGGCA CCAGCATTGG TCCCAGCCAG CCTCCCCATA TTTACAGCAT TTTAGGCGAT AAGCTCGCCC AGCGGCTTAA TGTTCCCATT CTGTTTTTAG GCGCGGCTCT AGGCGGGACC AGCAGCGCCG ACTGGCAGCA ATCCGCAGCC GGCAATATGG GCACCGGTCG TAATTCAGCC GTTTATCGGC GAATGGGGGC GGTTTTACTT CATTATGTCA CCCGAACGGG GGCCCGGGCT GTTCTCTGGC ATCAGGGAGA AAGTGATCTT CATTCGTCAA CACAGACGTA TTTCGACAAC ATCAAGTATG TCATTGAAAA AAGTCGGCAG CAGCTTGGCG GGAAGCCTCT GGCATGGGCA GTTTCACGGG CAAGTTATAT CTTCGGGCAA ACCAGTTCAT CTGTTATCGC GGCACAGAAC CAGCTCATCA ACAGTGTCTT TAACGTTTTT GCCGGACCCG CTACCGATGG TATTACCGGC CCCGACAACC GGTTCGACGA TCTGCATTTT GGCGGCAATG GGCTCTATCG GTTTGCCAGT GCCTGGGATG AGAGCCTGAC AGCCTCCTTT TTTCAAAATG CACTGCCCGT TATGCCAATC GACTCGGTGT CGCTGATCAC GAGCGGTTAC ACTATACCGC TTACCCGACG ACCGGGTGAG ACGGTTGCGG TCGCTTCTGT CCGCAGTGAT GCGCATGAAT CGGACAATCA GTATGTTGCT CAGATTATCC GGGCCAGTGA TGGGGCGCTG ATGGCCGAAT CCAGTCCAAC TACTGACAAC CCGATCTCGA TGGTGTTACC ATTTGCACTG GCAAACGGGC AGTATCGTCT GCGTACCCGC TCTACTCATC CCGTTGTGCT GGGCACTCTG GGCGAACCAT TCAATGTACA GCAGGATGCC ACCCCCCAGA CTCAACCCCC TATACAGCGA CTACCTGTTA GCGGAGGAAC AGCCGATACA ACCATCAGAC GTTTTGCGTA TCGTTTTGAA ACAGGCTCCC ATTCGTTTTA CGGGCTCATT CAGGCTACTT CGCCGGTAGA AGTTCGACTA CAGAGTCTCG ACGGCAGCGG TTTCAATGAT TCTGACTGGC ACCTCGCTCC TCCCAGTTCT CAGGCACCTG ATTATGATCA GTTCGCTGAC TTCAACTATA TCCGTAATTA CCCCCCTATT GCCGGGGGAG TTGGTGGCGT GATTCCGGGG AGATACCGTT TTTCGATCCG TCGGCAGGGC AACACCGGGC CGGGCTTATG GTATGAAATG ACGCTACTCA ACGGCCGAAA TATCCTATAC TATCCCATGG AGCCCATTGG TACTGTTCCA CCAGTACTTA CCATCACCAA CTCGGTAACG CCATGTCTGG TTGGCTCCTT TGCCGTAGCC GTCGACGTCG CCGAAAGTGC CCCGCAGGCA GGCAATGTTT TCAGTGTCAA ACTGTCGGAT GCCAACGGCT CGTTTACCAA TGAAACAACC ATTGGCACCG GTACAACCAG TCCTATTGCT GTCACCCTGT CGCCAACGCT TCCTGTGGGG TCTAATTACC GTATTCGGGT TATTGCCAGC AATCCGGCGG TAGCCAGCGC CCCCAGCCAG CCGTTTTCCA TTTGTGCCGG GGCCGATCTG TCCATGCAAA TGGCCATCAG TAATCGGGCA TCGTTAACCA GTCAGCCCGT TACGTTAACA GTTGTACTAA CCAACGCAGG GCCAATGGAC GCGACAAATG TAAAAGCCAG CAGCATACTT CCCGACGGGA TGAGTTTTGT CGATGCGGCA TCAGGAGCCG TTAGTACAGC CGCAAACACC GTTTCCATCA ATGCGGGTAA TTTGCTCAAT GGAGCCAGCA AATCGTTTGC GTTTCGGGTT AAACCTACTA AAAACGGCAC CTTTTTTACG TCTGCCCAAA TTACGGCCAG TGATCAGTTC GACCCCGACA GCCAGCCCAA CTCAGGCACT GGCGACGGGC AGGATGACGA AGGCAGCGTC GATTTGCGTA CACCCGACTC TGGTACGTTC GTCAGCATAT CCCCCAATCC AGGCCAGGTA CCACTACCCC CCGTTCAGTC CAGCCAACCG CCAGTAGACA ATACCAAAGC CGATCTGAGT TTGGCTATCG CCACCAACTC GCTGGTAGTT TCCGCCAATC AGGTCGTAAA CATTCCTTTA ACCGTAAGCA ATTTGGGCGG GGCAAACGCG ACAAATGTAT CCGTACAGGC GCTTTTACCC ACGGGCTGGC AATTAACGAC AACAGCGGGG CTGACGGTCA GCGGGCAAAC TGTAAGCGGT ACCATTGGGT CCGTTGCAGC CGGTAGCACA GGCACACTGG TACTTGTGGT GAAGATAACC CAGGCGGGTA CGCTGCAAGC CCAGATAGCT GGCGCATCGC CTTCTGACCC CGACTCCACA CCGGGCAATG GCTACACGAA AGGCGAAGAT GATGAGGCCA GATTAAGTCT GCGGATCAAG TAA
|
Protein sequence | MPFPFRWIGV IVLQLFSGLA FSQLTITSPV PRMVFQRNLA NEASVSITGI ASSSATTIEA RFVPMAVGQG NVTDWKPLKF LPQSTAFHGQ VTVSAGWYRL DVRSRSGTTI TAQTQVNRVG VGEVFIIAGQ SNAEGGFQRP PSSVDDRVMC VDFRQDSLSE QLIPLQFSHI SYGTSIGPSQ PPHIYSILGD KLAQRLNVPI LFLGAALGGT SSADWQQSAA GNMGTGRNSA VYRRMGAVLL HYVTRTGARA VLWHQGESDL HSSTQTYFDN IKYVIEKSRQ QLGGKPLAWA VSRASYIFGQ TSSSVIAAQN QLINSVFNVF AGPATDGITG PDNRFDDLHF GGNGLYRFAS AWDESLTASF FQNALPVMPI DSVSLITSGY TIPLTRRPGE TVAVASVRSD AHESDNQYVA QIIRASDGAL MAESSPTTDN PISMVLPFAL ANGQYRLRTR STHPVVLGTL GEPFNVQQDA TPQTQPPIQR LPVSGGTADT TIRRFAYRFE TGSHSFYGLI QATSPVEVRL QSLDGSGFND SDWHLAPPSS QAPDYDQFAD FNYIRNYPPI AGGVGGVIPG RYRFSIRRQG NTGPGLWYEM TLLNGRNILY YPMEPIGTVP PVLTITNSVT PCLVGSFAVA VDVAESAPQA GNVFSVKLSD ANGSFTNETT IGTGTTSPIA VTLSPTLPVG SNYRIRVIAS NPAVASAPSQ PFSICAGADL SMQMAISNRA SLTSQPVTLT VVLTNAGPMD ATNVKASSIL PDGMSFVDAA SGAVSTAANT VSINAGNLLN GASKSFAFRV KPTKNGTFFT SAQITASDQF DPDSQPNSGT GDGQDDEGSV DLRTPDSGTF VSISPNPGQV PLPPVQSSQP PVDNTKADLS LAIATNSLVV SANQVVNIPL TVSNLGGANA TNVSVQALLP TGWQLTTTAG LTVSGQTVSG TIGSVAAGST GTLVLVVKIT QAGTLQAQIA GASPSDPDST PGNGYTKGED DEARLSLRIK
|
| |