Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_6264 |
Symbol | |
ID | 8730048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 7597615 |
End bp | 7600665 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003391022 |
Protein GI | 284041092 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.724284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCCC ACACCAACGA ACAAGCACTC GAAGCGGCCA TCGAGAAAAA ACTAACCGGT ACTACCCGCG AAGAACTCAG AACACAAGGT ATTACCAGCG CCTGGGCCGA AGCCGCCAGC CAATACCGGT CGGGTAACGG CTACTGGATG GGTGAAACCA CTGATTTCAA CGCCGAATAC GCCATCGACA CCCGGAGGTT CTGGCACTTT CTGGAAACCA CACAACCCCT CGAACTGGCC AAGCTCCAGA CCCAGCATCA ATGGGAGTTG CTGATCCTGC AAAAGGTGGA TCGGCTCATT AAGAAATATG GCGTACTGCA TCTCTTCCGC AAAGGGCTGG ATATTGACGA TGCTCATTTC ACCATGCTGT ACGTGCCGCC TTTGGCATCG AGCAGCCAGA GCGTTAAAGA CGCCTTCGAG CGAAATGAGT TTAGTGTAAC CCGCCAGTTG CGCTATTCGA CCACTAACCC CCGCGAAGAA GTGGATATGG TGGTGTTCAT CAACGGGCTA CCTGTAGCTA CGATGGAGCT AAAGAACCCC TGGACCGGCC AAAATGCACG CGTACATGGT ATCAAACAAT ACAAATGGGA TCGGGACGCC ACGCAACCCC TGTTGCAGTT TGGTCGGTGT GTGGTCCATT TAGCCGTTGA CCCCGATGAG ATTTTTATGA CCACTCGCTT GAGTGGCAAG GACACGTTCT TTCTGCCCTT CAACAAAGGC GACCTGAACC ACGGAGCCGG AAACCCGCCC AATTCGTTCG GCCATAAAAC GGCTTACCTC TGGGAAGACA TCCTTACCCG GCAGAGCCTG AGCACCATCA TTCAGCATTT CGTTACGCTT GATGGAAAAT CCACTGAACC GCTAGCCAAA CGAACGCTCA TCTTTCCCCG CTATCACCAG CTGGAGGTCG TACGCCGGTT GTTAAAAAAC GTGAGCCAGA ACGGTGTGGG GCAAACCTAC CTGATTCAGC ACTCCGCCGG TTCAGGCAAG TCGAACTCCA TAACCTGGGC CGCGTTTCAG TTAATCGAAG CCTACCCCGA ATCGGCTACC ACGCCGGGCA ACCGGGGCAT CACCCTCCCT CTGTTCGATT CGGTTATTGT CGTCACCGAC CGCCGATTGC TCGACAAGCA GATCCGCGAG AACATCAAAG AGTTTTCGCA GGTCAAAAAC ATAGTGGCTC CGGCATTCAG CTCGCAGGAA TTACGACAGG CGCTGGAGGG CGGCAAGAAG ATCATCATTA CCACCATCCA GAAATTCCCG TACATCATCG AGGGTATTTC CGACCTAAGC GACAAAAAGT TTGCGGTCAT TATCGACGAA GCCCACAGTT CCCAAAGCGG CACTGCTCAT GATAGCATGA ACAACGCGAT GGGTAAGACC GAAGCCAAAG ACGACGATCA GCAGCCCGAC GCGCAGGACC TGATTTTAGC CGCCATGCAA GCCCGGAAGA TGCGCGGCAA TGCGTCGTAC TTTGCCTTTA CAGCTACGCC CAAACCGAAT ACGCTGGAGA AGTTCGGTAC CAGGAAAGAC GATGGAACCT TTACCCCCTT CCACTTATAC TCCATGAAAC AGGCCATTGA AGAAGGCTTT ATTCTGGACG TGCTGGCCAA TTATACAACC TATCGGAGTT ATTACGAGAT TGAGAAATCC ATTGAAGAAA ACCCCCTTTT CGATACCCAG AAGGCTCAGA AAAAGCTAAA GATGTACGTG GAGCAGCACC GGCAAACTAT CGGTACCAAA GCCGAAATCA TGGTGGAGCA CTTTACTACG CAGGTAGTCA ATACCAAAAA GCTCAAGGGC AAAGCCAAAG GCATGGTGGT TACCCAAAGT ATTGAATCAG CTATTCGGTA CTTCTTCGCC ATTCGCGCCA TACTGGCCGA CAAGAATGCT TCCTTCAAGG CTGTTATTGC CTTTTCGGGC AAAAAAACCG TCGACGGAAT CGAACATACC GAAGCCAGCC TGAATGGTTT TGCCGAGAAC GAAACCAGTG AGAAGTTCGA CAGCGATGAG TATCGCATTT TGGTGGTGGC CAACAAATAC CTGACGGGTT TCGATCAGCC CAAACTATCG GCCATGTATG TCGATAAAAA GCTACAGGGC GTACTGGCTG TGCAGGCTCT CTCGCGTTTA AATCGATCCG CGAACAAGCT GGGCAAGAAA ACGGAGGATT TGTTCATCCT GGACTTCTTC AACTCGGTCG ATGACATAAA AGAAGCTTTT GACCCGTTCT ACACAGCCAC CTCGCTGAGC CGCGCCACCG ATGTCAATGT ACTTCACGAG TTGAAAGCGC AACTTGACGA TGTAGGCGTT TATGAATGGG CCGAAGTGGA AGACTTCGTG GCCAAATACT TCAGTAACGT TGACGCCCAA CTATTGAGCC CTATTATCGA CGTAGCCGCC GAACGGTTCA GCAACGAGCT AGACCTGGAA GACAACGACA AAGCCGACTT TAAAATCAAG GCCAAGCAGT TTGTCAAGAT TTACGGCCAG ATGGCGTCTA TCCTGCCCTT CGAGGTGGTG AATTGGGAAA AGCTATTCTG GTTTTTGAAG TTCCTGATTC CCAAACTGAT TGTCCGGGAC CCACAGGGCG ATGCGCTGGA CGAGTTGCTT CAATCCGTCG ATCTGTCAAC CTACGGCCTG GAACGCACAC GGCTGGGCCA TACCATCACC TTGGACGATA CCGAAACCGA AGTAGACCCG CAAAACCCAA ATCCACGTGG AGCCCACGGC GCAGATGAAG ACCAGGACCC TCTGGAGGAG ATTATAAGAA GCTTCAACGA GCGGAATTTT CAGGGATGGA ATGCTACGCC CGAAGAGCAA CGCGTGAAAT TTGTGAGTGT TATCAAGTAC ATGCAGCAAC AGCCCACCTT TGAAACGCAC GTTTTGAACA ATCCAGACGT ACAGAACCGG GAACTTGCCT TGCAAAAGTT ATTCAATGAT GCCGTCAACC AACAGCGAAA ACTGGACATT GAGTTTTACA AGCTTTACAC CCAAGATCCA GCCTTCAAAC AAGCCTGGCA AGACAGCGGC CGGCGTATAT TGGGAGTGTA A
|
Protein sequence | MPSHTNEQAL EAAIEKKLTG TTREELRTQG ITSAWAEAAS QYRSGNGYWM GETTDFNAEY AIDTRRFWHF LETTQPLELA KLQTQHQWEL LILQKVDRLI KKYGVLHLFR KGLDIDDAHF TMLYVPPLAS SSQSVKDAFE RNEFSVTRQL RYSTTNPREE VDMVVFINGL PVATMELKNP WTGQNARVHG IKQYKWDRDA TQPLLQFGRC VVHLAVDPDE IFMTTRLSGK DTFFLPFNKG DLNHGAGNPP NSFGHKTAYL WEDILTRQSL STIIQHFVTL DGKSTEPLAK RTLIFPRYHQ LEVVRRLLKN VSQNGVGQTY LIQHSAGSGK SNSITWAAFQ LIEAYPESAT TPGNRGITLP LFDSVIVVTD RRLLDKQIRE NIKEFSQVKN IVAPAFSSQE LRQALEGGKK IIITTIQKFP YIIEGISDLS DKKFAVIIDE AHSSQSGTAH DSMNNAMGKT EAKDDDQQPD AQDLILAAMQ ARKMRGNASY FAFTATPKPN TLEKFGTRKD DGTFTPFHLY SMKQAIEEGF ILDVLANYTT YRSYYEIEKS IEENPLFDTQ KAQKKLKMYV EQHRQTIGTK AEIMVEHFTT QVVNTKKLKG KAKGMVVTQS IESAIRYFFA IRAILADKNA SFKAVIAFSG KKTVDGIEHT EASLNGFAEN ETSEKFDSDE YRILVVANKY LTGFDQPKLS AMYVDKKLQG VLAVQALSRL NRSANKLGKK TEDLFILDFF NSVDDIKEAF DPFYTATSLS RATDVNVLHE LKAQLDDVGV YEWAEVEDFV AKYFSNVDAQ LLSPIIDVAA ERFSNELDLE DNDKADFKIK AKQFVKIYGQ MASILPFEVV NWEKLFWFLK FLIPKLIVRD PQGDALDELL QSVDLSTYGL ERTRLGHTIT LDDTETEVDP QNPNPRGAHG ADEDQDPLEE IIRSFNERNF QGWNATPEEQ RVKFVSVIKY MQQQPTFETH VLNNPDVQNR ELALQKLFND AVNQQRKLDI EFYKLYTQDP AFKQAWQDSG RRILGV
|
| |