Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3901 |
Symbol | |
ID | 8727659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4679951 |
End bp | 4681060 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | proline-specific peptidase |
Protein accession | YP_003388690 |
Protein GI | 284038760 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0490645 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCACGGG TTGCCCGGCA GGTCACAAAT TCAAATCAAT TCAACTCTAA AATCCCCTTG AAAACCAATC GCTTCCCTTT ACTTCTGGCC GCTTTTAATA TCGTGCTCAT TAGCCTGACC GTTACAGGCT GTAACCCAAC TACAAACGGT AGCGAGGCTA ACACGCGCCA AACCTACTTT ACCCCGGCCG ACACGGGTGT GCAAACCGGG GGCGTGACCG TAATCCCGAT CAAGACACCT AAAGGCACAT TCAACGTATG GACAAAGCGG ATTGGCAACA ACCCGAAAAT TAAGGTGTTG ATTCTGCACG GGGGGCCCGG TGTCAACCAT GACCCCTACG AGTGTTTCGA GAATTTTCTG CCCAAAGAAG GCATTGAGTT CATTTATTAC GATCAGCTCG GCGCGGGCAA CAGCGACCGA CCAACTGACA AGAGCCTGTG GGTGTTACCC CGTTTTGTGG AAGAAGTAGA GCAGGTTCGG ATGGCGTTGG GGTTAAACAA AGACAACTTT TACCTATACG GTCAGTCGTG GGGGGGCATT TTGGGTATTG AATACGCGCT TAAATACGGC CAAAACATCA AGGGTTTAAT TATCTCGAAC ATGATGAGCA GCGCGCCCGC CTACAGCCAG TACGCCACCG ACGTACTCGC CAAACAAATG GATCCGAAGG TGCTGGCCGA GATCAAAACC CTTGAAGCAA AAGGCGACTT CACCAACCCG CGCTATATGG AACTGTTGCT GCCCAATTTT TACGAAAAGC ATATCTGCCG GTTTCCAACG GCGCAGTGGC CCGAACCGGT GAATCGGGGG TTGGCCAAAC TGAACCAGGA GCAGTATGTG ACTATGCAGG GACCAAGCGA GTTTGGCATG GCGGGTGATG CTAACCTAAA GAACTGGGAT CGTACCAAAG ACCTGCCCAA AATCACAGTG CCGACGCTTG TTATCGGCGC CACCTACGAC ACGATGGACC CCAAACACAT GGCGATGATG GCCAGACAGG TCAAAAATGG CACTTTCCTG CTCTGTACCA AGGGTAGCCA TCTGGCGATG TACGACGACC AGCAAACGTA TTTCACCGGA TTGATCTCTT TTTTAAAAAA AGGGAATTGA
|
Protein sequence | MARVARQVTN SNQFNSKIPL KTNRFPLLLA AFNIVLISLT VTGCNPTTNG SEANTRQTYF TPADTGVQTG GVTVIPIKTP KGTFNVWTKR IGNNPKIKVL ILHGGPGVNH DPYECFENFL PKEGIEFIYY DQLGAGNSDR PTDKSLWVLP RFVEEVEQVR MALGLNKDNF YLYGQSWGGI LGIEYALKYG QNIKGLIISN MMSSAPAYSQ YATDVLAKQM DPKVLAEIKT LEAKGDFTNP RYMELLLPNF YEKHICRFPT AQWPEPVNRG LAKLNQEQYV TMQGPSEFGM AGDANLKNWD RTKDLPKITV PTLVIGATYD TMDPKHMAMM ARQVKNGTFL LCTKGSHLAM YDDQQTYFTG LISFLKKGN
|
| |