Gene Information |
Locus tag | Slin_3923 |
Symbol | |
ID | 8727681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4705687 |
End bp | 4707360 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | carboxyl-terminal protease |
Protein accession | YP_003388712 |
Protein GI | 284038782 |
COG category | |
COG ID | |
TIGRFAM ID | |
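
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 4705687
end_bp = 4707360

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1674, matching "Gene Length | 1674 bp"
print(protein_length)  # 557, matching "Protein Length | 557 aa"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.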
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0829863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.226226 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTCT CTAAACGACT CACCCTGTTA GCCTCCTCAG CCCTTGTAGC GGGTGGCATT GGTTTCTTCT CGTTCAAGAC GGACGACCGC TTCTTCGAGA TCGCGCGCAA CCTGGACATC TACGCCACGC TGTTTAAGGA ACTCAATCTC TATTATGTAG ATGAGGTAAA CCCGAACCGC ATGGTTAAAA CCAGCATCGA CGCTATGCTG AAAGCCCTCG ACCCGTACAC GAACTTCTTC GCCGAAGACG AGATCGAGGA TTACATGACC ATGACCACCG GCCGCTACAA CGGCATTGGG GCGCTCATTG GCCAGCGGCA GGGCAAAAGC ATCGTGCTAA TGGTGTATGA AGGCACCCCC GCCGAAAAAT CGGGGTTGCA AATCGGTGAT GAAGTGCTTA AGGTGGACGG TGTAGACCTG AAAACCCGCA AAGACCGCGA TGGCGGACCA CTTGATCCGG GTAAACTCCT GAAAGGGCAG AACAACACGG CCGTAAAACT GACCGTTAGT CGGTACGGAC AAAAAGCCCC GCTCGAACTC AGCGTTATCC GGGATGTGGT TAAAATGACC AACGTGCCTT ACTACGGCAT GGTATCGGAC GAAGTGGGAT ACATCGACCT CAAAGATTTT ACGGCCACGG CTTCGCGTGA GGTACGGACC GCCTATCAGG AACTGAAGGG GAAAGGGATG AAAAAACTTA TCCTCGACGT TCGTGAAAAT CCGGGCGGAC TGCTCAACAT GGCCATCGAC ATCTCGAATA TTTTTATTCC GAAAGATTCA GAAGTGGTGA CGACTAAAGG TAAAGTGACG GAGTGGAACA AGACCTACAC CGCCATGAAC CCACCCCTCG ACCTCGACAT TCCTATTGTT GTGCTGACAA ACAGCCACAG TGCATCGGCG GCCGAGATTG TATCGGGGGT TATTCAGGAT TACGACCGGG GCGTGTTGAT CGGGCAGCGG ACCTACGGCA AAGGGCTGGT GCAGACCACT CGGGAATTGT CGTTCAACAC CAAGCTAAAA ATCACAACGG CCAAGTATTA CATTCCGAGT GGCCGGTGCA TTCAGGCCAT CGACTACAGC CACCGCAACG CCGATGGCAG CGTGGGCAAG ATTCCGGATT CGCTGAAAAC CGCTTTCAAA ACCAAGGCGG GCCGGGTAGT ATACGACGGC GGTGGCGTGT TGCCCGATAT TGTCGTAGAA GCGCAGACAC CCTCGCCGGT GGCCCTGAGC CTGACAAACA AAGGCCTGAT TTTCGATTAT GCCGTGAAGT ACCGACACGA GCATGCTAGC ATTAAACCAG CCCGCGAATT CCGCCTGACC GATGCCGAGT ATACTGAATT TGCGAAGTGG CTCGGCGATA AAGAGTACGA TTATACGACG CAGGTCGAGA AAGACTTGGG TACCCTCGAA GCATCGGCCA AGAAAGAGAA GTATTTCGAC CAGATTCAGG ATCAACTGAA GTCGCTGAAG ACCAAAATGT CGCACAGCAA AGATGCCGAC CTGAACACCT TCAAGCCAGA GTTAAAAACC CTGCTTGAGC AGGAAATAGC CGGGCATTAC TACCTGCAAA AAGGCATCAA GGAAGCCTCG TTCGCTACCG ATCCCGAAAT GAAAGCAGCC CTTGACCTGT TCAAAGACAT GAACCGGTAC GGTACCATCC TGAAGGGAAA GTAA
|
Protein sequence | MRFSKRLTLL ASSALVAGGI GFFSFKTDDR FFEIARNLDI YATLFKELNL YYVDEVNPNR MVKTSIDAML KALDPYTNFF AEDEIEDYMT MTTGRYNGIG ALIGQRQGKS IVLMVYEGTP AEKSGLQIGD EVLKVDGVDL KTRKDRDGGP LDPGKLLKGQ NNTAVKLTVS RYGQKAPLEL SVIRDVVKMT NVPYYGMVSD EVGYIDLKDF TATASREVRT AYQELKGKGM KKLILDVREN PGGLLNMAID ISNIFIPKDS EVVTTKGKVT EWNKTYTAMN PPLDLDIPIV VLTNSHSASA AEIVSGVIQD YDRGVLIGQR TYGKGLVQTT RELSFNTKLK ITTAKYYIPS GRCIQAIDYS HRNADGSVGK IPDSLKTAFK TKAGRVVYDG GGVLPDIVVE AQTPSPVALS LTNKGLIFDY AVKYRHEHAS IKPAREFRLT DAEYTEFAKW LGDKEYDYTT QVEKDLGTLE ASAKKEKYFD QIQDQLKSLK TKMSHSKDAD LNTFKPELKT LLEQEIAGHY YLQKGIKEAS FATDPEMKAA LDLFKDMNRY GTILKGK
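
The relationship between the two sequences can be illustrated on the first three 10-base blocks of the gene sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code for non-start codons. The spaces in the record are display grouping only and are removed before translating.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
# Assumption: table 11 differs from the standard table only in which
# codons may act as start codons, not in these assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base display grouping
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGCGCTTCT CTAAACGACT CACCCTGTTA"
print(translate(fragment))  # MRFSKRLTLL
```

The result matches the first 10-residue block of the protein sequence in the record.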