Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_2038 |
Symbol | |
ID | 8725776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2463451 |
End bp | 2466525 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | TPR repeat-containing protein |
Protein accession | YP_003386882 |
Protein GI | 284036952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.144797 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTATT CAACCACCCA AACGGGTCAC TACCAATTTT CCAGCATGGT CTTTAATCGG TTGCTGTTCA CGCTGACCCT TGGGTTACTG GCTACCTTAT CGGCTTCGGC CCAGCGCACC CAAAGTAATG TTGAACCCGA TTACCACTAC CGGAATGGGC TCGAACTCTT CGAGAAAGCC AATTATGCCG CTGCCCGCTA CGAATTTCGG CAATACATGG AACCCCGGCG GGGCGATGGT GCCAAATCAC TGCTGAATAC CAGTGATCAG AATGCAGTGG AAGCCGAATA CTACATCGCC CTGACGAGTC TTTATATCGA CGAACCGGGG GCCGAGCTTT TAGTTGACCG CTTCGTCAAG AACAACAGCC AGCATCCCAA AGCCGGACAA CTTTACGGCG ATCTGGGGAC TTACTACTAT AACCGCCAGG ATTACACCAA AGCCATCGAC TTCCTGGAAA AAGCCGTCCG GCAGGGAGGC AGTTCAACGC AGCAATCGGG TTATAAATAC CAGTTGGCGC TCTCCTATTA CAACACGCAG AATCTGCAAA AAGCCCTGCC TCTGCTGAAC GAAATTAAAG TCGATCCCAA CTCAACCGAT GCACCGGCCG CATCGTACTA TGCCGGGACC ATTAACTTCA GAAACAAGAA CTTCAACGAA GCCGTCGCCG ATTTTCGTCG GATCGAAAAC AACCCAACCT ACCAAAATCA GGTACCGAAC TGGATTGCCC AGTCGCTTTA CCGGCAGCGT CGGTACGACG ATTTGCTGGC CTATACCGAG CCGCTGCTTA AGCGAAACAA TGGAGCAGGT ATGAACGAAG TTGCCCTGTT TACGGCGGAA GTGTTTTACC AGCAAAACCA GTTTGCGAGG GCCATACCTT ATTATAAATC GTACGTGAAC ACCGCTGGCG CCAAAGCACC CGGAGCGGTT AAGTTCCGGT ATGGGCAATC GCTCTTTCGC ACCGGTGCCT ACAACGACGC CATCGCTCAA CTCAAAACAC TGGCGGGCGG AAAAGACACG ACCGCTCAAT ACGCTGCGTA TACGCTAGGT GTCAGCTACT TACAAACGCA AAACCCGACG TATGCTCTGA ACGCTTTTGA TCAGGCAGGA CGGCTATCGT TTAACCGGGA GATTCAGGAA GAAGCGCGTT TTAACCACGC TAAACTTCAG CTTGACCAGA ACAACGGGGC GGATGCGGTA AAAGAGTTGA CGGCTTTTTT GAAGCAGTAC CCCGATAGTA AGTTCGAAAA TGAGGCCAAC GAACTGGTCG GTGAAGCCTA TTTTGCGTCC AACAATTACC CGGCGGCTAT CGCTTATATT GAAGGGCTGA AACGCCGGAC ACCGAAAATT AACGCGACTT ACCAGCGGCT CACCTACAAT CAGGGGATTA ACGATTTCAA CGCCGAACGG TACCAGCAGG CAGTTGCCAA CTTCGATAAA TCGCTGAAAT ATCCGGTCGA AAATAGTTTA CAACAGGCAG CCCAGTTCTG GAAAGCGGAA TCCTATTCGG CGGGCAAGCA ATATGATACG GCCATCCCCC TCTACGCCAG CATTTCCAAA GCGGGCGGGG CAGACTCGTA TGCGACCAAA AGCCTGTATG GCCTCGGATA CGCCTATTTC AACAAAAAAG ACTACACCCG CGCCCTGCCT TACTTTCGGG ATTTTGTAAG TCGGGGTGGC GATGCTGACG ACAGAGTCCA GGTTCAGGAT GCGACGATCC GGCTGGCCGA TACGTACTTC GCAACGAAAC AGTACGAAAA TGCCCTTCGT TCGTACGATC AGGCCATCGC GCAGAATGCA CCGGACAAAG ATTATGCGTC CTACCAGAAG GCGTTAATTC TGAGTTATGT TGGTCGGGAC GCGGAAGCCA AAGCCCAGTT TGACCAGGTG CAACGGCAAT ACCCGAACTC CCGCTTCGTG GATGAATCGC TGTTTCAGAA AGCGAATGTC GACTTCGAGA AAGGTTCATA TCAGGTAGCC ATTCAGGGAT TTACCAAGTT GATTCAGGAC AAGCCAAACA GTGCACTTAT TCCGGCCGCT TTGCTGAAAC GCGCGATTGC CTACGGAAAC CTGCAACAGT ACGATCCGGC GGTGGCCGAT TACAAACGCA TTCTGGACAA CTACGGTGAA TCTGACCAGG CGCAGAGTGC CCTGCTCGGC ATTCAGAATA CACTTAACGA TGCTGGTCGG CCGGAGGAGT TCTCGCAGGT GCTGGGCCAG TACAAGAAAG GGAATCCGGG CAGTACGGAT GTTGAGCGGG TTCAATTCGA AAACGCCCGG AATATTTACG CTAGTGGTAA ATACGAGCAG GCCATCCAGT CGTTCCTGAA CTTCATGCAG GAGTACCCGG CCAGTCCGAA TACCAATCAG GCCCGGTATT ACGTAGCGGA ATCCTACCGT CAAACCAACG ATGTGGCTAA TGCCCTCCGG TATTACAACC TGGTGATTGC CGATAACAAG TCGGATTACC TAGTCCGGGC CGCTACACGG GCCGCGGAAC TGGAAGTAAA ACAGAAGAAT TATGGCCGCG CGGTTCGTAA CTACCAACTC ATCCAGAGCC GGGCTGGTAG CAAGGCAGAG CAGGTTACGG CTCAGTTGGG GCTGATGGAT ACATATTTCG TATACCCGAA ACTGGATTCG GCAGCGATTG TCGCCCGCGA GATTGCGGCT GGTGGCAATG TGGTACCGGG CGCGCAAAAC CGGGCTCAGT TAATGCTCGG AAAAGTAGCC TTGAGCCGGA ATGATTACAA AACCGCCCAG GCTGACTTTG ACAAAACGAT TGCCCTGGCC AAAGATATTT ACGGAGCCGA AGCCCAGTAT TACCTCGGCG AAATCCTGTA CCGCCAGAAA AAGTACAAAG AGTCGGTGTC TACGCTGTTG AAATTTAACG AACAGTTCAG TGATTTTGAG TACTGGAAAG GCAAGGCGTT TATTCTGGTT TCTGACAACA ACGTCGCCCT GGACGAGCCG GCCCAGGCCA AAGCCGTTTT GAACTCCATT ATTGAAAATT CATCCGACGA AACCATCGTT ACCGAAGCTA AACAAAAGCT GGCAACGCTG GAGTCTAAAA ATTAA
|
Protein sequence | MPYSTTQTGH YQFSSMVFNR LLFTLTLGLL ATLSASAQRT QSNVEPDYHY RNGLELFEKA NYAAARYEFR QYMEPRRGDG AKSLLNTSDQ NAVEAEYYIA LTSLYIDEPG AELLVDRFVK NNSQHPKAGQ LYGDLGTYYY NRQDYTKAID FLEKAVRQGG SSTQQSGYKY QLALSYYNTQ NLQKALPLLN EIKVDPNSTD APAASYYAGT INFRNKNFNE AVADFRRIEN NPTYQNQVPN WIAQSLYRQR RYDDLLAYTE PLLKRNNGAG MNEVALFTAE VFYQQNQFAR AIPYYKSYVN TAGAKAPGAV KFRYGQSLFR TGAYNDAIAQ LKTLAGGKDT TAQYAAYTLG VSYLQTQNPT YALNAFDQAG RLSFNREIQE EARFNHAKLQ LDQNNGADAV KELTAFLKQY PDSKFENEAN ELVGEAYFAS NNYPAAIAYI EGLKRRTPKI NATYQRLTYN QGINDFNAER YQQAVANFDK SLKYPVENSL QQAAQFWKAE SYSAGKQYDT AIPLYASISK AGGADSYATK SLYGLGYAYF NKKDYTRALP YFRDFVSRGG DADDRVQVQD ATIRLADTYF ATKQYENALR SYDQAIAQNA PDKDYASYQK ALILSYVGRD AEAKAQFDQV QRQYPNSRFV DESLFQKANV DFEKGSYQVA IQGFTKLIQD KPNSALIPAA LLKRAIAYGN LQQYDPAVAD YKRILDNYGE SDQAQSALLG IQNTLNDAGR PEEFSQVLGQ YKKGNPGSTD VERVQFENAR NIYASGKYEQ AIQSFLNFMQ EYPASPNTNQ ARYYVAESYR QTNDVANALR YYNLVIADNK SDYLVRAATR AAELEVKQKN YGRAVRNYQL IQSRAGSKAE QVTAQLGLMD TYFVYPKLDS AAIVAREIAA GGNVVPGAQN RAQLMLGKVA LSRNDYKTAQ ADFDKTIALA KDIYGAEAQY YLGEILYRQK KYKESVSTLL KFNEQFSDFE YWKGKAFILV SDNNVALDEP AQAKAVLNSI IENSSDETIV TEAKQKLATL ESKN
|
| |