Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4799 |
Symbol | |
ID | 8728563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5851721 |
End bp | 5853412 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TPR repeat-containing protein |
Protein accession | YP_003389576 |
Protein GI | 284039646 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACGT ACATAAACAC CTTACTCCTG TCGGGATTAT CAGTAGGGGC TCTGGCGCAA GCCCCCGCTC CCGATGCAAA CACCCTGATG AATCTTGGTC AGTTTGCGGA GGCCAAGCAA TTGCTGAACC GAAATGCTCA GCAGAGTCCG TCAATGCAAA GTCTGTTCGA CGCGGGCTAC GGTTATCTGC GTTCAGGTCA GCCCGATTCG GCCCGTATCT GGTTTACCAA AGGCCTGTCG ATGGATGAGA AGCGCACGCC ACTCAACGAA ACCGGCGTGG CTATCAGCTA CCTAGTGGAA AACAATGCAG CCAATGCCGA ACCTAAACTG GCAGAAGTGC TTAAAAAGAG TAAAAGTAAA AACGCCGACA TTCTGTATCG AATTGGCGAA GCCTACACGG GCTACCTGAC CCCCGGCAAT GGGTCCATCA AACCCGTCTA CGTCAAAGCG GTCAATGCCG CCAAGGCCAT CGACTACCTC AACCAGGCTA CCGAACGCGA CAAGAAAAAC GCAGCGATTC AACTGGCCCT CGGCGATGCC CATTATCTGA ACAAGGATGC CGGTACGGCC GTTACCCGGT ACGAAAGTGC GCTTGAACTG GGTATGAACC CATCGCGGGT GTATCAGCGC ATCGGCGATA TTTATTGGCA GGGACGCAAC CTGAATCTGG CCGTGGAGAA TTACAAAAAA GCCATTGAGG CCAATGCTGC CTACGCTCCG GCTTACAACC AACTCGCCGA ACTGTATTTT CTGGTCAACC GCTACAAAGA AGCGGCCAGC TACATTGACC AGTACGTTCG GGTTTCGCAC GATGCCCGGC AGGAAACACT GCTGCGTCAG GCACAATTCC ATTTTCTGGC TAACGATTTT CAGCGAGCCG TTACACTCAT CGACAGCAAC CGGACGGCAC TGGGCCAGAA CCCCATTGTT TACCGCATAA AAGGCTGGGC GTACTCGTCG CTCAAACAGC CCGATAAGGC CATCGAGAAC ATCAACACGT TCCTGGAAAA AGCACCGGAC AAAGCCATGC CCGACGATTA CAAGTACCTC GGCCGGGCGT ACATGGCCAC CGACAGCACT AGCGATGACT CCTTGGGTGC GGTCTACCTC GCCAAAGCCG CCCCTTTCGA CACGACAGAC AACCTGTACA GCGACATCGC CAAATATTAT TACCGGGCCA AAAAGCACCC CGAAGCGGTC GCCATTCTGG ACTCGGCAGC GAAGCACGGC TTCAAAGCCG ATGTGCAGGA TTTGTTTCGG TACGGCATGA GTAACTACAC CCTCGGGTTC AAACGGGATA GTTCGGGTAA ACTCGAACGC GACACCCTCC GGTTCGCCCT GGCCGATTCG GCGCTGGCAC TGGCTCAACA AGGCTCACCC GAGTACGCCC CAACGGTACT GTACCGGGCC AAAGCTAATT ACTACGCCTA CGCACCCGAA GAAGCGGTGA TGAAGGGCAA AGCCAAACCC TACTTCGAGC AATTCATCAG CATGATAGCC GATAAGGAGG AAGAACGGAA TCGGTACAAA AAGGACTTAG TACTGGCCTT TAAATACCTG ATCTCGTACA ATGAACTAGT CACGAAAGAT GAGGCCGCCC GCAACGAGTG GCTTACCAAA GGACTGGCCC TATTTCCCGA AAACAAGGAT TTAGCTAAAA TAGCCGGTCC GAACGAAGCC GATAGTCAAT AA
|
Protein sequence | MRTYINTLLL SGLSVGALAQ APAPDANTLM NLGQFAEAKQ LLNRNAQQSP SMQSLFDAGY GYLRSGQPDS ARIWFTKGLS MDEKRTPLNE TGVAISYLVE NNAANAEPKL AEVLKKSKSK NADILYRIGE AYTGYLTPGN GSIKPVYVKA VNAAKAIDYL NQATERDKKN AAIQLALGDA HYLNKDAGTA VTRYESALEL GMNPSRVYQR IGDIYWQGRN LNLAVENYKK AIEANAAYAP AYNQLAELYF LVNRYKEAAS YIDQYVRVSH DARQETLLRQ AQFHFLANDF QRAVTLIDSN RTALGQNPIV YRIKGWAYSS LKQPDKAIEN INTFLEKAPD KAMPDDYKYL GRAYMATDST SDDSLGAVYL AKAAPFDTTD NLYSDIAKYY YRAKKHPEAV AILDSAAKHG FKADVQDLFR YGMSNYTLGF KRDSSGKLER DTLRFALADS ALALAQQGSP EYAPTVLYRA KANYYAYAPE EAVMKGKAKP YFEQFISMIA DKEEERNRYK KDLVLAFKYL ISYNELVTKD EAARNEWLTK GLALFPENKD LAKIAGPNEA DSQ
|
| |