Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4110 |
Symbol | |
ID | 8727869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4948881 |
End bp | 4950137 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | Tetratricopeptide repeat protein |
Protein accession | YP_003388896 |
Protein GI | 284038966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0123824 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCGC TAAGCCAGCG TATATCTGTT CTTGAACGGC TTTGGCAGGA ATTCCGACGG CTGCCCGATG CCCGCCTGTG CCGTTGGTTA TTTGTCCCCG ACGAACACTG GTTTCTGACT TTGTTTCTGC AAAAACAGAC CGGCCCGCAG GCGGTCTCCA ACGATATTTT CATCACCCTC GATACACCCG TTCAGCACTG GCAGACCTTT TTCGAACAGG CACTGGCCGA CCTTTCTGAA CAGGTCGGCC ACGATCTGCT CCTGCTGGCC GATCAGAACA TTCGCATCCA GTGGCCCCCC GAAGCCAACG GGGTAAAGCA GGACCCGATC ACCCGCTTTA TCTCGTCTGT CAATCTGCTG GACACCCGGC TGCGTACCTA TACCGATGGT CTGCTGGTGT TATGCCTGCT GCCCCAGGCC ACGGCCAGCA ACATAGTGCT GGACCAGATG CTGACGGCCC TGCTGATGGG TGGCCTTTCG CCAACGGTTC GCATCCTGAT TACGGATACC GTCGGTGCCG AACAACTGGC CACGTTACCC GGCCGGTTTC GAAAAGAGGT CTGCTCAAAC CCCATTAATC TCCAGCTTCA GAAGGTCATT CACCAGGTAG CCGCGCTCGG TTCGCCTATG GCACCCGACG TGAAGTTCCG GCAGTGGCAT GCCGAGCTGA GCACCGCCCT TACCCAGCGA AATTTGCCGG ATGTACTGTA TTTTGCTCGC AAGTGTGAGC TGATCTGCCA GCAGGAACGC TGGGCAGCTC TTGAGGGTAG CGTTCACTTG TCTGTCGCCC AGGCCTATAA AGATCACCGA CGGCACGAGG ACGCACTGGA TCGCTACAAA CGGGTGATTG ACCAAATGGA GCCTCTGTAC GACGCGGGCG ATGCGCTCGC CGGTCGAATC AGTCTGGTGG CCTGGCTGGG GGCTGGTGAG ATCTATGAAA CCACCCGACA GCGGCAACGG GCTATTAATG AGTATGAACT GGCCAGTAAT AGGGCCGAAC ATCTGAAAGA GTGGTTACTG GCAATCGAGA GTCATCGTAG GCTGGCTCTG GCCTATGAAC AGCATGGCCG GAGCCGCCTG TCCGAAGAGC ATTACCTACG GCTGTTTACA CTGGCCGAAC ACCTACCGCC CGAACAGCAC GCGGTAGCCC GTTTGCCAGA GATAGGCAGA CAGTACTGGC AGCGACAGCA AACGCCAGAC AAAAGGCGAA AAGCCGATGA GTTATTAACC CAACTCCTCG GAAACAGACG GTGGTAA
|
Protein sequence | MNPLSQRISV LERLWQEFRR LPDARLCRWL FVPDEHWFLT LFLQKQTGPQ AVSNDIFITL DTPVQHWQTF FEQALADLSE QVGHDLLLLA DQNIRIQWPP EANGVKQDPI TRFISSVNLL DTRLRTYTDG LLVLCLLPQA TASNIVLDQM LTALLMGGLS PTVRILITDT VGAEQLATLP GRFRKEVCSN PINLQLQKVI HQVAALGSPM APDVKFRQWH AELSTALTQR NLPDVLYFAR KCELICQQER WAALEGSVHL SVAQAYKDHR RHEDALDRYK RVIDQMEPLY DAGDALAGRI SLVAWLGAGE IYETTRQRQR AINEYELASN RAEHLKEWLL AIESHRRLAL AYEQHGRSRL SEEHYLRLFT LAEHLPPEQH AVARLPEIGR QYWQRQQTPD KRRKADELLT QLLGNRRW
|
| |