Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4593 |
Symbol | |
ID | 8728357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5566581 |
End bp | 5568998 |
Gene Length | 2418 bp |
Protein Length | 805 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003389371 |
Protein GI | 284039441 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCGCA ATTACCTGAA AGTCGCCCTG CGAACGCTCT GGAAACACCG CACGCACACA CTCATCAACA TCGTTGGGCT GTCGGTGGCG TTCGGTACCT GCGTGCTGTT GTTCCTGACG GCTACCTTCG AACTCTCCTA CGACAGCTTT CATACCGATG CCGACCGCAT CTTCCGGCTG AACTTTCTGT CGACCAACCG CGATGGAACG ACGGATAGAG GCAGCACCAT GCCGTATCCC ATCTCCCCCG CCCTCAAAGC CGAATTTCCG GAGATTGAAG GGGTTACGCG TTGGTTCGAC CGGAGCGCGA GTATCCGGCG CAATAGCCAG ACCTACACCA AGGATGTCCG CATGGCCGAT GCGGATTTTC TTCATATGTT CTCCTTCCCG CTCCAGAAAG GCAACCCCAA AACGGCCATG AACGGTCTGA GCGACATTGT CATCAGCGAA CGTATGGCGA ACGATATTTT TGGGAAGGAA GACCCCGTGG GCAAGCCCCT CCAGCTCCGT ATGAACGGCG CCTGGCAGGC ATTTACCGTG ACGGGCGTAA TAAGCAATCC TCCAAAAAAC TCAACCTTCG ATTTTGATGC ACTCATCCGC AGCGACAACG CGGGCGATTA TCAGGAGTTC AAAAGCCGCT GGGACCACGG CAACCATGAT GTATATGTAC AGGTAAAAGC CGGTACCGAC GCGCAAACCC TGCAACGCCG GACGCAGGCT TTCATGGACA AATACTTCGC GAAGGATATT AAAGAGCGGC AGGAACAGGG GTATCCAAAA AATGAACTGG GTTACCAGCG CAGCCTTCTT CTGGAGCCCT TGCGCGATGT CCATTTTGAC ACCGTCACCA CACATGGTGC TGGTATCAGT CGGGCGTACG TGTATACATT GCTGCTCATC GGCCTGTTCA TTCTCGCCAT TGCCTGTATC AACTTCATTA ACCTCACCAT TGCGCAATCG CTCTCGCGGG CTCGGGAAGT GGGCGTTCGT AAATCGCTGG GGGCTCAACG GGCACAGCTG TTTGGCCAGA TCTGGGGTGA AACCCTGTTG CTGTGCTTCG GCGCATTAGT CATTGGCCTG GGGTTGGCGT ATGCTGTGCT ACCCACCTTC AATCGCCTGT TCCGAAGCTA TCTGACACTG GACAACTTCC TGACACCAAC CGTATTGCTG GTAACGGCAT TGTGCTTTCT GCTCATCACG CTCATAGCGG GCGGCTATCC ATCCTGGTTT GTGACGCGCT TCAATGCTGT GGAGGTCCTG AAAGGCCGGG TGAAGGTGAG CAAACCGGGC GTACTTCGCA ATTCACTCAT CATCACTCAG TTTACCATAG CCTGTCTGCT TATCGTCTGC ACCATAATAG TCCGGCAGCA GATCACCTAT TTGCAGCAGC GACCGATGGG CATGGACAAA GAACAGGTTA TCAGCGTACC GGTGGGTGGC GAACTCAACG GCACCGTCGC CCTGAAAGCT ATGCGCGACC GGCTGGCCAA CCAACCCAAC ATCACCGCCG TATCGGGTTC GGGCGTGAAC ATCGGCGCGG GACTGGACGG TAGCTCATCC CGAATGATGT TCGGTTTCCA ATACGGCAAA CGGGACGTTA CCTGCGACTG GCTCCGCATC GACACGGATT ACCTGAAAAC AATGGGTATC AAACTCCTGA AGGGCCGCGA TTTCAGCCCG GACTTCAGTA CGGATTCCAG CTCGGCGGTG CTGATTACCC AGAGTATGGC GAAGGCACTA GGCGAAGCAA ATCCTATAGG AAAATTCATT AAGCCCGATA ATAAATCGTA CCAGATTGTG GGTGTCGTTT CCGATTTCAA CCTGTACTCC CTGCATCAGG AAGCCAAACC GATTACCTTG CAAATGGAGT CGAGCGCACC CATTCAGTAC ATTCTTGTCC GGGTAAATCC GCAGAATCTA ACGGGCGCAA TGGAAACCAT CAAGACTGCC TGGAAGACCA TCGCGCCCAA ACAGGAGTTC ATCGGCTCGT TTCTGGATGA AAACACCGAA CGCTGGTATC GGAAAGAACA GCGGTTATCG ACCATCTTTT CGACCGCTGC GGGCATTGCC ATTTTGCTCT CGTGCATGGG TTTGTTCTCC ATCGCTCTGC TCACCATCGA ACAGCGCACC AAAGAGATTG GCGTTCGGAA AGTGCTGGGT GCCAGCGTAG CCAGTATTGT GGCCCTGCTC TCAAAAGACT TTCTAAAACT GGTTGTAGCG GCCATCGTCA TTGCCTCACC TCTGGCATGG TGGGCCATGG ACAACTGGCT TCAGGATTTC GCCTATAAAA TTGATATTGC CTGGTGGGTC TTTGCGGTAG CGGGTTTGCT GGCGGTTGTG ATTGCGCTGG CAACCGTAAG CTTCCAGAGT ATCAAAGCCG CCTTGATGAA CCCAGTGCAA TCGTTACGGT CCGAATGA
|
Protein sequence | MFRNYLKVAL RTLWKHRTHT LINIVGLSVA FGTCVLLFLT ATFELSYDSF HTDADRIFRL NFLSTNRDGT TDRGSTMPYP ISPALKAEFP EIEGVTRWFD RSASIRRNSQ TYTKDVRMAD ADFLHMFSFP LQKGNPKTAM NGLSDIVISE RMANDIFGKE DPVGKPLQLR MNGAWQAFTV TGVISNPPKN STFDFDALIR SDNAGDYQEF KSRWDHGNHD VYVQVKAGTD AQTLQRRTQA FMDKYFAKDI KERQEQGYPK NELGYQRSLL LEPLRDVHFD TVTTHGAGIS RAYVYTLLLI GLFILAIACI NFINLTIAQS LSRAREVGVR KSLGAQRAQL FGQIWGETLL LCFGALVIGL GLAYAVLPTF NRLFRSYLTL DNFLTPTVLL VTALCFLLIT LIAGGYPSWF VTRFNAVEVL KGRVKVSKPG VLRNSLIITQ FTIACLLIVC TIIVRQQITY LQQRPMGMDK EQVISVPVGG ELNGTVALKA MRDRLANQPN ITAVSGSGVN IGAGLDGSSS RMMFGFQYGK RDVTCDWLRI DTDYLKTMGI KLLKGRDFSP DFSTDSSSAV LITQSMAKAL GEANPIGKFI KPDNKSYQIV GVVSDFNLYS LHQEAKPITL QMESSAPIQY ILVRVNPQNL TGAMETIKTA WKTIAPKQEF IGSFLDENTE RWYRKEQRLS TIFSTAAGIA ILLSCMGLFS IALLTIEQRT KEIGVRKVLG ASVASIVALL SKDFLKLVVA AIVIASPLAW WAMDNWLQDF AYKIDIAWWV FAVAGLLAVV IALATVSFQS IKAALMNPVQ SLRSE
|
| |