Gene Information |
Locus tag | Slin_4639 |
Symbol | |
ID | 8728403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5646738 |
End bp | 5649113 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003389416 |
Protein GI | 284039486 |
COG category | |
COG ID | |
TIGRFAM ID | |
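The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (the record lists the gene as unspliced). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5646738
end_bp = 5649113
protein_length_aa = 791

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS carries one stop codon that is not translated,
# so it has one more codon than the protein has amino acids.
expected_gene_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp, expected_gene_length_bp)
```

Under these assumptions both values come to 2376 bp, matching the listed gene length.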
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGAA ACTACCTCAA AACGGCCCTT CGAAATTTAT GGAAACACAA GCTGTTTTCA TTCATCAATG TCTTTGGGCT AGCGTCGGGC ATGCTGGTGT GTTTGCTGGC CATGATCGAC ATCAAAGGCG CGTTCGATTA TGATTCGTTT CATCCACACA CCGACCGGAC ATACCGCATT CTGACCGATG TGACCGACAA AGCGAACGAT GAACGGGCGT TTGCCACTAC CCCCCTGCCG CTGGCCGACG ACCTCAGCCG AAACTACCCG TTTGTGGAGG CCACCACGCG GGTTATCCGC CAGTATGGGG AAGCCACCGC GAATCGAAAG CAACTTCAGG TGATAGCCAG CGCCGTCGAC CCCGGATTCT TTACCGTCTT CGGGTTCAGG CTGGCAGCGG GTCAGGCAGC CACCGCCCCC GGAACGGTCG TGCTCACCAG ACAAACGGCC GAACGGTTTT TCGGGACAGC AAACCCGGTG GGAAAGGTGC TGGAACATAC TGGATTAGGA CCACTGACGA TTACGGGCGT TTTTGCCAAA CCCCCAACCA AAACCCACCT CAACTTCGAT ATGGTGGTGT CGATGGCCAC CCTATCCACC CCGGACTGGC AGCGCAAACG GGCCGACTGG ACGCAGTATT CGCAGGGCTA CACATACGTT CTGCTCAAGC CGAACACCCC AACCGAAACA CTGGAAGCTT CCCTCCCGGC CCTGGCCAGC CGGGTTACGA CCGGCATCCG GTTTGCTACT GAAAAAGGGT ATACCTTCCG TACGCAGGCC CTGGCCAGGA TTTCGCCCTC GCGCGAGGAC CTTATGTATG CTACGTACGA ACCCACCGCC GGAAAGCTGG AAGCCGAACT GGGCGTTGGC TTACTGACGT TGCTGCTGGC AGCTTTCAAT TACATCAACC TCACTCTGGC CCGGTCGCTG AGCCGCGCCC GCGAAGTGGG CATCCGGAAA GTGGCCGGGG CAATGCGCTG GCAGTTGATG GGGCAGTTCA TGGCCGAATC CGTCATTTTG TCGGTGCTGG GCCTTGGGCT GGCGTATGGT ATGCTACAAC TGGTAAAACC CATGCCTTTT GTTCAGCAAT GGCTCATTGG CGATAGTCAA TGGGAAACGA ACAGCACCCT TTGGACGGTG TTCGTAGTAT TCAGCGTGGT AACGGGCTTG CTGGCGGGCT TGTTACCGGC CCGCGTACTG TCGGGATTTC AACCGGCGCA GGTACTCCGC AGCCAGACCG GTCTGAAAGC GTTCAGGGGT GTAACGTTAC GTAAATCCCT GATTGTGGCG CAATTCTCCA TTTCGTTACT CGCCATGATC GCGCTGCTGG CTATGGCCCG ACAGCAGCAG TTTATGGCCA CGGCCGACTA CGGCTTTCAG CGGGAAGGGC TGTTGACCAT TCCGCTGAAT GGAATGCCAC CGGCCCGCCT TTCGGCCCAG ATCAGCCAAT TGGCGGGTGT GGACCGGGTA GCCGCAACCG TCGCGCTATT TGGTGATCAC GGCGGAAACT GGCAAAAAGT GTATCGGCAG AAAGCCAAAA GCGATTCCTC GATAACCGAT GTTTTTGCCG CCGATGCCAA CCTGATTCCT ACCGCCGGGC TAACCTTGGT GGCCGGACAG AACATGCCAC AATCTGCATC CGATACGGCC TCCAACCAGG TTCTTATCAA CGAGGAAGCC GTGAGGACCT TTAAACTGGG CGAACCCAAA GCGGCTGTCG GGCAAACGCT CTGGCTCAGC GACAGCACGG AGGTGCAGAT TGCCGGTGTT GTGAAAGATT TCCAGTTTAC GACGATGGTC 
TGGAAAATCC GTCCGTTGAT ACTCCGCTAT CAACCCGGTG ACTTCCGGTA CCTGACAGTG AAAGTTGCGG GGGGAAATCC CGAATCCGTC AAAGCCGACA TCGCCCGTAT CTGGAAACGG CTCAACCCCT ACGAACCCTT CGCCGGGCAG TGGTACGACG ATTTCCTGTA TAACCGGCAC AGCCATACCG ACGATCTGAG TTTTATGGGC TTGCTCCTTG GCCTGGCCAT GTCGATTGCC TGCCTGGGCC TGTTGGGGAT GGTGACCTAC ACCACCGCCC TAAGGACCAA AGAAGTGGGG GTTCGGAAAG TGATGGGTGC CAGCGTTGGG CAGGTGGTGT GGCTGCTGTC GTGGGATTTC CTGCGTCTGC TGCTCATTGC CGGTACCATT GCCATGCCAT TGGGGTACCT GGCCAGCAGC TTCTTCCTGA TGACATTCGC CTATCACATT ACGGTAGGCG TCGGACTGCT GGGGCTGTGC TTCGGCACAA TGCTCCTGCT GGGTGGCCTG ACCATTAGCT GGCGAACATA CCGGACAGCC CTGACCAACC CGGTGAATAG TCTTCGAAAT GAATAA
|
Protein sequence | MLRNYLKTAL RNLWKHKLFS FINVFGLASG MLVCLLAMID IKGAFDYDSF HPHTDRTYRI LTDVTDKAND ERAFATTPLP LADDLSRNYP FVEATTRVIR QYGEATANRK QLQVIASAVD PGFFTVFGFR LAAGQAATAP GTVVLTRQTA ERFFGTANPV GKVLEHTGLG PLTITGVFAK PPTKTHLNFD MVVSMATLST PDWQRKRADW TQYSQGYTYV LLKPNTPTET LEASLPALAS RVTTGIRFAT EKGYTFRTQA LARISPSRED LMYATYEPTA GKLEAELGVG LLTLLLAAFN YINLTLARSL SRAREVGIRK VAGAMRWQLM GQFMAESVIL SVLGLGLAYG MLQLVKPMPF VQQWLIGDSQ WETNSTLWTV FVVFSVVTGL LAGLLPARVL SGFQPAQVLR SQTGLKAFRG VTLRKSLIVA QFSISLLAMI ALLAMARQQQ FMATADYGFQ REGLLTIPLN GMPPARLSAQ ISQLAGVDRV AATVALFGDH GGNWQKVYRQ KAKSDSSITD VFAADANLIP TAGLTLVAGQ NMPQSASDTA SNQVLINEEA VRTFKLGEPK AAVGQTLWLS DSTEVQIAGV VKDFQFTTMV WKIRPLILRY QPGDFRYLTV KVAGGNPESV KADIARIWKR LNPYEPFAGQ WYDDFLYNRH SHTDDLSFMG LLLGLAMSIA CLGLLGMVTY TTALRTKEVG VRKVMGASVG QVVWLLSWDF LRLLLIAGTI AMPLGYLASS FFLMTFAYHI TVGVGLLGLC FGTMLLLGGL TISWRTYRTA LTNPVNSLRN E
|