Gene Information |
Locus tag | Slin_2474 |
Symbol | |
ID | 8726218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 2991081 |
End bp | 2994368 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003387292 |
Protein GI | 284037362 |
COG category | |
COG ID | |
TIGRFAM ID | |
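
The coordinate and length fields above are mutually consistent, and a few lines of Python can check the relationship. This is a minimal sketch, assuming 1-based inclusive coordinates (which the reported values imply) and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(2991081, 2994368)
print(length)                  # 3288
print(protein_length(length))  # 1095
```

Because the strand is `-`, Start bp and End bp are the lower and upper bounds on the replicon; the length calculation is the same for either strand.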
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.840021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.838838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAAA TGTTTACACA ACAAAAGCTC CTCTTGTTTA GAACAGGGGC TTCCCTGACA AACGCAGACG TCAGTCGACG GTTTTCGTTT TGTTTGTTAT TCCTTCTGGG CCTCCTGGTC TCAGTAGGTG CTTACGCCCA GACGAAGGTT TCGGGTAAGG TGGTTGATGC GCAGGGGCTA GCTCTGCCGG GGGTGAGTAT CGTCGTGAAA GGTACGACAA TGGGTACGGT TTCGGCCGCA CAGGGTGATT ATACCCTTAA TCTGGCCAAA GGGAATGAAA CGCTGGTTTT TTCCTACATC GGATTCCTGA CTCAGGAAAT ACCCGCTAAC AACCGGTCTA TGATTAACGT TACACTCGCT TCAGACGATA AAATGCTGAA CGAGGTAATT GTGGTTGGTT ACGGCGAGCA GAAGAAAGAG ACGGTAACGG GCTCGGTTGC CACCGTAAAA GGTAGTGAAT TGATCAAGTC GCCGGCGGTC AACCTGTCGA ACTCGATTGC AGGCCGGATG CCGGGCGTTA TCGCCACTAA CGCCAGTGGT GAGCCGGGTT ATGATGGAGC CGCTATCAAG ATCCGGGGTT CTAACACGCT GGGTAACAAC GACGCGCTGA TCGTAATTGA CGGTGTACCG GCACGGGCCG GGGGTATCGA CCGCTTGAAT CCCAACGACA TCGAGAGCAT CTCGGTATTG AAAGATGCGT CGGCTGCCAT TTACGGATCA CGCGCTGCTA ACGGGGTTAT CCTGGTAACT ACCAAGCGGG GTAAAACGGG TAAGCCTGAC ATTTCGTACA GCTTCAACCA GGGCTTCGCT CAGCCAACCG TCATTCCTAA GATGGCGACA GCCTCTCAGT ATGCGGAGTT GAACAATGAG ATCAACGTTT ATAACCTGCC TTCTCAGTAT TGGAAAGATG CTTCGGCTGC ATTCAAGGCT ACGGGTAGCT ACACCCGCCC TGACAACGGC TCGATTGCCA AAGCGGCTTT CACGCCCGAT GATATGAAAA AGTATCAGGA TGGTTCCGAT CCCTGGGGTC ATCCCAACAC TGACTGGTTT GGTGCGGCTC TGAAAAACTG GTCGCCACAA ACCCGGCATA CCCTGCAACT GGTGGGTGGA AATGACAATG TTAAGTATTT AACCTCTGTC AATTATCAGA ATCAGGATGG CTACTACAAG AACTCGGCCA CGGGTTACAA GCAGTATGAC TTCCGCATGA ACCTGGATGC TAAGGTTAAC AAGTACATTA ACACAGTGGT TGGCGTGGTA GGTCGTCAGG AGAACCGTTT CTTCCCAACA GTAGGGGCGG GGGCGATCTT CCGTATGCTG ATGCGGGGTT ACCCCAACAA GCCAGCTTTC TGGCCAAACG GTCTGCCCGC TCCGGATATC GAGAACGGAC AGCAGCCTGT ACTTGTTACG ACCGATGCTA CTGGTTATGA CAAAGACACT CGCTATTATC TGCAAAGCAA TGCCAGCGTA ACGGTAACTA ACCCTTGGAT TGCCGGTCTG AAGTTTGTAG GTAGTGTGGC CCTGGACAAA TACATCCAGC AGGGTAAAAC ATGGCAGACG CCGTGGTTCG TATATAGCTG GGATTATACC TCCTACGATG CCAACAAGCA ACCACTTCTG CAACGCGTTC AGAAAGGACC TGCTCAGGCT ACGCTGAATC AGTACACCAA CGACCAGTTC AACTCGCTGT TGTCGGGTAT CCTCTCGTAT GACCACGTCT TCGGGGGTAA CCACGCGGTA ACGTTACTGG CCGGTATTAC CAAGGAGCAG TCCAATTCAA GCGGTTTCTC AGGCTTCCGG 
AAGTACTTTG CCTCGACCGC CATCGACCAA CTGTTTGCTG GTGGTAGTGC CGAGAAAAAC TCGAACACCA CCGCTGCCTG GCAACGGGCT CGTATGAGCT ACTTCGGTCG GGCGGGCTAC AACTTCAAGG AGAAGTATCT GGCCGAGTTC CTGTGGCGTT ATGATGGTTC CTATATGTTC CCTTCAGCTA CCCGTTGGGG CTTCTTCCCC GGTGTAACGG CGGGCTGGCG TATTTCGGAA GAAAACTTCT TCAAGAAGAG TCTGCCAGCA GTAAGTTCGT TGAAACTGCG GGCTTCATGG GGTCAGTTGG GTAACGACCA GGTATACTTC AACGGTTCGC TGCGTGAGTA TGACTACCTG CCTACTTATG CCTATGGCGA CGTAGTGAAC TCGAACTGGG GTTATGTAAC GGGTGGTCAG GTGTCGCAGA CACTGTATGA GAACGGGGTG CCTAACCCAA CCCTCACCTG GGAAGTGGCT AACAACGCCG ACATCGGTCT GGAAGGCTCG CTGTTGAACG GGAAGATCTT CTTCGAATTC GACGTATTCC AGAATAAGCG GTCGAATATT CTGTGGCGCA AGAGTGCTTC CATTCCTCAG ACCACGGGTA TGACCCTGCC AGCAACGAAC ATTGGTAAGG TGACCAACAA AGGGTATGAG TTCAACATTG GTTACAATGG TCAAACGAGC GGTGGTCTAA AGTATAGCGT TAGCGTCAAT GGTGGTTATG CCAAGAACGA AATCACGTTC TGGGACGAAA CGCCAGGTGC ACCCGAGTGG CAGCGGTCGA CGGGTAAGCC CATCCCAAGT AACGTGAACG ATCCGAACCA GCAAAACGGT ACACTGCTTT ACCAATATGA CGGTATCTTC TCGACACAAG CCGACATTGA TGCCAACAAG CTGGATTACA GTGGTGTTGG AGCCAGCCTG CTACGTCCTG GCGACATGAA GCTGAAGGAC ATCAACGGCG ATGGCAAGAT CAACGGCGAT GACCGGGTTC GGGCCGACCG CAACAACCAG CCTCGTTTCC AGGGTGGTTT CAACGCCAAC CTGCGCTATA AGAACTTCGA CCTGAGCATT CTGGTACAAG CTTCGGCTGG TGGTCAGATC TTCCTGCAAA CGGAGTCGGG TACCATTGGT AACTTCCTCG CCTGGAGCTA TGACAACCGC TGGACGGTCG ATAACCCAAG CACGGTTAAC CCCCGCATCG TTGACCGGAG CAACCAGTAC TTCTCCAACG GTACCAGCTA CTGGTTGAAG AGCACGGACT ACGCTCGTCT GAAAAACCTG GAGTTAGGCT ATACCTTACC GAGCACCATT GGTAGTAAAA TTGGTCTGAA CAACCTGCGC GTTTATGTTA ACGGTCTGAA CCTGATCACC TACGCACCTG CCATGAAGGG TCTGTTTGAT CCTGAATCGA CCAGCGGTAG TGCTCAGTAC TATCCTCAGG CACGGGTTAT CAACACAGGG GTATCCGTTA GTTTCTAA
|
Protein sequence | MQKMFTQQKL LLFRTGASLT NADVSRRFSF CLLFLLGLLV SVGAYAQTKV SGKVVDAQGL ALPGVSIVVK GTTMGTVSAA QGDYTLNLAK GNETLVFSYI GFLTQEIPAN NRSMINVTLA SDDKMLNEVI VVGYGEQKKE TVTGSVATVK GSELIKSPAV NLSNSIAGRM PGVIATNASG EPGYDGAAIK IRGSNTLGNN DALIVIDGVP ARAGGIDRLN PNDIESISVL KDASAAIYGS RAANGVILVT TKRGKTGKPD ISYSFNQGFA QPTVIPKMAT ASQYAELNNE INVYNLPSQY WKDASAAFKA TGSYTRPDNG SIAKAAFTPD DMKKYQDGSD PWGHPNTDWF GAALKNWSPQ TRHTLQLVGG NDNVKYLTSV NYQNQDGYYK NSATGYKQYD FRMNLDAKVN KYINTVVGVV GRQENRFFPT VGAGAIFRML MRGYPNKPAF WPNGLPAPDI ENGQQPVLVT TDATGYDKDT RYYLQSNASV TVTNPWIAGL KFVGSVALDK YIQQGKTWQT PWFVYSWDYT SYDANKQPLL QRVQKGPAQA TLNQYTNDQF NSLLSGILSY DHVFGGNHAV TLLAGITKEQ SNSSGFSGFR KYFASTAIDQ LFAGGSAEKN SNTTAAWQRA RMSYFGRAGY NFKEKYLAEF LWRYDGSYMF PSATRWGFFP GVTAGWRISE ENFFKKSLPA VSSLKLRASW GQLGNDQVYF NGSLREYDYL PTYAYGDVVN SNWGYVTGGQ VSQTLYENGV PNPTLTWEVA NNADIGLEGS LLNGKIFFEF DVFQNKRSNI LWRKSASIPQ TTGMTLPATN IGKVTNKGYE FNIGYNGQTS GGLKYSVSVN GGYAKNEITF WDETPGAPEW QRSTGKPIPS NVNDPNQQNG TLLYQYDGIF STQADIDANK LDYSGVGASL LRPGDMKLKD INGDGKINGD DRVRADRNNQ PRFQGGFNAN LRYKNFDLSI LVQASAGGQI FLQTESGTIG NFLAWSYDNR WTVDNPSTVN PRIVDRSNQY FSNGTSYWLK STDYARLKNL ELGYTLPSTI GSKIGLNNLR VYVNGLNLIT YAPAMKGLFD PESTSGSAQY YPQARVINTG VSVSF
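
The protein sequence is the translation of the gene sequence, which is already given on the coding strand. The sketch below illustrates that mapping on the first 30 and last 18 bases only. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code (table 11 differs in its permitted start codons, not handled here because this gene starts with ATG). `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table in the conventional compact form: bases ordered T, C, A, G,
# first codon position varying slowest. Amino-acid assignments are those of
# the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAGAAAA TGTTTACACA ACAAAAGCTC"))  # MQKMFTQQKL
# Last 18 bases, in frame, ending in the TAA stop codon.
print(translate("GTATCCGTTA GTTTCTAA"))               # VSVSF
```

Both outputs match the start and end of the protein sequence in the record.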