Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1911 |
Symbol | |
ID | 8725648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 2312400 |
End bp | 2315594 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003386755 |
Protein GI | 284036825 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.420637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAATA ACTACTATGT ACGTTGGGGA TCCCATCTGC TGTGGATTAC CTTACTACTG ATCCACACAG CAGTTTTAGC CCAGGACCGT ACCATTACCG GGCGTATCAC GTCTAAAGGA GAGGGCAGTG CACTTCCCGG TGTGAACGTT TCGATTAAAG GCACTTCACG CGGAGTTGTT AGTGACGCCA ACGGTGGATA CAGTATTGTA GCTCCTCCCA GAACTACGCT TGTTTATTCA TTCATTGGAT TCAAAGCGCA GGAAGTGGTT GTCGGGAATC AGTCCGTAAT TAACGTGACC CTGTCGGAAG ATGTCTCCAC ACTCAATGAA GTTGTTGTTA CGGGCTATAG TGCCCAGTCA AAACGGGACA TCACCGGAGC CGTTTCGACT GTGAATACCA AAGAACTGCT GTCGATCCCG GCAACGGACG TAGCCCAGCA GTTGCAGGGC CGTGTAGCTG GGGTAACGGT AACGAACGAT GCTACGCCGG GCGGTTCGGC CACGGTACGG ATTCGTGGAT TTGGTACGAT TGGTAACAAC GACCCACTCT ACATTATTGA TGGTGTTCCA ACCCAAAATC TTGGCACCAT CAACCAGAAT GATATTGAGA CCATTCAGGT ACTTAAAGAC GCTTCAGCCT CGTCTATATA TGGTTCCAGA GCGGCCAATG GCGTTGTGAT CGTAACGACC AAAAAAGGGA AAGCCGGTGT ATCGCGGATC ACGTTCGATG CCTATTATGG GTCGCAGCAG TGGGCCAAAA AAGGGGAAGT CCTCAACGCT ACTGAACTGG GCCAGTACTT ATATCTGGCC GATGTTAACG CAGGTAAGGT CCCGTCACAC GGGCAGTATA CGTATGGTGC TAACGGGCAG GTAACCATCC CAGCCTATGT ATTCCCCAGT AAAGGAGCAG AAGGTACAGC AGCGGTTGAT CCGAGCAAAT ACTCTCTCAC TCCAGACAAT ATTTACGCGA TCACCCGCTC GGCCAATACA AACTGGTTCG ATGAAGTTTC TCGGACAGCT CCTATCCAAA ACTATCAGCT TGGCGCATCT GGAGGCTCAG AAACGGGCCG TTATGCCTTA TCGGTTGGCT ATTTCAACCA GCAGGGAACT GTTAGGGATA TCAGCTACGA TCGCTACTCT ATCCGGGCTA ACACGGAGTT CAATGTTAAA AAACGTATTC GTGTTGGCGA AAACCTGACA GCCGCTTACA GCAGCCGTAA AGGAGGTTTT AACAACAACG AGGAACAGAA CGCAGTATCC GGTGCCTACA AGCACCATCC ACTACTTCCT GTTTACGATA TCGCCGGAAA CTTTGCCGGT AGCCGGGGAC TTAACCTGGG TAACAACTCC AATCCGGTAG CTACGCTGTT CCGTGAACGC GATAACCGGT ATAACAGCCT TCGCGTATTT GGTAATGCGT ATGCTGAAGT CGACATCATC GAAGGCCTGA CGGCTCGTAC ATCGATGGGT CTCGATGCCA ACGGAGACCG CGCCAAATAT TTGGGACGGG CAAACCCGGA ATATATAGAG GGTAGCTTCA ACAACAGCCT GACCGACCAG AACCGCTACT TCTACCAGTG GGTATGGACC AACACGCTGA ACTACTCCAA GACGTTCAAG AACGTTCATA AAGTAGATGC TTTTGTTGGT ACCGAAGCCA TCCGTCAGTA TCAGGAATTC TTCGGAGCCG CTCGCAGTGG CTACTTCACT GAGCAGAAAG ACATTCAAAG CTACCTTGAC CTGGGTACGC AGTCATCAGC CAGCAACGAA GGACGCATTG AGCAGGATTA CTCCCTGTTC TCGGTCTTCG GTAAACTGAA CTACGCTTAT AGCGACAAAT ACCTGTTTCA GGCCATTATT CGCCAGGACA AGTCGTCACG CTTTCTGTCG GCTTCGAACA GTGCTCTGTT CCCGGCTGTG TCAGCGGGCT GGCGTATCTC GCAGGAAGAT TTCTTTAAGA ACAACCTGAC ATTCGTAAGC GATATGAAAC TGCGGGCAGG TTGGGGTAAA ACAGGAAACC AAGCCATCGG AGATTACAAC GCCTATACTA CCTATCGTTC CAATACCTCA ACGAACGGCT ATCCGATTGA CGGAAGCATG TCGACAGCAA CAGCCGGTTT CAGCCCACAG CGTTTTGGCA ACCCGAATGC TAAATGGGAA GCTACAGCTT CGACCAACTT TGGTTTCGAC CTGGCCATGC TGTCGAACAA GTTGGATGTG AGCTTCGATG TGTGGAGCCG GAAAACGACC GATATGCTCT TCACCTCACC GTTTACGTTC ACTGCGGGCG ATGCAGATAT TCCGGCTTAT AACGTAGGTA GTATGCAAAA CCGGGGTATT GACCTCGCTA TTGGCTACAA GGATCGCAAA GGTGATTTCC GTTATGGTGC CAGCATCAAC TTCGCTACGT ATCGCAACAA AGTATTGAAA CTGGATGAAA GCGAGAATAC CCGTTACTTC GGTTATGGCT CGCGCGTTCC GGCGGTTACT CTGACACAGG CCGGACTCCC CATTTCATCG TTCTTTGGCT ACAAGGTACT TGGTATCTTC CAGACAGCCG AAGAAGCAAA AGCCTGGGCT CCCTATGGTG ATTACAATGC TGTCGGTAAA TTCAAAATGG CCGATATCAA TGGTGACGGC AAAATTGATG ATGCGGACCG AACCATCATC GGTAACCCCC ACCCCGATTT CACCTATGGC ATAAATGTGA ACCTTGGTTA TAAAAACTTC GATCTGACCA TCTTTGGTAA CGGATCCCAG GGCAACGACA TTTATAACTA CACCCGTTAT TTTACGGATT TCAACACCTT CCAGGGTAAC CGTTCACGTC GGGCGTTATA CGATGCCTGG TCGAAAACGA ACCCAGGTGG CACAGTACCC GTCCCGGATG CCAACGACCA GATCAGCAGC CGTCCCTCCT CTTACTTCAT TGAGGATGGT TCATACTTCC GGATCAAAAA CGTTCAGTTG GGCTATACCC TGCCAGCCAG TTTGCTCTCC AAACTGGGCT TAGCTTCCTG CCAGATCTAT GTGCAGAGCC AGAACCTGCT CACATTCACC AAATATCAGG GACTCAACCC GGAGATTAGC ATTTCGAACA ACTACAATAG CGACAAAAAC CGGAACCTCG GCTTCGACGG CGGTTACCTG CCCGCTTCCC GTACGCTGCT TTTTGGCCTA AGTGTTGGAT TTTAA
|
Protein sequence | MKNNYYVRWG SHLLWITLLL IHTAVLAQDR TITGRITSKG EGSALPGVNV SIKGTSRGVV SDANGGYSIV APPRTTLVYS FIGFKAQEVV VGNQSVINVT LSEDVSTLNE VVVTGYSAQS KRDITGAVST VNTKELLSIP ATDVAQQLQG RVAGVTVTND ATPGGSATVR IRGFGTIGNN DPLYIIDGVP TQNLGTINQN DIETIQVLKD ASASSIYGSR AANGVVIVTT KKGKAGVSRI TFDAYYGSQQ WAKKGEVLNA TELGQYLYLA DVNAGKVPSH GQYTYGANGQ VTIPAYVFPS KGAEGTAAVD PSKYSLTPDN IYAITRSANT NWFDEVSRTA PIQNYQLGAS GGSETGRYAL SVGYFNQQGT VRDISYDRYS IRANTEFNVK KRIRVGENLT AAYSSRKGGF NNNEEQNAVS GAYKHHPLLP VYDIAGNFAG SRGLNLGNNS NPVATLFRER DNRYNSLRVF GNAYAEVDII EGLTARTSMG LDANGDRAKY LGRANPEYIE GSFNNSLTDQ NRYFYQWVWT NTLNYSKTFK NVHKVDAFVG TEAIRQYQEF FGAARSGYFT EQKDIQSYLD LGTQSSASNE GRIEQDYSLF SVFGKLNYAY SDKYLFQAII RQDKSSRFLS ASNSALFPAV SAGWRISQED FFKNNLTFVS DMKLRAGWGK TGNQAIGDYN AYTTYRSNTS TNGYPIDGSM STATAGFSPQ RFGNPNAKWE ATASTNFGFD LAMLSNKLDV SFDVWSRKTT DMLFTSPFTF TAGDADIPAY NVGSMQNRGI DLAIGYKDRK GDFRYGASIN FATYRNKVLK LDESENTRYF GYGSRVPAVT LTQAGLPISS FFGYKVLGIF QTAEEAKAWA PYGDYNAVGK FKMADINGDG KIDDADRTII GNPHPDFTYG INVNLGYKNF DLTIFGNGSQ GNDIYNYTRY FTDFNTFQGN RSRRALYDAW SKTNPGGTVP VPDANDQISS RPSSYFIEDG SYFRIKNVQL GYTLPASLLS KLGLASCQIY VQSQNLLTFT KYQGLNPEIS ISNNYNSDKN RNLGFDGGYL PASRTLLFGL SVGF
|
| |