Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4782 |
Symbol | |
ID | 8728546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5824338 |
End bp | 5827772 |
Gene Length | 3435 bp |
Protein Length | 1144 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003389559 |
Protein GI | 284039629 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCAAC ATGAACCAGC GTTAGTGAAT CCTGACGTGC ATCGGACCGA CTGCGAATCG GGCCCCGACA AGCCGGATTG TTTTGCCGAT TTCCTATCTA ACAAAACCCT TAAAGGTATG AAAAAAAGAC TACCTGTGCC CACCGGCGGC TTACCCGGTT GGTCAGCCGA TCTACTTCCA CGATTAATGA ATCTTTCGCT TACCCAGCTT TTCCTGATGA TTGCCTGCAC GAGTTTCTCG TTTGCCTTCG ATGGCCAGGC GCAGGAATTA ATGAACCGTC CCGTAACCCT GAAGGTAGAG GGGCAGCGGC TCCGCGTGGT GCTGGCGCAA ATCGAACAGC AAACAACGGC CCGCTTCGTT TACAGCTCGA AGTCGATTGG TGTCGACCGC CCCATAACCA TCACCACCCG CGACAAACGG CTGGCCGACG TGCTGACTGA ATTACTCCGG CCGCTGAAAC TGAGCTACCG GATGGTGGGC GGTCAAATCG TGCTGGAAAG CGATGCCGAC GCCCATTCGC TGGTAACACC GGCAAACGAA GCCGCCGACC GCGCGCTGTC GGGAATGGTG ACCGACGAGA AAAACGCGGC CCTACCGGGC GTAAGTGTCG TGATTAAAGG CTCGAACCGG GGCTCCACTA CGGATGCCAA CGGGCAGTTC AAAATCACGG TGCCGGACGG TAACGCCGTT ACGCTGACGT TCTCATTTGT TGGCTACCAG AGCCAGGATG TGGTGGTTGG CAGTAAAACG ACGGTCAACG TGTCGATGGT ACCCGACGTC AGTGCGCTGG ACGAAGTTGT GGTCATCGGC TACGGGGCCG TTCGCAAAAA AGACTTGACC GGCTCGGTGG TGCAGCTCAA GAGTGAGCAA CTAAAAGAAG TACCGACCTC CAACGTGCTC GAAGCCGCGC AGGGTAAAAT TGCCGGGGCC GACATTACCC GCAGCAGCGG TCAGGCGGGG GCGCGAATAA ATATCTCCAT CCGGGGCAAC CGCTCCATTG GCGGCAACAA CTCCCCGCTC ATTATCGTGG ATGGTATCCA GTACAGTAAC CTGGAAGACA TCAACGCCAA CGACATCGAG ACGATGGATG TCCTGAAAGA TGCGTCATCT ACGGCCATTT ACGGGTCGCG CGGGTCGAAC GGGGTTATTC TGATCACGAC CAAGAAAGGT AAGCTGGGCA AACCCGACAT TTCGTTCAAC GCCTATTCCG GTATCTCGCA GGTGACGATG TACCCGAAGG CGATGGACAT TACCGGTTTC CGGGATTTCA AACGGGAAGC GTGGCGGGCC GCCGGTATCT GGAAAAGCCC CGCCGATGAT GCCGCCATTT TCACCAACGT AGCCGAATAC GACGCCCTGC AAAAGGGTCT CTGGACCGAT TATCAGGACG CGCTGATTCA CAACGGGCTT CAGCAAAACT ACCAGGTGGG TATTCGCTCC GGCACCGACC GGCTGAAATC GTACGTTTCG GTCGACTATT TCAATGAGAA AGGCATTCTG AAACTGGACG AACTGAGCCG CTACACGGGC CGACTCAATG TCGACTTTAC CATTAACGAC TGGATGAAAA TCGGGTTGCA AAGCCAGCTG ACGTACTACA ACCAGAGCGT ACGCCGGGAC CCGCTCAACC AGGCCAACAA GATCAGTCCG CTGGGATCGC TCTACGATGC CAACGGCAAT TTCAATTTCA TTATGCTCGA TGGACAGACC GCCAACCCCC TCTCCGACGA GCAGCCCAAT GTGTTTAACA ACTCAGTGCT GACCACCCGG GTGCTCACCA ATGGGTACCT CGAACTGACG CCCTTCAAAG GGTTCTCGTT CCGGAGTACG CTGGGCGTCA ACCTGGCTTC CATACGTGAT GGGGCCTATT CATCGCCCAA ATCCATCGAC CGCTCGCTGA CGGGCAAATC GCTTTCTACG TACAACACCA GCAACGGCCG CACCGTGAAC TGGGAGAACG TCATGACCTA CCAGCGGACC TTCGGCCAGC ACGCCGTTAC CATGACGGGC ATCGCCAGTT ACCTGGGCAA CACCTCCGAC AACTCGGCGG CTTCGGGCGT CAATCAGTTG CTGCCTTCGC AGTTGTTTTA CTCGCTGGGC AGTGCCACGG AAGAGATCAA GATCAATTCG GCGTTTTCCA AGAACAACCT GGTCTCGTTT GCCGCCCGCC TGAACTACGC CTTCCGCGAC CGGTATCTGC TCACGCTGAC CGCCCGCGAA GATGGTTCGT CGAAGCTGGC AGCGGGTAAT AAGTGGACGT TCTTCCCGTC GGCGGCTTTC GCGTGGCGGG TTATCGAGGA GAAATTCATG CAGGACGTAA AGGGCCTGAG CGACCTCAAA ATCCGGGCAA GTTATGGCGT AGCGGGTAAC GACCCATCCG GCCCTTACGC GACCCAGACA ACCCTGACCC GGCTGGCTTT TGGTTTCGAT GACATCTCGG CTCCGGCCTA TACCTTCTCC CGAAACGTGG GCAACACCGC CCTCGGCTGG GAATTGTCGA ACACGAAGAA CCTGGGCGTG GATTTCGGGC TGTTCAACGG GCGCGTCAAC GCATCGCTCG ACTACTACGA CACCCGCACC TCCGATCTGC TGCTGGACCG GGGACTTCCA CCAACGACCG GCGTAACGAC GGTGAAACAG AACATCGGCA AAACCCGCAA CCGGGGCCTT GAGCTGTCGC TGGGAAGTAC CAACATCCGC ACCCAGAACC TGACCTGGAG CAGCAACGTC ACCTTCACTA AGAACAAGGA AGAAATCACC GAACTGGTGA CCGGCTCCAA CGACATCGGC AACGGCTGGT TCATTGGCTC GCCCATCAGC GTGTATTATG ACTACGAGAA ACTAGGCATC TGGCAAACTT CGGAAGCCGA CCTGGCCGCC AAACTCGCGC CCACGCAGCT ACCCGGCGAA ATCAAAGTCA AAGACCAGAA CAACGATGGC AAGATCGACG CCGTCAACGA CCGGATTATC CTGGGAACAC CCCGCCCAAA ATGGAGTGGC GGCTTCGACA ACACGGTTAA ATTCAAAGGA TTCGATCTGA ACGTGTTCCT GTATGCCCGT GTGGGCCAGA TGATCAACTC CGACCGGTCG GCGCGTTTCG ACCAGCAGGG AGTCGGCAAC AGCACGGCTG GGCTAGACTA CTGGACGCCC GAGAATTCAA CCAACGCCTA TCCGCGCCCG AACAAGAACG GCGGTTTGAA ATACCTCTCC ACGCTGGGTT ATCAGGACGG CACCTACGCC CGCATCCGGA ATATTACGCT GGCGTATAAC GTCCCAGTCA AAGTGCTTCC CAAGGTGGTT CGGGGCGTTC GCGTGTATGT AACGGGTAAG AACCTGGTCA CCTTCACGAA GCTGAATTAC GACCCCGAGC GGGGTGGTTC GGAAAACTTC CCCATGACCA AACTGTACGT CTTCGGCCTG AACGTCAACT TATAA
|
Protein sequence | MRQHEPALVN PDVHRTDCES GPDKPDCFAD FLSNKTLKGM KKRLPVPTGG LPGWSADLLP RLMNLSLTQL FLMIACTSFS FAFDGQAQEL MNRPVTLKVE GQRLRVVLAQ IEQQTTARFV YSSKSIGVDR PITITTRDKR LADVLTELLR PLKLSYRMVG GQIVLESDAD AHSLVTPANE AADRALSGMV TDEKNAALPG VSVVIKGSNR GSTTDANGQF KITVPDGNAV TLTFSFVGYQ SQDVVVGSKT TVNVSMVPDV SALDEVVVIG YGAVRKKDLT GSVVQLKSEQ LKEVPTSNVL EAAQGKIAGA DITRSSGQAG ARINISIRGN RSIGGNNSPL IIVDGIQYSN LEDINANDIE TMDVLKDASS TAIYGSRGSN GVILITTKKG KLGKPDISFN AYSGISQVTM YPKAMDITGF RDFKREAWRA AGIWKSPADD AAIFTNVAEY DALQKGLWTD YQDALIHNGL QQNYQVGIRS GTDRLKSYVS VDYFNEKGIL KLDELSRYTG RLNVDFTIND WMKIGLQSQL TYYNQSVRRD PLNQANKISP LGSLYDANGN FNFIMLDGQT ANPLSDEQPN VFNNSVLTTR VLTNGYLELT PFKGFSFRST LGVNLASIRD GAYSSPKSID RSLTGKSLST YNTSNGRTVN WENVMTYQRT FGQHAVTMTG IASYLGNTSD NSAASGVNQL LPSQLFYSLG SATEEIKINS AFSKNNLVSF AARLNYAFRD RYLLTLTARE DGSSKLAAGN KWTFFPSAAF AWRVIEEKFM QDVKGLSDLK IRASYGVAGN DPSGPYATQT TLTRLAFGFD DISAPAYTFS RNVGNTALGW ELSNTKNLGV DFGLFNGRVN ASLDYYDTRT SDLLLDRGLP PTTGVTTVKQ NIGKTRNRGL ELSLGSTNIR TQNLTWSSNV TFTKNKEEIT ELVTGSNDIG NGWFIGSPIS VYYDYEKLGI WQTSEADLAA KLAPTQLPGE IKVKDQNNDG KIDAVNDRII LGTPRPKWSG GFDNTVKFKG FDLNVFLYAR VGQMINSDRS ARFDQQGVGN STAGLDYWTP ENSTNAYPRP NKNGGLKYLS TLGYQDGTYA RIRNITLAYN VPVKVLPKVV RGVRVYVTGK NLVTFTKLNY DPERGGSENF PMTKLYVFGL NVNL
|
| |