Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0441 |
Symbol | |
ID | 8724169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 547991 |
End bp | 551095 |
Gene Length | 3105 bp |
Protein Length | 1034 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003385304 |
Protein GI | 284035374 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000408448 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTCAA CAATTACCCA ACCACCACGC CTGGCGTGGC TTCCTCTGCT TGGCTTAGCG GCACTTACGC TGGTAAGCCA ACCGGCGTTT AGTGCGCCAC CAACAGTAAA ACTTAAGTTA GCCAGCCCAA CTCAGGAACG CTCCGTTGCT GGTAAAGTAT TATCAGGCGA TGATAACACT GGATTACCGG GTGTAAGCGT TGCCGTGAAG GGCACCACGC GCGGTACAAC TACTGACGCT AACGGCGAGT ACAAAATCAG CATACCTAAC GAACGGGCTG TTCTGGTTTT CTCCGCTGTT GGCTTTATTA GCCAGGAAGT TACTATCGGC AATAAGTCAA CGGTTAATCT AACCCTAAGC ACTGATACAC GCGCCCTGAA TGAAGTCGTT GTTATTGGCT ACGGTTCTCA GAAAAAGAGC CAGACAACGG GAGCTATTTC GTCAGTTACG CCAAAGCAAA TTACAGAACA GCCTATTACC AACATTGGTC AGGCCATGCA AGGCCGGGTA GCAGGTGTCG ACGTAGCACA GTCGGGTAGC CGACCAGGTT CCGTACCAAC AATCCGGGTT CGTGGGCGTC GTTCGTTCAA TGCCGGTAAC GACCCGCTCT ATGTAGTTGA CGGGATTCCC CTTTCAGAAG GTTATGAAGA CATTAACCCG AACGATGTGG GTTCGATGGA AATCCTGAAA GATGCTACCG CAACGGCCAT TTATGGTGCC AGAGGTGCCA ATGGCGTTAT TCTGGTTACA ACCAAGCGGG GTAATCCGGT TGGTAAAACA ACCATCAGCT ACGATAACTA CGTCGGTTTT ACCGATGCGC TGGATAAAGT AAAGCTGTTC AGTGGCTCTG AATTTGCCGA ATTTGTTCGG GAAGCTTACC GGACTACAGG CAACTACAAA GACGCGAACG GCAATCCCGT TCCAACGGGT GTGGCCGATC CATATGCCGA CTCCAAAGTG GCGGTACTGG GTGGTGACCC GAACGTTGCA GCTGGCCTTG CCGCCAACCG GAATACTGAC TGGCAGTCGT TGATTCTGAA GCAGGGAGTT CAGCAGAATC ACTCGTTGGG TATTCAGGGC GGCAACGAGA AAACGCAGTT TTATATATCG GCTGGTTTTT TCCAGGACAA AGGGATTATG CCTGGTCTGG ACTTTACCCG TCAGTCGCTG CGTGCCAATA TTGATCACCA GATCAACAAG GCTCTTAAAG TGGGGATTGC CTCGTATATG ATGTATAGCG TACGGAACGG AGAGACGCTG AACCCCTATA ACTTTACCCT TCAGCAAAAT CCGCTTGGTC GGCCTTACGA CGATAACGGT AACCTGATCT TCTCGCCTAC GAACGATGCG CTGCTTACCA ATCCACTCGC CGAAGTTGTG CCGGGTGCTC AGGTAGAGAA TAGAAAGAAA TACCGCATTT TCAACAGCGT TTACGCAGAA GTAAACATCC TTGAGGGCTT AAAATACCGC GTTAACTTCG GGCCAGACTT TACCATCAAC CGATTTGGCC GCTTTATCGG TGCGCAAACA AACGCCCGGA AAGGTGGTGA CCCACAGGCG CAGACGGCCA GTGCATTTGG CTTCAACTAC ACGCTGGAGA ACGTGGTGAC GTATAACAAA AAAGTGGGCG ATCACAACTT CGGTTTTACC GCCCTGCAAT CCATTCAGCG GGATAACTTC GAGCAGAATA ACATCTCTGT TCAAGGTGTG CCAGCCGAAT CGCAGCAGTT CTACAATGTA GGCAACGCCA GTGCTGTATT GGGAGTAGGT AGTGGATTGC GGCAGTGGAC CATTAACTCG TACATGGGTC GTATCAACTA CGATTATAAA GATAAGTACC TGGTAACCGC TACGTTGCGC CGGGACGGAT CGAGCCGATT TGGCGAAAAT ACCAAATATG GTAATTTCCC CGGTATCGCC CTAGGCTGGA ATGTCAGCAA CGAAGACTTC ATGAAGGGAT CTAGCTGGGT CGATCTGCTA AAAATCCGGG CCAGCCGTGG TTCGGTAGGT AACCAGGGTG TAGCTCCCTA TCAAACGCAG GGATTATTGG ACCGCACGGT ATATGCCTTT GGCAATACAC CCGCTTATGG CTATCGCCCT AACACGATTG GCAACCCTGA TTTGCGCTGG GAAACGTCAA CCAGCACAAA CATTGGTATT GACTTCAGTC TCTGGCGGGG CCGGGTATCA GGTGCTATTG AATTATATAA TACCCGCACG ACCGACCTGC TACTATCCGA TCTGCTGCCT ACATCAATCG GTTTCAACTC TGTGACCCGC AACATTGGCG AGACCCAGAA TAAAGGGATA GAAGTGAGTG TATCAACGGT GAACGTGAAT TCAAAAAGTG GATTCAAATG GACATCCGAC ATTGTGTTCT CTAAAAATTC GGAAGCCATC ATCTCCCTTT TCAACGGACC GGTTGATGAC GTGGGTAACA AACGCTTCAT TGGCAAGCCT TTGACGGCCA TGTATGATTA CAAAAAAGCG GGTATCTGGC AAACCAGTGA AGCAGATGCC GCTAAATCCT ACCAGAGTGC AGTTGGCCAG ATTAAAGTGC AGGACACCAA CGGCGATGGT AAAATCACGG CTGATGACCG GGTATACTTA GGCTCTGACA TTCCAACCTG GAGTGGCGGT ATCACGAACC GGTTCAGCTA TAAAGGATTT GACCTGAACT TCTTTATTTA TGCCCGTATT GGCCAGACCA TTCTAAGCGG TTTCCACCGC GACAACAACC AGTTGGCTGG TCGTTATGAG CAAATCAAAG TTGACTACTG GACACCTAAC AACCCAACGA ACGAGTTCCC ACGGCCTAAC TCCAGCCAGG AGTTCCCGGT CTATAACTCA GCTATCATCT ATTTCGATGG ATCGTTTGTG AAAGTACGGA ACATCAACTT TGGTTATACG TTCCCATCGA GCATTACGTC GAAACTGCGC ATGCAGTCGC TACGTCTGTT CAGTAGCATT CAGCAGCCGT TCATCTTCTC GTCGTACCGG TCGAAGTACA ACGGTGTTGA CCCAGAGACA AGCGATGGCA CGGTAAGCAA CGGTGTTACG CCTGCTACCC GCGTAGTAAC CTTTGGTTTG AACGTCAAAT TCTAA
|
Protein sequence | MHSTITQPPR LAWLPLLGLA ALTLVSQPAF SAPPTVKLKL ASPTQERSVA GKVLSGDDNT GLPGVSVAVK GTTRGTTTDA NGEYKISIPN ERAVLVFSAV GFISQEVTIG NKSTVNLTLS TDTRALNEVV VIGYGSQKKS QTTGAISSVT PKQITEQPIT NIGQAMQGRV AGVDVAQSGS RPGSVPTIRV RGRRSFNAGN DPLYVVDGIP LSEGYEDINP NDVGSMEILK DATATAIYGA RGANGVILVT TKRGNPVGKT TISYDNYVGF TDALDKVKLF SGSEFAEFVR EAYRTTGNYK DANGNPVPTG VADPYADSKV AVLGGDPNVA AGLAANRNTD WQSLILKQGV QQNHSLGIQG GNEKTQFYIS AGFFQDKGIM PGLDFTRQSL RANIDHQINK ALKVGIASYM MYSVRNGETL NPYNFTLQQN PLGRPYDDNG NLIFSPTNDA LLTNPLAEVV PGAQVENRKK YRIFNSVYAE VNILEGLKYR VNFGPDFTIN RFGRFIGAQT NARKGGDPQA QTASAFGFNY TLENVVTYNK KVGDHNFGFT ALQSIQRDNF EQNNISVQGV PAESQQFYNV GNASAVLGVG SGLRQWTINS YMGRINYDYK DKYLVTATLR RDGSSRFGEN TKYGNFPGIA LGWNVSNEDF MKGSSWVDLL KIRASRGSVG NQGVAPYQTQ GLLDRTVYAF GNTPAYGYRP NTIGNPDLRW ETSTSTNIGI DFSLWRGRVS GAIELYNTRT TDLLLSDLLP TSIGFNSVTR NIGETQNKGI EVSVSTVNVN SKSGFKWTSD IVFSKNSEAI ISLFNGPVDD VGNKRFIGKP LTAMYDYKKA GIWQTSEADA AKSYQSAVGQ IKVQDTNGDG KITADDRVYL GSDIPTWSGG ITNRFSYKGF DLNFFIYARI GQTILSGFHR DNNQLAGRYE QIKVDYWTPN NPTNEFPRPN SSQEFPVYNS AIIYFDGSFV KVRNINFGYT FPSSITSKLR MQSLRLFSSI QQPFIFSSYR SKYNGVDPET SDGTVSNGVT PATRVVTFGL NVKF
|
| |