Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_6433 |
Symbol | |
ID | 8730217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 7796368 |
End bp | 7799409 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003391189 |
Protein GI | 284041259 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.234793 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATTA AATGGCTACT TTTTTGTGGC GTCCTGCTTC CCGCACAGGG AGTACTCGGC CAGCAAATTG CCAGCAATAG CAGTCCGCTA TATGCTTCTG TCAGATCAAC GGTTACCGGA CTACGTGCGT CGGAGACGGC GGTGATTACC GTGACGGGAA AAGTGACCGA CGAGAAAGGC GACGCGCTGC CCGGCGCTAC CGTTTCGCTG AAAGGCGGCT CCGTCGGGGC AAATACCGAC GCCGACGGCA ACTACACCCT CCGTATCCCC GACGGCACCC CGAATCCGGT GCTGGTCTTT TCGTTTATCG GCTACACGTT GCAGGAAGTT GCCATTGGCA ACCAGACCGT GGTGAACGTA CAACTCAAAG GTGATGCAAA ATCCCTGAAC GAAGTCGTCG TGGTCGGTTA CGGTACCCAG AAACGTTCAG ACATTACCGG TTCGGTGGCG TCTGTACCCA AAACCCGTTT GTCGCAGTTA CCCGTTACCA ACGTGTTGCA GGCCATTCAA GGCTCAGTGG CGGGTGTCAA CATCTCGCAG TCGTCGTCGG TACCGGGAGC CGCGCCGTCA ACGACCATCC GCGGGCAGAA CTCCATCAAC GCCAACTCCG GCCCCTATGT GGTTGTCGAC GGTATTCCGC TGAGCAAAAC GGGCGGCTCG CTGAGCGACA TCAACCCCAA CGACATCGAG TCGATGGAAG TACTGAAAGA TGCCTCGGCG GTGGCTATTT ACGGTACCAA CGGCGCCAAC GGCGTTATTC TTGTGACAAC CAAACGGGGT AACACCGGCA AGCCGACCAT CCGGTATAAC AACTACGTGG GTGTCGAAAA CTTCGCACAT ATGCTGCGTC CGCGCAACGG CGCTGAGTAC GTGCAGAAGT ACGCCGATTA CATGGCCCAG ACCGGCCAGA AACTGGTCAA TCCGGTGCCT AACTACGACG AGTTGGCCAA CTACAACGCG GGCATCACCA CCGACTGGAT GAAAGAAGCG ACCCAGACGG GCGTGTTGCA GGACCACAAC CTGAGCATTT CGGGTGGATC GCCCAATGTG CGGTACTTCA TCTCGGGTGA GTTTCTGGAT CAGAAAGGCG TTATCAAAGG CTATCAGTAC AAGCGGGCCA GTTTCCGCTC CAATCTGGAC GTTACCCTGA CGGATTACCT GACGGTGGGC ACCTCGCTGT TCATTGCCAA CAGCAACCGC GACGGCGGCC GCGCCAACAT GCTCAACGCA TCGGCCATGA GCCCCTACGG GCAGGAGTAC AACGCCGACG GGACCTACCG CATTTACCCC ATGTTCCCGG AGCAGTTGTA TACCAACCCA ATGATTGGCC TGACCGTCGA CCGCGTTGAC CGCAACACCA ACCTGAACGG GAACGCCTAC CTCGAACTGA AACTGCCGGG CAAACTGAAC GGGCTGAAAT ACCGCATGAA CCTCGGTTAC TCGTACATTC CGGCCCGCAC GGCGAGTTAT AATGGCCGGG CGGCCAACGA CCTGCTCGGT ACGGCCAACA CGTTCTTCTC CGAAACCAAC AGCTTCACCC TAGAAAATAT CCTGTCGTAC AGCCGGGATT TCGGCAAGAA CCACTTCGAC TTCACGGGTC TGTACAGCGC CCAGCAGCGG AAATACGCCA CCGCAACCGG AACGGCAACG GGCTTTGTCA ACGACCAGTT GTCGTTCAAT AACCTGGGGG CCGGTGCCAC GCAGTCGAGT AACTCCTATG CCGACCGCTA CGGCCTGAAC TCGCAAATGG GTCGGGTCAA CTACTCGTAC GACAGCCGGT ATCTGTTCAC CGTTACGGCC CGTCGCGATG GTTCGTCAGT GTTCGGAGCC AATACTACCA AGTACGGCCT GTTTCCCTCG GCGGCTATCG GCTGGAACAT CAGCAATGAG GCTTTCATGA AAAACGTGAA CCTGGTGAGC AACCTGAAAC TGCGTTTCTC GTACGGCAAA TCGGGTAACG AAGCCATCAG CGTATACCGG ACCATTACGA CCGACAACAC CGTTCGGTCG CCCTTCAACG GCGTGAGCAC CATCGGCGCA CAGCCGGGCA ACCTGGGCAA CGCCAATCTG CAATGGGAGA CGACCCTCAG CCGCAACATC GGCGTAGATT TCGGTATCCT CAACAACCGC ATCAACGGTA GCCTTGATCT GTACAAGAAT AACACCAAAG GATTGCTCCT ATTGCGCAGC CTACCCATCC TGACCGGCTA TTCGAGCGTA TACGATAACC TCGGCGAAAC GTCGAACACC GGTATCGAAC TCACGCTGAA CACCCGAAAC GTAACGAATG GCGATTTCAA ATGGGAAAGC ACCGTTGTAT TTGCCTCCAA CCGCAACCGC ATTCTGGACC TCTACGGCGA CAAGAAAGAC GACCTCGGAA ACCGCTGGTT CATTGGTCAG CCCATCAGCG TGGTGTACGA CTACAAACTG GCCGGTGTCT GGCAAACGGG CGAAGACGCG TCCTCGCAGG ACCCCGGCGC GGTGGCCGGT GACCTGAAAT TTGCCGACCT CAACGGCGAC AAGAAGATCA CCGCCGACGG CGACCGGATG ATTCTGGGCC AGACGGCTCC CAAGTGGACC GGTGGCCTGA CGAACACCTT CCATTACAAG AATTTCAACC TCAACGTGTT TATCCAGACG GTTCAGGGCA TAACCCGCAA TAACGCCGAC CTGACCTACG CCGACGAAAC CGGCAAACGG AACACGCCCA TCGACGTGGG GTACTGGACG GCCAACAACA AGAGCAACAC CCGCCCGTCG CTGGCGTTCA AGAACCCACG GGGCTACGGC TATGCGTCGG ATGCGAGCTA CACCCGTATC AAGGACGTAA CGCTGAGCTA CGTCTTCGAT CAGAAACTGC TTGATAAACT GCACCTGGGC AGCCTGACAG TTTATGCCAG TGGTCGTAAC CTGTACACCT TTACCAACTG GATTGGCTGG GACCCCGAAG CGGTGCAGTC TTCCCGCGGC TCCGGCGACT GGACGAACAA CTACCCGCTG ACCCGCTCTT TTGTGATGGG CCTTAACATC AGCCTTCGCT AA
|
Protein sequence | MDIKWLLFCG VLLPAQGVLG QQIASNSSPL YASVRSTVTG LRASETAVIT VTGKVTDEKG DALPGATVSL KGGSVGANTD ADGNYTLRIP DGTPNPVLVF SFIGYTLQEV AIGNQTVVNV QLKGDAKSLN EVVVVGYGTQ KRSDITGSVA SVPKTRLSQL PVTNVLQAIQ GSVAGVNISQ SSSVPGAAPS TTIRGQNSIN ANSGPYVVVD GIPLSKTGGS LSDINPNDIE SMEVLKDASA VAIYGTNGAN GVILVTTKRG NTGKPTIRYN NYVGVENFAH MLRPRNGAEY VQKYADYMAQ TGQKLVNPVP NYDELANYNA GITTDWMKEA TQTGVLQDHN LSISGGSPNV RYFISGEFLD QKGVIKGYQY KRASFRSNLD VTLTDYLTVG TSLFIANSNR DGGRANMLNA SAMSPYGQEY NADGTYRIYP MFPEQLYTNP MIGLTVDRVD RNTNLNGNAY LELKLPGKLN GLKYRMNLGY SYIPARTASY NGRAANDLLG TANTFFSETN SFTLENILSY SRDFGKNHFD FTGLYSAQQR KYATATGTAT GFVNDQLSFN NLGAGATQSS NSYADRYGLN SQMGRVNYSY DSRYLFTVTA RRDGSSVFGA NTTKYGLFPS AAIGWNISNE AFMKNVNLVS NLKLRFSYGK SGNEAISVYR TITTDNTVRS PFNGVSTIGA QPGNLGNANL QWETTLSRNI GVDFGILNNR INGSLDLYKN NTKGLLLLRS LPILTGYSSV YDNLGETSNT GIELTLNTRN VTNGDFKWES TVVFASNRNR ILDLYGDKKD DLGNRWFIGQ PISVVYDYKL AGVWQTGEDA SSQDPGAVAG DLKFADLNGD KKITADGDRM ILGQTAPKWT GGLTNTFHYK NFNLNVFIQT VQGITRNNAD LTYADETGKR NTPIDVGYWT ANNKSNTRPS LAFKNPRGYG YASDASYTRI KDVTLSYVFD QKLLDKLHLG SLTVYASGRN LYTFTNWIGW DPEAVQSSRG SGDWTNNYPL TRSFVMGLNI SLR
|
| |