Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3946 |
Symbol | |
ID | 8727704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4729714 |
End bp | 4732845 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003388735 |
Protein GI | 284038805 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA CTCTAATCTT TCTGGGTTGT CTGCTACTGA GTTGTGGCGT TACGCTGGCC CAGACAAACC GAATAACCGG TAAGGTAACC GGCCCCGATA AACAGGGGTT ACCCGGCGTA AACGTCCTCG TGGGCGGCAC TTCGGTCGGA ACGGCCACTG ATGCGTCGGG AAATTACGCC ATCAATGCAC CCGCAAATGC TTCGCTGATT TTTTCATTCA TCAGCTATGT AACGCAAACT GTACCCGTCA ATAACCGATC CATAGTGAAC GTAGAACTGG CCGAAGATGC CAAGGCCATC GATGAGGTGG TTGTTACGGC CCTCGGTATC AAGCGGGAAG CCAAAACGCT GGGGTATGCC ACCGCAACGG TCAACGCCGA GCAGATCAAC GTAAACCGGA CGCCCAATTT TATGAGTGGA CTACAGGGCA AAATGGCTGG TGTCAACATC ACGTCTATGG GTACTGGTCC CGCCGGAACG GCCAAAATCC GCATTCGGGG GCAGTCGTCG TTCAGCGGAC AGAACAACCC GCTGATTGTC GTCAACGGGG TGCCCATCGA CAACTCCAAC TACTCCCTCG GGGGCGATTT CGGCAACCGG GCTTCCAACA GTTCGGATGG GGGCGATGGA CTGAGCAGTA TCAATCCCGA CGATATCGAG ACCATGACCG TACTGAAAGG TGCTACGGCG GCTGCCCTGT ACGGCTCACG CGCTAAAGAC GGCGTGGTCA TGATTACGAC CAAAAGCCGG GGGTCGGGCA AAGGCTTCGG TGTGACCTAC AACGCCAACT TCACCACCGA TACCCCGTTG GATTTTACGG ACTTTCAGTA CGAATACGGG CAGGGCGAAG GCGGCAAACG ACCTACAACG GAGAAACCGA CCTCGGGCGT ATGGAGTTTC GGCGAGAAGT TCCAGCCGGG CATGACACAG ATTCTGTTCG ATAACAAAAC GTACCCCTAT GAACCCGTTT ATAATCGGGT ACGGCAGTTT TACCGCGTAG GCACCAACTT CACGAATACC GTAACCGTGT CGAACAACGG CCAGAATGGC GGTTTCAGCC TGTCGTTTGG CAACACCGAC AACCGGGGTA TCATGGAAAA CAACACCTTC AACCGAAAGG TGATCAACCT GGGATTCACG CAGAACATCA CGCAAAAGCT AACCGCGTTG GGCAACATTA ATTACTCGCT GGAAAACAAC GTCAATCCGC CCCAGCTAAA CACCCAGGAC CTGTCTGTAT CGACGGTGAT TTTTACGCTG GCCAACTCCA TGCCTTTCGA CGCACTGCGC GACAACCAGA CATTGCCCAA CGGCGATGAG TTCGTCTTCT CCCGTTTTCT GGTTCGGAAT AACCCCTACT ATTCCATGAG CCACAAATTC GAGAACGTCA ATCGCAGTCG ACTGTTCGGG AACGTTGCCC TTAAATACCA GTTCACCGAC TGGCTGTACG CACAGGCCCG CCTGGCCCAG GATTATTATG TGCGTAACCA GGAGTATAAC ATCCCGAACG GCTACGCCCC CATTGCCCGC GCCCCGGTGG GCTTTGTGAA CGGTTCCTAC ACGCAGGATG TCCGCCAGAA CACCGAACGG AACCTCGACT TGATTCTCGG AATGAACAAG ACGTTTGGCA CGATCGGCGT GGACGTCACG CTGGGTGGCA ACCACCGCTA TGCACGCAAC GACTACAACA GTGTAACGGT GCAGGATTTT GTACAGCCGG GCCTGTATAC CGTCATGAAC GGCCGTATTA AAGACCCGCT CTACAGTCTG GCCGAGAAAA AGATCAACTC CGTGTTTGGG GCGGCTACGG TATCGTACAA GGATTTCCTG TTTTTGAGTG CTACGGCCCG CAATGACTGG TTTTCGACGC TGGCCCCCTC CAACCGGAGC ATTCTGTACC CATCCGTTAC GAGTAGTTTC GTATTCTCGC AGGCGTTCGA CAACATGCCC GCCTGGCTGT CGTTCGGAAA GCTACGCGCG GCTTATGCGC AGGTTGGCTC GGACAACGTC GACCCCTATT CCAACGCGCT GTATTTTTCC GTTGATAACA ACTCATTCCC GAATCCTTCT GGCGCTCTGG TACCCGTGGG CGGCATCAAT GCAACGGTTG TTCCCAACAA AAACCTGCGT CCGCTGCGCA TTCAGGAGGC CGAAGTGGGG CTGGAGCTGA AGCTGTTTGC CAATAAAGTC GGCTTTGATT TCACGTACTA CCACAAAACG ACGGACGACC AGATTCTGGC CGCTCAGGTT TCCGATGCCT CGTCGTACAC CAGCAAGCTG ATCAACGTAG GCCGGAGTAT GAACCAGGGC CTTGAGATGC TGCTGACCTT TTCGCCCGTC CGGACGACCA CGTTCCGTTG GGATGTCAGC GCCAACGTGT CGTACAATAC ATCCAAAGTG CTGAAACTTG GTTTATCGCC CAACGACACT GTCATTACCG TCAGCAGCGG GGGCGGCCGG ACGCTGAATC AGGTAGTGGG CAAGCCCATT GGGCAGTTGT ACACCTTCAC TTACCTGCGG GATGCGCAGG GTCGGCAGGT TTTCGACGCC AACAGCGGGA TGCCGCTGCG CAACAATACC CTTAAGAATG TGGGCAACGC CCTGCCAAGT TATTTTGGCG GTATCACGAA CACCTTCACG TATCGAGGCA TTGTGCTGTC GGCGCTGATC GACTTCAAGC TGGGCCATAA ACTGATTGCG GGCCGCAACA TCAACTACAT GCGCCACGGC CTGTCGAAGC GGACATTACC GGGTCGGGAT GTGGGGTATG TAATTGGCAA CGGGGTCAAC CCAAACGGAG AAATCAACCA GACGAGAGCC GCCGTACAAC CTTTCTACGA ATCCATTAAC CCGCTGGGCA TCAACGAAGA TTTCGTGTTC AACGCTGGGT TCTGGAAACT GCGTCAGATA TCGCTGGGCT ATGACTTCGA CAAACTCCTA CCCCAGCGTT TTTTCCTGAA AGGTCTGCGG TTAAATGCGG TGGCCAACAA CGTCCTGATC ATCAAAAAGT GGACCGAGAA TATGGACCCC GAAGAAGTGC TGGTGTCGTC GGACAACGCC GTGGGACTGG ATTTCTGGCC GGGCCTGCCG CCTACCCGCA GCATAGGCTT CAACCTCAAC GCCCGCTTTT GA
|
Protein sequence | MNKTLIFLGC LLLSCGVTLA QTNRITGKVT GPDKQGLPGV NVLVGGTSVG TATDASGNYA INAPANASLI FSFISYVTQT VPVNNRSIVN VELAEDAKAI DEVVVTALGI KREAKTLGYA TATVNAEQIN VNRTPNFMSG LQGKMAGVNI TSMGTGPAGT AKIRIRGQSS FSGQNNPLIV VNGVPIDNSN YSLGGDFGNR ASNSSDGGDG LSSINPDDIE TMTVLKGATA AALYGSRAKD GVVMITTKSR GSGKGFGVTY NANFTTDTPL DFTDFQYEYG QGEGGKRPTT EKPTSGVWSF GEKFQPGMTQ ILFDNKTYPY EPVYNRVRQF YRVGTNFTNT VTVSNNGQNG GFSLSFGNTD NRGIMENNTF NRKVINLGFT QNITQKLTAL GNINYSLENN VNPPQLNTQD LSVSTVIFTL ANSMPFDALR DNQTLPNGDE FVFSRFLVRN NPYYSMSHKF ENVNRSRLFG NVALKYQFTD WLYAQARLAQ DYYVRNQEYN IPNGYAPIAR APVGFVNGSY TQDVRQNTER NLDLILGMNK TFGTIGVDVT LGGNHRYARN DYNSVTVQDF VQPGLYTVMN GRIKDPLYSL AEKKINSVFG AATVSYKDFL FLSATARNDW FSTLAPSNRS ILYPSVTSSF VFSQAFDNMP AWLSFGKLRA AYAQVGSDNV DPYSNALYFS VDNNSFPNPS GALVPVGGIN ATVVPNKNLR PLRIQEAEVG LELKLFANKV GFDFTYYHKT TDDQILAAQV SDASSYTSKL INVGRSMNQG LEMLLTFSPV RTTTFRWDVS ANVSYNTSKV LKLGLSPNDT VITVSSGGGR TLNQVVGKPI GQLYTFTYLR DAQGRQVFDA NSGMPLRNNT LKNVGNALPS YFGGITNTFT YRGIVLSALI DFKLGHKLIA GRNINYMRHG LSKRTLPGRD VGYVIGNGVN PNGEINQTRA AVQPFYESIN PLGINEDFVF NAGFWKLRQI SLGYDFDKLL PQRFFLKGLR LNAVANNVLI IKKWTENMDP EEVLVSSDNA VGLDFWPGLP PTRSIGFNLN ARF
|
| |