Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3960 |
Symbol | |
ID | 8727718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4752627 |
End bp | 4756055 |
Gene Length | 3429 bp |
Protein Length | 1142 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003388749 |
Protein GI | 284038819 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAAC GAGTACTTTA TCAAAAGACT TTACAGAAAA TCATGCGCAT TGGCATCTAC CAGGCCTTTC TTTCTGTCGC TTTTGCCACC TTCTCCTACG CCGTCGAGGT TAGTGGGCAG AAAGTGCTGG AGCAAAAAGT AACCCTTCAA CTGTCCAACG CCGATGTTGA AAAAGTGCTG GACAAAATTG AGACCGTTAC GAACGTCAAA TTTCTGTACA ACCCGCAAAT CTTCGGGAAC GACACCAAAG CCACCTATAA ATTTCGCAAC GAGCCGCTCT CCGATGTTCT GAACAAGATT CTAAACCGGT ATCAGGTTAC CTACGAAGTA CTCCAGGACC GGATCATTCT GAAACGGCTG GAGTCGTTCG AAATCAGAGT ACCCGTTCAG CAGGAAGCCC CCAAACGGAA AGTAGCGGGT ATCGTTCTCG ACGAAAACGG GGCCGGATTG CCGGGCGTGA GCGTCGTAAT CAAAGAAGCC CAGAAAGGGA CCACCACGGG TGCTGACGGC CGTTTTTCGC TCGATGTCCC AGACGATAAC GCTGTACTGG TGTTCAGCTT TGTGGGCTAC AAACGGCAGG AAGTTACCCT AGGTAACCAA AGCAACCTCT CGGTAACGCT GGCTCCGGAA GCTAGCACCC TCGGCGAAGT GGTGGTAACG GCCCTCGGTA TTGCGCGGGA GAAAAAAGCC CTCGCCTATG CCGTGTCGGA AGTGAAAGGC AGCGAGTTTA CACAGGCCCG CGAAAACAAC GTGGCCAACG CCCTGACGGG TAAGATTGCC GGGGTCAACG CAACGGGTAT GGCGACCGGC CCCGGTGGAT CAAGCCGCAT CATCATCCGG GGTAATGGCT CCCTGAACGG CAACAACCAG CCGCTGTACG TCATCAACGG GATGCCGATG GATAACAGCA CCCCCGGCGG CACACAAGCC GACGGCAACG GCATGAACGT TGACCGGGGT GACGGTATCG GCGGTATCAA CCCCGACGAT ATCGAGTCCA TCAGCGTACT CAAAGGTGGT CCCGCTGCGG CTCTGTACGG AGCCCGTGCC TCCAATGGCG TTATTCTGAT TACGACAAAA AAAGGCCGCG CTCAGAAAGG TGTCGGCGTA GAGATTAACA GCAATACGAC CTTCGAAGAC ATCGCCGTGA TTCCAAATTG GCAGTACGAA TACGGCCAGG GACTCGATGG CAGAAAACCA ACCACGGTAA CGGAGGCCAA AAGCACCGGC CGACTGTCTT ACGGGGCCAA AATGGATGGG CTACCCACAA TTCAGGTGGA TGGGCAAATG CACCCCTATT CGCCCCAGAG AAACAATCTG AAAAACTTTT ACCGCACAGG CACCAACTAC ATTAACTCGC TGGCCTTCAC CGGCGGCAGC GAAACGGTAA ACTTCCGGCT GGGACTCAAC AACACCCAGT CGAACAGCAT CGTACCCAAC TCGTCGTTCT CCCGGCGGAT CGCCAACCTG AACCTGAACG CGTTTCTGGG CAAGAAACTG AGCGTTGAAA CGGTATTCCA GTACAACGTG GAAGAGGGCA TCAACCGACC GAAAGTTGGG TATGCCGACT TCAACCCGCA CTGGGCCACT TACCTAATCG CCAACGTGGT CGACATTCGT AGTCTGGCAC CGGGATACGA CCCCGTGACG GGCAAAGAGA TGGAATGGAA CCCTGTTCCG GCCGCGCCGA ACCCGTATTT TGTGATCAAC AAGTTTAAGA ATAACGACAC CAAACACCGG TTTATCAGCC AGGGAAGCAT CCGCTACGAT ATTCTGGACA ACCTGTTCCT CAAAGGCAGC GTCAGCCAGG ACTTTTACAG CTTTTCGTCG GAATACGTCC AGCCTACCAA TAACGCCTAC CAGCCGCTGG GCACCTACGA AGCCCGTAAA ACAAGCTCCT CGGAAACCAA TGGTATGCTG ACGCTGAACT ACAACACGAC CTTTTTTAAG GACCTTACCT TTTCAGCCCT GCTGGGCGGC AATGTCCAGA AAGCGATCTT CGACCAGACA ACCATAGCTG GCAGTGAATT TACGGTACCG TATTTCTACA GCTACACCAA CCTCGCGACA TCGACCACAA CGCCAACCTA CCTGAAAAGC GCCATCAATT CGGTGTTTGG CTCGGCCGAT TTCGGGTATA AAAACGTCGC TTATCTGACG CTGTCGGGTC GGCAGGACTG GTTCTCGGTG CTGAACCCGA AGAGTAACCA CATCTTCTAC CCATCTGTGG GCGGCTCGTT CATTCTGTCT GATGCGTTCC AGTTGCCCAA GGCGGTGAGC TTTGCGAAGT TGCGGGCGTC GTGGGCGCAG GTGGGTGGCG CTACGGTCAA CGCCTATCAG ATTTACCAGT ACTATTCCAT GCAGCAGGGC GGTCACAACG GTCGGCCGGT GCAGGTTTTA TCGTCCTCGC AGGTACCCAA CCCCGACCTG AAACCGCTGA CCTCGACTAC GTACGAGGGG GGTATTGAAG CCAAGTTCCT GAACAACCGG CTAGGTATCG ACCTCACGCT CTACAACCGC AAAACCACGG ACGACATCGT GACGACGAAC ATCGCCCTGT CGTCGGGCTA CACCTCGGCG CTGTTGAACG TGGGTGCGTT GAGTAACAAA GGCGTTGAGC TGCTACTGAC TGGCACGCCC GTCAGCAAAG GGCCTTTCTC CTGGGATGTT AGCTACAACA TGGCCTACAA TAAGAGCAAG ATCGAACAGC TGGCCGCAGG CATCACCGGT ATTGATGTTG GTGCGGGCGT AGGCGGTGGT CTGGTTCGGA ACGTACTCAA CCGGCCTTAC GGCACCGTTT GGGGCTACAA CAAGAAGACC GACGCCAACG GCAATGTGGT CTTTAACACA GCCAGCGGGT ATGCGCTTCG GGGCGATTTG CAGGAAATCG GGCAGGGCAC GCCCCCACTC ACGATGGGGA TCACCAACAA CTTCCGATAT AAGAACTTCT CGCTGAACAT CCTGGTCGAC GGTAAATTTG GCAGCATCGT TTACTCGAAC CTATACCAGT ATGCCTACCG CTTTGGTCTG CCGCAGGAAA CCCTGCCCGG CCGCGAAACC GGCATCACCG TCACGGGCGT AACCCCCGAA GGCAATCCGT ACAGCAAAAC ATGGAGCAAG GAAGAGGTCG ATACGTACTA TGACAACGAC AAGAACTACA CCGCCATGTT CATGTTCAAC AACGATTTCG TGAAGCTGCG TCAAGTGATC CTCAGCTACA ATCTGCCCGT TGCCAAACTG CCCTTCCTGA AGCTACAATC GGCCACGATC TCGTTTGTAG CGCGTAACCT GGCTATTCTC TACAAGGATA AAAAGAATCA GTATTTCGAT CCGGAGTCGG GCTATACGAG CACCAACGCA CAGGGGCTGG AGGCTTTCGG CGTACCCAGA ACCCGCAGCC TGGGTGTGAA CTTAATGGTG AAATTCTAA
|
Protein sequence | MSKRVLYQKT LQKIMRIGIY QAFLSVAFAT FSYAVEVSGQ KVLEQKVTLQ LSNADVEKVL DKIETVTNVK FLYNPQIFGN DTKATYKFRN EPLSDVLNKI LNRYQVTYEV LQDRIILKRL ESFEIRVPVQ QEAPKRKVAG IVLDENGAGL PGVSVVIKEA QKGTTTGADG RFSLDVPDDN AVLVFSFVGY KRQEVTLGNQ SNLSVTLAPE ASTLGEVVVT ALGIAREKKA LAYAVSEVKG SEFTQARENN VANALTGKIA GVNATGMATG PGGSSRIIIR GNGSLNGNNQ PLYVINGMPM DNSTPGGTQA DGNGMNVDRG DGIGGINPDD IESISVLKGG PAAALYGARA SNGVILITTK KGRAQKGVGV EINSNTTFED IAVIPNWQYE YGQGLDGRKP TTVTEAKSTG RLSYGAKMDG LPTIQVDGQM HPYSPQRNNL KNFYRTGTNY INSLAFTGGS ETVNFRLGLN NTQSNSIVPN SSFSRRIANL NLNAFLGKKL SVETVFQYNV EEGINRPKVG YADFNPHWAT YLIANVVDIR SLAPGYDPVT GKEMEWNPVP AAPNPYFVIN KFKNNDTKHR FISQGSIRYD ILDNLFLKGS VSQDFYSFSS EYVQPTNNAY QPLGTYEARK TSSSETNGML TLNYNTTFFK DLTFSALLGG NVQKAIFDQT TIAGSEFTVP YFYSYTNLAT STTTPTYLKS AINSVFGSAD FGYKNVAYLT LSGRQDWFSV LNPKSNHIFY PSVGGSFILS DAFQLPKAVS FAKLRASWAQ VGGATVNAYQ IYQYYSMQQG GHNGRPVQVL SSSQVPNPDL KPLTSTTYEG GIEAKFLNNR LGIDLTLYNR KTTDDIVTTN IALSSGYTSA LLNVGALSNK GVELLLTGTP VSKGPFSWDV SYNMAYNKSK IEQLAAGITG IDVGAGVGGG LVRNVLNRPY GTVWGYNKKT DANGNVVFNT ASGYALRGDL QEIGQGTPPL TMGITNNFRY KNFSLNILVD GKFGSIVYSN LYQYAYRFGL PQETLPGRET GITVTGVTPE GNPYSKTWSK EEVDTYYDND KNYTAMFMFN NDFVKLRQVI LSYNLPVAKL PFLKLQSATI SFVARNLAIL YKDKKNQYFD PESGYTSTNA QGLEAFGVPR TRSLGVNLMV KF
|
| |