Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4206 |
Symbol | |
ID | 8727965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5065280 |
End bp | 5068351 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003388990 |
Protein GI | 284039060 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.43352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAAT TACTACAAAT TGGTTTCCTG ATTTTGATAA CCGTATGGGC AACCTACGCA CAGGGTCAGG CCGTTTCAGG CAGAGTTACA TCATCAGACG ACGGTAATCC CCTACCCGGC GTATCTGTCA GTGTCAAAGG AACGACACAG GGAACGTTAA CCGATGCCTC CGGTAACTAC CGCATTAACG CAGGTAACAA CGCGGTAGTC GTTTTCAGCT TTATTGGCTT CACGACTCAG GAAGAAAAGG TGGCGAATCG GTCAGAAATC AATGTCCAGT TAAAAACCGA TGTACGTAAC CTGAGTGAGG TTGTTGTAAC AGGTTACGGG CAGCAGATCA AACGGGACCT AACCGGCAAC ATCGCGAAAG TTAAAGCCGC CGATATTCAG GATCAGCCCG TAACGACCTT CGATCAGGCA TTACAGGGCA AAGCGGCCGG CGTTCAAATC AATTCCGGCT CCGGCAAACT GGGTCAGGGA ATACAGGTTC GGGTACGGGG CCAGTCGTCG GTATCGGCAT CGAACCAACC GCTTTACATC ATCGACGGTA TTCCCGTCAC GACAGACAAC CTAAGTATCA CCAGTTCAGC CACCAATCCT TTGGCCGATA TTAACCCTCA GGATATTGAG TCGGTCGATA TTCTGAAAGA TGCGTCGGCC GGAGCTATTT ACGGTGCCCG GGCGGCCAAT GGTGTCGTGC TGATTACCAC CAAACGCGGC AAAGCCGGAC GTACCAACGT TAATTTCGGT GCTCAGTACG GGTCAAGCAA GCCAACCCGC AAGCTGGAGT TTCTGAATAC GGAACAGTAC GTTAAGTTTT ACAATCAGGC AGCGGCCAAT TCCGACCGAA TTGAGGGGCT AGATCCGAGC GACCCTGACT CGTATACCAC GTATATGAAG GATTTCTACC AGACGCAGGG ATTAGGTACC TACGGCACGT CTAACCAGGC AAGTACCAAC TGGGGTGATC TGGCCTATCA GGATGCCCCC TATCAGCAGT ATGATCTGAA CCTGAACGGT GGTAACGAAA AAACGACGTT TTACCTCTCG GGCCAACTGC TGGATCAGAA AGGTATTCTG GTTGGTAATG CCCTGCAGCG CTATGCCGGC CGTCTGAATA TCGAACACCA GGTATCCAGC CGGTTCAAAG CAGGCTTTAA CATGGGACTG ACTCGTACTC TGAACCAACG CATCTCGGGC GATAACCAGT TCGATAACCC CATGCAGATG GTGGCCCTGC CGCCAATGAC ACCCGCAACG GATGCCACAA CGGGGCTCCC TGTGGGCTCC CCTCCCGGCG ACATCAGCAT TCCGGTTTAC TACAACCCAC TCATTAACAT CGGCAATGCG TATTTCAACA CCACCGTTTA CCGGAATATC AGCAATGTAT TTGGGCAATT GCAGATTATG AAAGGCCTAA CGTTCCGAAC AGAGTTTGGC CTCGATGTAC TGAATCAGCA GGAAGAGTTG TACTACAACA GCAAAACGCA GCGGAACTTT GGCTCACCGC TGGGCCTGGG CCGGAACCGT TTCGCCCGGG TAGAAAACTA TACGACGAAC AACTTCTTCA ATTATTCCAC CGCTTTTGGC CGGAGTAACC TCGACGCTAC GGTGGGGATG TCGTATCAGC AATCGCAGCA GAAAACGAAC TTCACCGAAG GCCGGGATTT CCCGTCTGAT GCTTACCGGA TGATTGCCAG TGCGGCCCGC AAAACCGACG GTAGCTCGTC GCAAACGGAT TACCGCTTCC TGTCTTACTT TGCCCGCGCC AATTACAAAT TTGCCGACCG CTACCTACTT GGTGTGAGTG CGCGGGTAGA CGGTTCATCC CGCTTTGGTA ACAACAGCCG GTATGGCTTC TTCCCATCTG TTTCGGCAGG CTGGGTGCTT AGTGAGGAGG GGTTCATGAA AAACACAACG GCTATCAGCT TCCTGAAACT TCGGGCCAGC TACGGCCAGA CGGGTAATGC CGAAATTCCG AATTTCCCGC AGTTGGGTCT CTTCACCGGC GATGCCGGCT ATGGTACGCT GCCTGGTCAG CGCCCATCGC AGTTGGCCAA CCCCGACCTG AAATGGGAAA CAACCAATCA GTTCGACATT GGTATCGACT TTGGTATTCT TAACAACCGC ATCAATGGCG AAATCGACTA TTACAACAAA CAAACGTCGG GTCTGCTGCT CAATGTAAAC GTGCCGGGAA CGACAGGCTT TGCCACGCAG TTCCGCAACG TAGGCAGCCT CGAAAACAAG GGGGTCGAAA TTGTTATTAA TACCGAAAAC CTGACGGGTG CTTTCCGCTG GACAACGAGC TTCAACGCAG CTACGAACCA GAACAAGATC AACAACTTAC AGGGCCAGAT TATCGAAGGC GGTATCAATG CGATGAGTCG TGCGGTAGAA GGCCAGCCAC TGGGCGTTTA TTTCACGCAG GAATATGCCG GTGTTGATCC GGCCAATGGC GATGCTCTTT GGTTCAAAAA CACCACCAAT ACCGACGGTA CTATTGACCG GAGCACCACT AAAACCTACA ACCAGGCTCA GCGGGTTGTT GCTGGTAGCC CGTTACCCAA GTGGACGGGT GGTATTACCA ACACGTTCAG CTACAAAGGT TTCTCACTGA GTGTACTGTT CAACGGTGTT TTTGGCAACA AGATCAACTT CTACGGTGTA GGTCGCTACT CATCGGCCAA CGGTCGTTTC GAGGATAACC AGACGGTCAA CCAGTTGGCA GCCTGGACGA AAGAGAACCC CAACACCAAC GTTCCGGAAG CCCGTCTGTT CTACAACAAC GGTGCCCAGT CGTCCAGCCG TTTCATTCTT GATGGTTCAT TCGTTCGGTT ACGTACGGCT ACCTTATCTT ACTCGCTGCC CAAAACGCTC GTTAACCGGG TTAAGATGAA TAGCGTCCGT CTGTTCGTTA CAGGACAAAA CCTGCTAACG TTTACCAACT ATGCCGGATG GGACCCTGAA GTCAACGCCG ACTACATTGT GTCGAACATT GCGCAGGGGT ACGATTTCTA CACGGCTCCC CAGGCACGCA CCATTACGGG CGGTATTAAC ATTGGTTTCT AA
|
Protein sequence | MRKLLQIGFL ILITVWATYA QGQAVSGRVT SSDDGNPLPG VSVSVKGTTQ GTLTDASGNY RINAGNNAVV VFSFIGFTTQ EEKVANRSEI NVQLKTDVRN LSEVVVTGYG QQIKRDLTGN IAKVKAADIQ DQPVTTFDQA LQGKAAGVQI NSGSGKLGQG IQVRVRGQSS VSASNQPLYI IDGIPVTTDN LSITSSATNP LADINPQDIE SVDILKDASA GAIYGARAAN GVVLITTKRG KAGRTNVNFG AQYGSSKPTR KLEFLNTEQY VKFYNQAAAN SDRIEGLDPS DPDSYTTYMK DFYQTQGLGT YGTSNQASTN WGDLAYQDAP YQQYDLNLNG GNEKTTFYLS GQLLDQKGIL VGNALQRYAG RLNIEHQVSS RFKAGFNMGL TRTLNQRISG DNQFDNPMQM VALPPMTPAT DATTGLPVGS PPGDISIPVY YNPLINIGNA YFNTTVYRNI SNVFGQLQIM KGLTFRTEFG LDVLNQQEEL YYNSKTQRNF GSPLGLGRNR FARVENYTTN NFFNYSTAFG RSNLDATVGM SYQQSQQKTN FTEGRDFPSD AYRMIASAAR KTDGSSSQTD YRFLSYFARA NYKFADRYLL GVSARVDGSS RFGNNSRYGF FPSVSAGWVL SEEGFMKNTT AISFLKLRAS YGQTGNAEIP NFPQLGLFTG DAGYGTLPGQ RPSQLANPDL KWETTNQFDI GIDFGILNNR INGEIDYYNK QTSGLLLNVN VPGTTGFATQ FRNVGSLENK GVEIVINTEN LTGAFRWTTS FNAATNQNKI NNLQGQIIEG GINAMSRAVE GQPLGVYFTQ EYAGVDPANG DALWFKNTTN TDGTIDRSTT KTYNQAQRVV AGSPLPKWTG GITNTFSYKG FSLSVLFNGV FGNKINFYGV GRYSSANGRF EDNQTVNQLA AWTKENPNTN VPEARLFYNN GAQSSSRFIL DGSFVRLRTA TLSYSLPKTL VNRVKMNSVR LFVTGQNLLT FTNYAGWDPE VNADYIVSNI AQGYDFYTAP QARTITGGIN IGF
|
| |