Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3031 |
Symbol | |
ID | 8726783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 3669574 |
End bp | 3672642 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003387841 |
Protein GI | 284037911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.258909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAC ACTTACTTTA TTTCAAAATC CTGGTTCTTA CACTGCTTAC GACGCTCAGT TTTGCCCAGA GCGTTGAGGT GAAGGGACGC ATAACGGGCG AAGGAGGAGC GCCTATCTAC GGGGCCAACG TAGTGGTGAA AGGTAGCCGT CAGGGGGCTA TTTCAGACGA AAAGGGCGAT TACCGCATTC AGGTACCCAA AGGCTCTACC TTAACCGTCA GTTTTATTGG CTATGTAGCC AAGGATGTTG TTGTCGGCAA TAGCTCGGTT ATCAATGTGT CCCTGGCTCC TCAGTCGTCG GTATTGGATG AGGTGGTGGT TACGGCTCTT GGTATCAAGA AGGAGAAAAA AGCACTTGGT TATTCTGTAA GTGAAGTAAA GGGCGAGGAA CTCACGCAGG CCCGGACGGT CAACGTAGCG AACTCACTAC AGGGTCGGGT AGCGGGCCTG AACATCGCCA ATACCGCCAC TGGCCCGGGG GGCTCGGCAC GAATCATTAT CCGGGGTAAT GGATCGATCT CCGGAAATAA CCAGCCGCTG ATCGTTGTTG ATGGCACGCC CATCAATAAT GACAACCAGG GCTCTGCAGG TATGTGGGGT GGTGGCGACG GTGGCGATGG TATCTCCAGT CTCAACCCGG ACGAAATCGA AACCATCAGC GTACTGAAAG GAGCTACGGC TTCGGCGCTG TATGGCTCAC GGGCTTCGAA CGGGGTTATT CTGGTGACGA CGAAAGGAGG TAAAGCCAAC AAAGGCATTG GTGTTGAGGT CAACAGCAAC TTTGTTGGTG AGAGCCTGTT GCTACCTACC TACAAAGATT ATCAGTACGA GTACGGTATG GGAAACAATG GCATTAAGCC AACAACCATT GCCGAAGCGC TGACATCGAA TAGCTGGGGT AGCAAATTGG ACGGAAGCAG TGTTATTCAA TTTGATGGCG TTTCCCGGCC TTACTCGGCT GTTCGGGATA ACCAGCAGAA TTTCTACCGG GTAGGAAGCA CCTTTACCAA CTCGGTTGCG CTTACGGGAG CTACCGAATC GATGACGTAC CGCCTGTCGA TGAATGACCT CAACAACAAG GCGGTTGTTC CCAATAGCGG TCTGCGCCGG AACAATTTCG CCTTGAATCT CAATGCGAAT CTGGGTAAAA ACCTGTCTGT CGTTACCAAC GTCAAGTATA TCCTGGAACG CACCAACAAC CGTCCCCGCT TATCTGACTC ACCGGGTAGC GCAACCTATG CTCTGAATGC CATGCCTACA TCGCTGGGTA TAGCAGCACT GGAGCAAAGC CGGTATAATG CGGATGGTTC GGAGAAAACC TGGTCGGATA ACATCTATAT TCAAAACCCT TATATCGCTG CCTACGACTG GCGTCAGGAG GACAAAAAAG GCCGGATCAT CGGTGTAATA GAGCCTCGGT ACAACTTCAC TGACTGGCTG TTCTTACGGG GTCGTCTGGG CTTCGATAAC TTCAACTACC GGAACCTGAG CATTACGCCG TATGGTACAC CCTTCCAGCC TCGGGGCGGT ATGAACGTTG CCAACCGCAA CTTCACGGAA ACCAATACCG AATTGCTGTT GGGTGTTAAC CGGAAGTTTG GCGAAGCATT TGGTGTAAAT GCGCTGTTCG GTGGTAACCT GATGCGTCAG GTGTACCAGA ACTCAAACTA CGGCGGCAAC AACTTCAATA TCCCGTACTT CTACGATATA TCGAACATCG ACCCGGCTGC CCGTAACTCC AGCGAAAACT ACATCGAGAA GCGGATCAAC TCGGTATATG GTTCGGCTGA ATTCTCGTAT AAAGGTTATC TGTTCGTAAC GGCTACGGCT CGTAACGACT GGTTCTCGAC GTTGGCTAAA GGTAACAACA GTATTCTGTA CCCATCCGTT GGTGGTAGCT TTGTGCTGTC AGAAGCGGTT AGAATGCCGA AAGCCGTTAA CTATCTGAAA TTTAGAGGCT CGTGGGCGCA GGCCGGTGGT GATACCGACC CGTACAACCT GTCTCTGTAT TACGGACTGG CCGGTGCTCA CCTGGGCGCT CCGCTGGCAC AGATCAATGG TGACCGCGTA CCGAACTCCA ATCTGCAGCC GCTCACCTCG ACAACTTCGG AAGCCGGTCT GGAAACACGT TTGTTCAACA ATAAGCTTAG CATCGACTTC GCTGTATATT CCCGTAAAAC AACCAACGAC ATCGTTGGTG CCACCATTTC CAACACGTCG GGCTACAACA GCGCCCTGTT TAACGTGGGC GAAATTTCCA ACAAAGGGAT TGAGCTTTTG CTGACCTACC GGTTGGCAAG CAGCAAGGAT TTTAGCTGGG ATGCTTCGTT CAACATGGGC TACAACAAAA GCGAAGTGGT TAGCCTGTAT GGTAACCTGA CAACCCTGCG GGTAGATGAA AACCGGACGC GTATTGCGTA TATCCACCAG GACGTAGGAC TGCCCTACAG TCAGGTAAAA GGGTTTACCT ACAAGCGGAA TTCGGCCGGG GCTATTGTCT ATGATTCGCA GGGTTACCCA ATGCAGGGTG ATCTGGTTAA TTTCGGCACG GGTGTAGCGC CAACTACGCT TGGCTTCAAT AACTCTTTCC GATACAAAGG GATTGGCATT AGCTTCCTGA TCGATGGTAA ATTTGGTGGC GTGATTTACT CCGGTACCAA CGCGTACGCA AACCGCCGTG GTTTGCTCAA GTCGACGCTC GAAGGTCGTG AAACGGGTAT CGTTGGCGTA GGTGTAAACG AAAAAGGTGA ACCTAATACG GTTAAGGTGC CAGCACAACA GTACTATGAG CGCCTGTTCA ACATTGCTGA TCCCTTCGTT TACAGCGCTG ATTTCCTGAA ACTCCGCCAG GTAATTATCG ACTACACGAT TCCGGCGCGG GTATTCGGCA AATCGCCTAT CAAAGGGGCT TCGATTTCTA TTGTTGGTCG TAACCTGGCT ATCCTGATGA AGCATACGCC AAACATCGAT CCTGAATCGA CTTACAATAA TTCAAACGCA CAAGGGCTTG AACTGGCAGG CGTACCCGCC ACACGCACTT TGGGGGTTAA CCTGAATTTG AAATTCTAA
|
Protein sequence | MQKHLLYFKI LVLTLLTTLS FAQSVEVKGR ITGEGGAPIY GANVVVKGSR QGAISDEKGD YRIQVPKGST LTVSFIGYVA KDVVVGNSSV INVSLAPQSS VLDEVVVTAL GIKKEKKALG YSVSEVKGEE LTQARTVNVA NSLQGRVAGL NIANTATGPG GSARIIIRGN GSISGNNQPL IVVDGTPINN DNQGSAGMWG GGDGGDGISS LNPDEIETIS VLKGATASAL YGSRASNGVI LVTTKGGKAN KGIGVEVNSN FVGESLLLPT YKDYQYEYGM GNNGIKPTTI AEALTSNSWG SKLDGSSVIQ FDGVSRPYSA VRDNQQNFYR VGSTFTNSVA LTGATESMTY RLSMNDLNNK AVVPNSGLRR NNFALNLNAN LGKNLSVVTN VKYILERTNN RPRLSDSPGS ATYALNAMPT SLGIAALEQS RYNADGSEKT WSDNIYIQNP YIAAYDWRQE DKKGRIIGVI EPRYNFTDWL FLRGRLGFDN FNYRNLSITP YGTPFQPRGG MNVANRNFTE TNTELLLGVN RKFGEAFGVN ALFGGNLMRQ VYQNSNYGGN NFNIPYFYDI SNIDPAARNS SENYIEKRIN SVYGSAEFSY KGYLFVTATA RNDWFSTLAK GNNSILYPSV GGSFVLSEAV RMPKAVNYLK FRGSWAQAGG DTDPYNLSLY YGLAGAHLGA PLAQINGDRV PNSNLQPLTS TTSEAGLETR LFNNKLSIDF AVYSRKTTND IVGATISNTS GYNSALFNVG EISNKGIELL LTYRLASSKD FSWDASFNMG YNKSEVVSLY GNLTTLRVDE NRTRIAYIHQ DVGLPYSQVK GFTYKRNSAG AIVYDSQGYP MQGDLVNFGT GVAPTTLGFN NSFRYKGIGI SFLIDGKFGG VIYSGTNAYA NRRGLLKSTL EGRETGIVGV GVNEKGEPNT VKVPAQQYYE RLFNIADPFV YSADFLKLRQ VIIDYTIPAR VFGKSPIKGA SISIVGRNLA ILMKHTPNID PESTYNNSNA QGLELAGVPA TRTLGVNLNL KF
|
| |