Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1571 |
Symbol | |
ID | 8725305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1894943 |
End bp | 1897357 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003386419 |
Protein GI | 284036489 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.579738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0739696 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGGC TTTTAGGTTT AATTCTCGGT TTATTGCCTT TTTGGACATC TGCTCAGACG CTATTGGTGC GTGATAAAAC TACGCTTCAA TCCATTGAGA ATGTGGAGGT CAGAAGACTG TCGCCGGGGG CATCACAGCC GATTTTTACA GACCGTTCAG GTCAGGCAGA TGCATCAGCG CTAACCGGTA CCGACAACGT TGTTTTCCGC CGGGTGGGTT ACCAGACCGT TCGGTATTCA ATGGAGCAGC TTCGGACGCT GAATTTTACC GTGCTGATGG CCGAAAAGCA ACTGGCAATC AATGAGGTCG TCGTGGCCGC TAGCCGTACT ACCGAGTCGC TTTTGAAAGT GGCTCAGCCT ATTCGGGTCT TTACCCGGAA TGAGCTGCGC TTTCTGAATC AGCCAACCAT GGCCGAGGTG TTGCAGCAAA GTGGTCAGGT ACTGGTTCAG AAGAGCCAAT TGGGTGGTGG TAGCCCGATT CTTCGTGGTT TTGAAGCCAA TAAAGTGCTG ATGGTTGTCG ACGGCGTTCG GATGAACAAT GCCATCTTTC GGGGAGGGCA CCTTCAGAAT ATCCTGACCA TCGACAATGC CGCTGTCGAA CGGATGGAAG TCGCGCTGGG GCCAGGATCG GTGGTGTACG GCAGCGATGC GCTGGGCGGT GTTATCTATG TACAGACTTT ATCGCCCAAA CTGAGCGTGT CCGAAAACAC GGCCGTCAAC GCCAATGGCT TTGTTCGATA CGGCAGTGCC ATGAACGAAA AAACCGCTCA CGCCGACTGG AACCTTGGCT TTCGGAAGTG GGCGTTGACA ACCAGCGTAA CCGGTTCGGA CTTTGGTGAT CTCCGGCAGG GAAAACAGCG GAACGCCGAT ATGGGACAAC TGGGTTTACG GCCCTTCTTC GCAGGGTTTG AAAACAATAC GGATGTGAAG ATCACCAATC CTGACCCGCT GGTTCAGACA CCGTCGGGCT ATAAACAGAT CGATCTATTG CAAAAAGTGT TGTTTCAGCC GAATGAACGG ACGCAGCATT TGCTGAACGT TCAGTTTTCG ACCAGCAGTG ATATTCCGCG CTACGACCGG CTCACGGAAG TCGACGCAAA AGGGAATCCG AGCCATGCTC AGTGGTATTA TGGGCCGCAG AAACGCCTGT TAACATCGTA CGGTCTGACC AAACAGTTTA CTTCCGGTAT AGCCGATGAA CTCAAATTGA TTGCCGCTTA CCAGTCAATA GAAGAAAGTC GGCATAACCG TCGCTTCGGA AATTACGGAT TGCAGCACCG AACGGAAAAC GTGAATGTCT GGACGCTGAA CGCCGATTTG AAAAAGAAAC TAGCCGACTC GCATACCCTG CGCTACGGCC TGGAGGGAAC CTACAACACC GTTCAGTCGA CGGCGTACCG ACAAAATGTA CAGACCGGAA AAATAGACCC GCTGGACACG CGCTACCCCG ATGGCGGAGC CAATACCCAG TCGTTGGCGG GGTATGTGTC GGGAACGCTG GACGTGAGCA CTCGTTCCAC ACTGACCTAT GGCGCCCGCT ATGCCTATAA TCGATTGTAC GCGAAATTCA ATGACAAAAC ATTCTTCCCG TTTCCGTTCA ATGATATCAC CCAGCAGTCG GGTGCCGTTA CGGGTAGTCT TGGTTGGGTA ACGCGCCTGC AGGGAGAGTG GCAACTGGCC ACGTCGGTTT CGTCGGGGTA TCGCGTGCCG AATGTGGATG ATCTGGCCAA AGTGTTCGAG TCGGTGGCCG GAAATCTGAT CGTTCCCAAT CCCAATCTGA AACCGGAGCG CACCTACACC TTCGATGCCG GTGTTCGCAA GCAGATTGCC GAACGCGTTT CGTTCGAAGC AGAAGGCTTT TATACGATCT ACAATAATGC TATCAACACC CAGCCGGGCA TGTTAAACGG ACAATCCCAA ATCGACTACA ACGGTCGCAG CAGTCGGATC GTTACCCAGG TCAATTCGCA GCAGGCGCGG TTATTCGGGT TCAACGCGCA GCTTTCGGCC GATCTGACTC AGTCGCTTAC CGTGTTCGGC ACCGTAACCT ATACAAAAGG CCGTATCCGG ACCGACTCCG TGGGCTACCC CCTCGACCAC ATTCCACCGC TGTATGGCAA AGGCGGCATC CGGCTAACGA TCAGACAATT TCGGGCTGAG GCCAATGTTC TATTTAATGG ATGGAAACGG TTGAAGGATT ACAATCTGGT AGGGGAGGAT AACATCGTGT ACGCAACATC ACAGGGTATG CCCGCCTGGC AAACGGTTAA TCTCAGAACC AGCTATCAGG TGAATCGCAA CTTGCAGATG CAGGCCTCGC TGGAAAATAT TCTGGATCGA AACTATCGCG TTTTTGCATC GGGAATCAGC GCGCCCGGCC GGAATCTAAT ACTCACCTTG CGGGGAACGC TATAA
|
Protein sequence | MKRLLGLILG LLPFWTSAQT LLVRDKTTLQ SIENVEVRRL SPGASQPIFT DRSGQADASA LTGTDNVVFR RVGYQTVRYS MEQLRTLNFT VLMAEKQLAI NEVVVAASRT TESLLKVAQP IRVFTRNELR FLNQPTMAEV LQQSGQVLVQ KSQLGGGSPI LRGFEANKVL MVVDGVRMNN AIFRGGHLQN ILTIDNAAVE RMEVALGPGS VVYGSDALGG VIYVQTLSPK LSVSENTAVN ANGFVRYGSA MNEKTAHADW NLGFRKWALT TSVTGSDFGD LRQGKQRNAD MGQLGLRPFF AGFENNTDVK ITNPDPLVQT PSGYKQIDLL QKVLFQPNER TQHLLNVQFS TSSDIPRYDR LTEVDAKGNP SHAQWYYGPQ KRLLTSYGLT KQFTSGIADE LKLIAAYQSI EESRHNRRFG NYGLQHRTEN VNVWTLNADL KKKLADSHTL RYGLEGTYNT VQSTAYRQNV QTGKIDPLDT RYPDGGANTQ SLAGYVSGTL DVSTRSTLTY GARYAYNRLY AKFNDKTFFP FPFNDITQQS GAVTGSLGWV TRLQGEWQLA TSVSSGYRVP NVDDLAKVFE SVAGNLIVPN PNLKPERTYT FDAGVRKQIA ERVSFEAEGF YTIYNNAINT QPGMLNGQSQ IDYNGRSSRI VTQVNSQQAR LFGFNAQLSA DLTQSLTVFG TVTYTKGRIR TDSVGYPLDH IPPLYGKGGI RLTIRQFRAE ANVLFNGWKR LKDYNLVGED NIVYATSQGM PAWQTVNLRT SYQVNRNLQM QASLENILDR NYRVFASGIS APGRNLILTL RGTL
|
| |