Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0021 |
Symbol | |
ID | 8723749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 19274 |
End bp | 22204 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003384894 |
Protein GI | 284034964 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCATA ATTTGGAGGA AAAACCAGTG GCTGAAATAA CCGTTAGCGG ACGGGTTACG GATGCCACTA CCAACGAAGC CCTCGCGGGT TGTAACGTTG TACTGAAAGG TACACAGAAA GGAACAACGA CGGATGCCAA TGGCGATTAT AAAATTGTAG TGCCCGATGG TAATGCCACA CTGGTGTTCG GGTTTATCGG TTTTATTTCT CAGGACGTAC CCGTAGGAAA CCGCACGGTC ATCAACGTAT CGCTGAAAGC GTCGGCTTCG GAGCTGGCGC AGGTCGTTGT TATTGGTTAC GGTACTACCA CAAAGAAAGA CGTAACCGGA TCGCTCAAAA CAATCAAGAG TACAGATTTT AACCGGGGTA TCATCAACTC ACCTGAGCAG CTTTTGCAGG GTAAAGTAGC GGGCGTAAAT GTTACCTCAG CCAGTGGTGA GCCGGGGGGT GTTCAAAATA TTACGGTACG TGGGCCGGGG GGTGTTCGGA CAGGTAGTAC GCCACTCTTC GTACTGGACG GTATTGCGCT CGATAACTCA AGTACGGGTG GTGCAACTAA CCCATTAAAT TTTCTGAACC CACAGGATAT CGAAGCCATC GATGTTCTGA AAGATGCCTC TGCAACAGCT ATTTATGGTG CACGGGGTGC TAACGGCGTA ATTCTGATCA CGACCAAGAA AGGCAAAGCT GGTGCAACTA ACCTTACTCT TTCCTCAAAC ATAGGGATTT CGAACATGGC CCGCCCCATT GCGCTGTTTT CAACGGATGA GTACAAGCAG CAGGTAGCGG CTGTAGGCGG TGTGGTCGAC GATCAGAAAG GATCTACGGA TTGGCAGCGT GAAATCAGCC GGACGGCGGT TACCCAAAAT CATAATCTGT CGTTCGGTGG CGGTGCCGAC CGCCTGACCT ATTATGGCTC TATTGGCGTG CAGGACCAGC AGGGTATCCT GAAAAATAGC AGCCTAAAAC GGTACACCGC CCGTTTCAAC GCTTCACAGA AATTTCTGGA AAATCGACTG GTGCTGGATG TCAACATGAC GGCCTCGCAA ACGATCAACG AGCGTCCGCC AATCGAAGGA ATAATCGGAG CGGCTCTGTC GGCCAATCCA ACGTATCCGG CGCGCGATGC CAATGGCAAT CCAGCCCGCT ATCAGGCCTT CACCAACCCA TTGCTGGCAT TGAATCTGAA CAAGGACCTG ACAACCATCA ACCGGGTTGT GGCGTCGGTA TCACCATCGT TCAGCATCAC CAAAAACCTG GTTTACAAGC TGAATCTGGG CGTTGATAAT TCAAGCTCCA CGCGCGACCA GCAGTCTTAC GCTAGTACGG TGCCGCAGCA GGACGGCCGC CTGGATGCTA CCTACCTTAA CAACCGGAAC GTACTGGTCG AAAACTATTT CACCTACACC AAAACTTCGG GCGATCATAA CCTGACCGCT TTGCTGGGGC ATTCGTATCA GAAGTTTACG ATTCAGGGGC GTAACTGGAG CATCAACAAA TTTCCAATAT CACCCATTGA ACCCGCCAAC AACCCTGGCC TGGGGCAAGA CCTGACGCTG GCCAACAACC GTCCCGGCGG CTATGCGATC ATCAATGAGT TACAGTCGTT CTTCTCACGG GTTAATTACG CCTATAAAGA TCGCTACCTG TTTACGGCTA CGGTTCGTGC CGATGGGTCG AGCAAATTTG GCGCAAACAA CAAGTATGGT GTATTCCCTT CGTTCTCGGG CGGCTGGCGG CTGTCGGAAG AAGGGTTCCT TAAATCGGGA CCATTCTCGG ATCTGAAACT CCGCGCCGGT TGGGGACAAA CGGGTAACCA GGAAATACCG TCGAAGATTA CCCAGGCCCT GTTCACCTCG AACGTGTCGG CCTCAACCAG TTACCCGCTC GATGGGTCAA CCAACTATCC GGCGGGAACC ACGTATACCC GTCTGGCCAA TCCTGACATT CAATGGGAAG TATCGACCCA AACCGACCTG GGCCTTGATT TTGGTCTGTT CCGGGGGGCG TTGACGGGTT CCGTCGATTA TTTCCATAAG ACATCGGGTA AGATTCTGCT CGAAGTGATT CCTTCCGATC CTATTCAGCC CGCTTCCACC TACTGGACCA ACGTGCCGAA TATGACCATC ACCAACCAGG GACTTGAGCT TGATCTGAAC TACCGTTACG CCAGCACAAG CGGCTTCCGG TTCGACATAG GTGGCAATGT TACGTTCATT AAAAATGTAG TGAATAATTC GCCATACACG GTTATTACCT CAGGCTCCGC ATCGGGAGCC GGGTTGACAT CGGCTACGGT AAACGGCTAT GTGAACGGAC AGCCCATCGG AACGTTCTTC CTGCGGGAAT ACCTGGGCGT TGACGACAAA GGGGTTAACC GATTCAGTGA CATAGACGGT GATGGAATCG GTGGTACCGA CAAAGACCGG ATTGCTGCGG GAAGCGCCTT GCCAACCCGC CAGTTTAACC TCAATTTCAG TACGGCTTAC AAAGGTTTCG ACCTAACGGC CAATTTTAAC GGCGTGTCGG GTAATAAAAT TTACGACAAC ACGACGAATG CGTTCTTCTA CAAAGCACGT CTGGTAAAAG GGCTGAATGG ACCCGCTGAA TCAATTGGTG AGCCAACCGA GTCAATCAAT AACCCGGCTC CTGTATCGAC ACGCTTCCTG AGAGACGGCG CTTTCTTCCG GCTCAATAAC CTGTCGCTGG GCTACAATCT AAATCCCCGT ACCCTTGGTA TGAATCGCTG GATTTCAAAC ATCCGACTAT CGGTAACGGG TCAGAACTTG TTTGTTATCA CGAAATACAA AGGGTATGAT CCTGAAGTAA ACATCGACCG CACGGTTAAT GGTATCTCGT CGTATGGAAT CGACTACCTC AGTTATCCTA AAGCGCGTTC GTTTGTGTTT GGCTTAAATC TTACCTTCTA A
|
Protein sequence | MAHNLEEKPV AEITVSGRVT DATTNEALAG CNVVLKGTQK GTTTDANGDY KIVVPDGNAT LVFGFIGFIS QDVPVGNRTV INVSLKASAS ELAQVVVIGY GTTTKKDVTG SLKTIKSTDF NRGIINSPEQ LLQGKVAGVN VTSASGEPGG VQNITVRGPG GVRTGSTPLF VLDGIALDNS STGGATNPLN FLNPQDIEAI DVLKDASATA IYGARGANGV ILITTKKGKA GATNLTLSSN IGISNMARPI ALFSTDEYKQ QVAAVGGVVD DQKGSTDWQR EISRTAVTQN HNLSFGGGAD RLTYYGSIGV QDQQGILKNS SLKRYTARFN ASQKFLENRL VLDVNMTASQ TINERPPIEG IIGAALSANP TYPARDANGN PARYQAFTNP LLALNLNKDL TTINRVVASV SPSFSITKNL VYKLNLGVDN SSSTRDQQSY ASTVPQQDGR LDATYLNNRN VLVENYFTYT KTSGDHNLTA LLGHSYQKFT IQGRNWSINK FPISPIEPAN NPGLGQDLTL ANNRPGGYAI INELQSFFSR VNYAYKDRYL FTATVRADGS SKFGANNKYG VFPSFSGGWR LSEEGFLKSG PFSDLKLRAG WGQTGNQEIP SKITQALFTS NVSASTSYPL DGSTNYPAGT TYTRLANPDI QWEVSTQTDL GLDFGLFRGA LTGSVDYFHK TSGKILLEVI PSDPIQPAST YWTNVPNMTI TNQGLELDLN YRYASTSGFR FDIGGNVTFI KNVVNNSPYT VITSGSASGA GLTSATVNGY VNGQPIGTFF LREYLGVDDK GVNRFSDIDG DGIGGTDKDR IAAGSALPTR QFNLNFSTAY KGFDLTANFN GVSGNKIYDN TTNAFFYKAR LVKGLNGPAE SIGEPTESIN NPAPVSTRFL RDGAFFRLNN LSLGYNLNPR TLGMNRWISN IRLSVTGQNL FVITKYKGYD PEVNIDRTVN GISSYGIDYL SYPKARSFVF GLNLTF
|
| |