Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0203 |
Symbol | |
ID | 8723931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 268973 |
End bp | 272116 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003385067 |
Protein GI | 284035137 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.671806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAAA TTTTATTGGG AAGCTGGTTA CTTTCTCTGT TATTCTGTTT GCCCGTACTG GCGCAGGATA TAGCAGTTAG TGGCCGCGTC ACTTCATCAG ACGACGGCTC CACCCTACCC GGTGTGAGCG TGCAGGTAAA AGGAACAACC CGTGGTGCTA TTACAGACGC CGACGGAAAC TATCGAATCA GCGTACCGGC CAATGCGCGA CTCGTATTTA GTTTTATTGG TTATACCGGA CAGGAGGTTG CCGTTGGCAA CAAAACAACG ATCAACGTTA CGCTGGTAGC GGGTTCGCAA AGCCTCGATG AAATCGTTGT GACGGCTCAG GGTATCGAGC GCGACAAGCG TTCACTGGGC TACGCTACGC AGGAAATTGG CGGTAATATT CTAGCACAAC GCTCAGAACC AAACCTGCTT AATGCCTTAC AGGGTAAAGT AGCGGGCGTC AATATCACGG GTTCCAGTGG CACACCAGGC GCATCAACAA ACATCAACAT TCGGGGTATT ACATCATTCA ATGGCAGTAA CCAGCCCTTG ATCGTGGTCG ATGGGATCAT CTTTAGCAAC GACGTTAACC TGACACAGAA CACGCTGTTT GGTACGCAGC CTTCTAACCG TCTGGCCGAC ATCAACCCGG AGAGTATTGA ATCGGTGAAC GTACTGAAAG GACCCGCAGC TGCCGTTCTG TACGGTTCGC GGGCTTCGGC TGGTGCTATC GTTATTACGA CGAAATCGGG CCGCAACCAG AACAACAAAA CCGAAGTAAC GGTCAATTCG TCGTACAACG TACAGAATGT ATACGGCATT CCAAGGTTCC AGAACGACTA CGGGCAAGGA GCAAACAACC TGTTTACCCC AACGTCCAAT AACTCATGGG GACCTCCGTT TGTAGGTGGA CCAACATCGG TTACTAATAC CCAGAACCAG GTTGTGCCCT ATCAGGCGTA TCCGAACAAC GTACGGGACT TCTACCGTCA GGGCAGTATT CTGCAGAACT CGGTTAACAT TGCCTCGGGC GATGCCACAC GCAACTACAT TATTGCTATT GGTAATACCC TACAGAATGG TATCGTTCAG AACACGAAAT TCAACCGGAC AAACGTACAG TTAGGCGGTG AGTCGAAACT GAAAAACGGC CTGAAAGTGA GTGGTACGGG TACGTACGTA CAAACGGTAT CGCGCGCAGT GCCGGGTGGT AACGGCGCCA GTGCGTTTGG GCAGATTACC CGTATTCCGC GGAGTTATGA CCTGGCAAAC GAGCCCTACC AGGGTGCTGA CGGAAAAAGT ATCTACTTCA CCCCGTCGAC AAACAACCCA CAATGGAGCG TTAACAACGA ACGTCTCGAC AGCCAGGTTG ACCGTTTCTT CGGCAATTTC CAACTGAGCT ACGACGTAGC CAGCTGGTTA AACGTCGCTT ACCGGGTAAC GGGTGATACT TACACCGACC GTCGTAAGTT GATTCTGCCC ATTAGCTCGG GTCGTGCTCC GGCAGGCCAG GTTCAGCAGG ATAACTTCTT CCGGAATGAG TTGAACGGCG ATCTGTTGAT CACGGCCCGC AAGGACAACC TGTTTATGGA AGGGCTGAAC GCGAACCTGC TGCTGGGTAA CAACATCAAC CAGCGGAAAA CCCAGGAGTC GGCGGCCGAT GCAACCTCCC TGACCCTGCC CGGTTTTTAT AACATCAACA GCGGTACGGT GTTCACGGGT ACGTTCGAAA GCTCGACCCT GCGTCGTCTG GTGGGTTACT ATGGCCAGTT GTCGCTGAAC TACAATAACT ACATCTTCCT GGAATTATCC GGACGTGCCG ACCAGTCGTC TACTCTGCCG AAAGCGAACA ACACCTATTT CTATCCGGGT GCCTCGGTTA GCTTCGTGCC AACAGACGCC TTCAAGATCA ACTCCGACGT GCTTTCTTAT GCGAAAGTGC GGGCCAGCAT TGCTAAAGTT GGTCGTGATG CCGACCCTTA CCAGTTGAGC ACCGTTTACA ATAAGTCTAG CTATGGTAAC AACGTTGCTA ACATCGTTTA CCCACTGGCA CCAAACAACA CACCCGGCTT TAGCATTGAT ACCCGAATTG GTAACAACAG CCTCAAGCCT GAGTTTGTAA CGTCGTATGA GTTCGGTATC AACCTTGGCT TCTTCAAGAA CCGCCTGAGC GTCGATGCTA CGTACTTCGA CTCAAAGAGT ACGCAGCAGA TTTTCAACGT AGCCGTTTCG AACTCATCGG GTTTCGACAC CCGGACAACC AACGTTGGTG AATTACAGAA CCGGGGGGTT GAATTGATCC TGAGTGCAAC GCCCGTGCGA GTTGGTGGTT TCAAATGGGA TGCTACGCTG AACTACACGC TGATCCGCAA CAAGGTGGTT TCCATTGCTC CCGGTGTTAA GTCGTCTAAC ATAGCGGGTA ACTCATTTAT CGGTATTGCT CCGTCTATCT ACGAAGGCTA TCCGTATGGT GTTATTGTTA GTACGGCCAA CTCACGGGCT CAGAATACCG ACCCCAATGG TTTGTACTAT GACGCAACCG GTCAGTTTGC CGGTCAGTAT ATTATCAATG GAACCAACGG GCAGTTTGCG CCGGGCTTGG CCAACTCGGT TATTTCGAAC CCGCAGCCTA ACTACATCGC GGGCTTAACC AACACCTTCT CGTACAAAGG TATAGCCCTG TCTGTACTGG TTGATACCCG TCAGGGTGGT CAGATTTTCT CGTTCAACGC CGTTGATGCC CGCCAGAATG GCTCGATGTA CGTAACGGGT ATTGACCGCG ATCAGCCGCG TATTTTACCC GGCGTGATCC AGAATGCAGA CGGCACCTTC CGCCCGAACA ACATCCAGCT ACCGGCCCAA ACCTACTGGG GCGCCCTGGG CGGTCTGGCT TCGGAAGCCG CCGTTTACGA CGCAACGGTC TATCGTCTGC GCGAGGTGGC GCTGAACTAC TCCTTACCGA AAACCTTACT TGGCAAAACG CCATTTGGTG CTATTTCGGT GGGCGTGAGC GGTCGTAACC TGTTTTTCTA CGCGCCCAAC TTCCCGGCCG ACCCGGAGGT CAACACGCAG GGAGCGGGTA ATATTCAGGG CCTTGACCTG AACGGGCCGC CAAATACACG GAACTTCGGC GGTAACATTC GGCTCACGTT CTAA
|
Protein sequence | MQKILLGSWL LSLLFCLPVL AQDIAVSGRV TSSDDGSTLP GVSVQVKGTT RGAITDADGN YRISVPANAR LVFSFIGYTG QEVAVGNKTT INVTLVAGSQ SLDEIVVTAQ GIERDKRSLG YATQEIGGNI LAQRSEPNLL NALQGKVAGV NITGSSGTPG ASTNINIRGI TSFNGSNQPL IVVDGIIFSN DVNLTQNTLF GTQPSNRLAD INPESIESVN VLKGPAAAVL YGSRASAGAI VITTKSGRNQ NNKTEVTVNS SYNVQNVYGI PRFQNDYGQG ANNLFTPTSN NSWGPPFVGG PTSVTNTQNQ VVPYQAYPNN VRDFYRQGSI LQNSVNIASG DATRNYIIAI GNTLQNGIVQ NTKFNRTNVQ LGGESKLKNG LKVSGTGTYV QTVSRAVPGG NGASAFGQIT RIPRSYDLAN EPYQGADGKS IYFTPSTNNP QWSVNNERLD SQVDRFFGNF QLSYDVASWL NVAYRVTGDT YTDRRKLILP ISSGRAPAGQ VQQDNFFRNE LNGDLLITAR KDNLFMEGLN ANLLLGNNIN QRKTQESAAD ATSLTLPGFY NINSGTVFTG TFESSTLRRL VGYYGQLSLN YNNYIFLELS GRADQSSTLP KANNTYFYPG ASVSFVPTDA FKINSDVLSY AKVRASIAKV GRDADPYQLS TVYNKSSYGN NVANIVYPLA PNNTPGFSID TRIGNNSLKP EFVTSYEFGI NLGFFKNRLS VDATYFDSKS TQQIFNVAVS NSSGFDTRTT NVGELQNRGV ELILSATPVR VGGFKWDATL NYTLIRNKVV SIAPGVKSSN IAGNSFIGIA PSIYEGYPYG VIVSTANSRA QNTDPNGLYY DATGQFAGQY IINGTNGQFA PGLANSVISN PQPNYIAGLT NTFSYKGIAL SVLVDTRQGG QIFSFNAVDA RQNGSMYVTG IDRDQPRILP GVIQNADGTF RPNNIQLPAQ TYWGALGGLA SEAAVYDATV YRLREVALNY SLPKTLLGKT PFGAISVGVS GRNLFFYAPN FPADPEVNTQ GAGNIQGLDL NGPPNTRNFG GNIRLTF
|
| |