Gene Information |
Locus tag | Shel_12040 |
Symbol | |
ID | 8395095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | + |
Start bp | 1383891 |
End bp | 1388579 |
Gene Length | 4689 bp |
Protein Length | 1562 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644985961 |
Product | putative collagen-binding protein |
Protein accession | YP_003143581 |
Protein GI | 257063909 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
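
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the listed values are consistent with this), and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length_bp(1383891, 1388579)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 4689 1562
```

Both results agree with the Gene Length (4689 bp) and Protein Length (1562 aa) fields of this record.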
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.750329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTTA GCAAAAGAAC GCTCAACATC GTTTCGATAC TCATGAGTTT TCTGCTGGTT CTGAATTGCT TCGGAGCAAC AGGCATTGCC TTTGCAGAAG ACGGCGATGT GGACGCGGTT GGCGCCGCTG AGGAGCAAAC TGTGGCGGCC CTTGCCGATG GCGAGGCCGA ATCCGCCGAA GAACCCCAGG TAGAAGCATC CGATCCCGCA GTCGGGGAGG ATCCTGCCAC CGAGCAAGAG CCCTCTGCCC AAACAGAAGG TTTCGAAACC GTCCAGGCTC CGCGGGCACA AGAGGCAACG GACACAGCCG CAGCCGAAGC GGTCGAGCCT CAGCCTGACC AAGAGGTCCA ACCCTTGGCC GAGCCTGAAA CGGAGGCCGA GGCTACCAGG GCCAACGCGG GTGACGTGCG ATCCCAGACG AGCACCGATC TGGAAAACTT CCTGGTCAAC GTTACAATCG ATGCGCCCAC GGATGATAAT GGCGCATATA TCATCAAGCC TAACAGCACC TACGAAATAG AGATGCGCTT CGCCGAGAAC GAGGACCTTC AGTTCGACGA CGATGCCGTG CTGACATACG ACTTTCCCGC GGGCATGGCT GTTGCCGACG CGGCTGCCAC CACCTTCTCC ATCGCAGTGA CCGACAGCAC CGGCACCGCC ACCGTTGAAG GGAACACCTT CGAAATCGTC GACGGGCAGC TCCGCGTGCG GTTCAACCAG AGCGACCCGA ACTTCGACAA GCTTTCCGCC ACGTCGAACG TTAAATTCGA CATCAACGTC GCTTCCACCT TCGAGCAGGT GGAAGGGCAG CTTGAATTCG CACCGTCCAT CATCAAGGAC TTCGTGTTCG ACACCACTGC CGACGTCACC ATCGACAAGA GCGTCGTGTA CGATGCCGAC TCGGACACCG CGCGCTACAC GCTGCGCATC GCATCCACGG GCACGAATGA GAACGTGGTC ATCGAAGACA GGCTGACAGG CACGGCGCTC GTCTTCAACC AGGACGTCTC CGTGGTGTCG AGCGTCGCGG GCGCACTGTC GGTTACCCCG GACTACGGCT CGGTGCCTAA CGGGTTCCGC GTCGAAATCC CCAGCATGGT CGACGGCGAG GAGCTGACGC TCACCTATAC TGCCGCGGTC GACAACACGA AGATCACCGC CAACGGCACC GTTGCCCAGA CGAACAACAC GGCCACTGTG GATACCGACC AGATTCCCGA CCCCAAGTCC GACAGCGCTG ATTTCTCCGG ACAGGTCAAG TTCAACCGTA TCGACAAGGA GCCGGCCGGG AAACCGGTCC AGATCGGGGA AGGGCTTTAC GAGCAGACCT GGACCATCAC GGTGAACGGC GATCACAAGA TGCCGATGGG TGGAACGTAC ATTTCGGACT GGATCGTCCA GAACAGCCGC CCGTTCATGC AGTTCACGGG CGACGGCATC AGCGTGGCCG TGACCATGGA GAACGGAACC ACCGAAACCA GGAACGTGAC CTGGGATGAC TTGAGGTTGT ACACCAGCGA ATACGGCACC TATGGTTGGG GATACCTGAC GCCCGCATCC GACGGCAAGG CGTCCTACGT CATCACGTGC AAGACGATCA TCAACACCGA GGGCGCGCTG GGCGACCTTA CCCTGCGCAA CGGCGCCCAG GTTTACAGCG CATACGACGA AGCCAGCGTC ACCATGGAAG GCATCGGCGA AGGTACCTTC GACATCGACA AGACCGCCGA AGGAACGACC GCCGAACAGA CCGACTGGAA GATCACCGTT ACGGTGCCAG GCTCCGGCCT GCCTGACCTG 
CGGGTGGTCG ACGACCTGCC TAGGCTTACC TACGAAGGCC AGGAGTACGT GGACACATAC ATCGAGGATT CGATGACCGT CGAAGGCCTG CTGGAAGGCG AGTCCTGGTC GCTGTACGTG GGTACGGAAA AGAAGAGCTA CACCCTGACC ATCTACCAGG ACGAAGCGCA GACCCAGCCG GGCGCGCGGC CGACGGCCAA CGGCGAACCG CGCGACATCG TGGTGCGCTT CAAGACCGCC GTGAACCAGG ATTGGCTGTC GCTTGCCACG GCGGACGGCT ACAACAGCAG CACGCTTCGG ACCCACACCA ACGTGGCGAA CGCCAGGTCG GGCTCGTACC GCACGGATAA CGCGCAGGCC TCCGTGGTCC CCTTGAAGCC GGACTTCGAG AAGGGCTTCC TCGAGCGCAC GGAGTCGCAG GTGGACGGCG TGACCTACCC TGCGTTCACC TATAAGCTGC AGCTGCTGGG CGTCTACGAA GACGGCGCCG CGATCCAGGA CAGCTTCGAC ACCACGTATC TCAAGTACGA CGAAGCGTCC GGCATCGTGG TGCGCGGCGG CATGAGCTCC GGCGCGACCA ACCTGGTCAC GGGCGGATCG GCAACGGCAA CGCCGAACGC CGAGGGCATG CAGATCAACG TGGCGTCGTT CCCCAAGCAG GCCAACGGGA ATTTCTACCC CTACTATGAG ATCGAATACA CCTTGATGGT GAAGGATGAA GCCGCGCTTG CCGCGTTGAA CGAGGCGGCA GCGTCTGCCC AGGGCGGCGT GTATCTTGAC AACACCGCTA CCTGGAACGA CCTGACCTCG GACGAATCGG TCAACTACAC CTATTTCCCC TATGTGGATA AGGAGCTCAC GCAAAGGCCT TCTTCCGACA ACGGATACGT GGCCGAATTC AAGGTGCTGA TCAACCAGTA CGCCGAGGAT CTGGACCCGA CGTCCGAAAC GCTGACCATT CTGGACGAGC TGTCGCCCAA CCTGCGGTTC CTGCCGGATT CGCTGACCAT TACCCCTGCG AACGACTCGA TCGGCGTGCA GCACGACAGC GCCACGAACA CGTTGACCTT TACGAACGTT CCCGACAACA CGGCCTACGA GATTACCTAT CAGGCCAGGG TGCTGGGTGC TGGTAACGTG TCGTATTCGA ACACGATCAA GTTCGGCAAA TACGAGAAGA CGGTCGAAGA GACCACGACC GTATCCCATT CGGGCGGCGG CACGGCAAGC AACCCGAGCA TCACGCTGGT CAAACGCGAT GCTGAAGACC AGACGGCGAC CCTTGCAGGC GCCACGTTCG AGCTCTATTA CATGCGGGGC GACGTGCGCG TGCCGGTGAC GGACAGCAAC GGCAACGCGG TCTCGTTCAC CACGGATGGC GCGGGCCAGG TCCTGATTGC CGGCAACCAG CAGAGCCTTG GTTGGACGCT GTGGACCGAC AGGACGTATT GCCTGGTAGA AACGGCGGCG CCGGCCGGTT ACGAAATCAA CGCCGAACCG GTGTACTTCG TGCTTACCGA AACCCCGACA AGCCAGATGG ATTTCGACAT CGTGGGCGAC ACGCTGAACG TGAACAACGA GCGCATCAAG ACCCAGGTCG CGGTCACCAA AGAATGGAAG GGCCCTGCTG TTTCCAGCGC GCGTGTGAAC CTGCTGGCCA ACGGTGAAAT CGTCGACAGC GCAACGCTGG ATGACGCCAA TGATTGGACC TATGTTTTCG AAGGTCTGGA CGCATACGAC CGCGACGGCA TCGAGATCGC ATACACGGTG GAAGAGGAGC CCGGCGACGC ATTCCGCCTG ATTTCCATCG 
AAGGTAACGC GACCGAGGGC TTTACTGTGA CCAACCTGAA CGCGGATACG GTGAACGTGC CCGTGCAGAA GGAATGGGTC GGTCCTGCGG CCGAATCCGT CACCATGAAC CTGCTGGCGG ACGGGACCAT TGCGGATTCC GTGGTGCTGG ATGAGGCAGG CGGCTGGAGC CACACGTTCG AAGGCCTTCC CAAATACGAC GCTTCCGATG GCCATGAGAT TGTGTACACG GTGGAAGAAG ACTCCTTGGA GGGCTACTCG TCCGAAATTT CGGGCGATGC GGAAACCGGA TTCGTGGTGA CGAACACCAA CGACGCCGTG ACGGAAATTA ACGGAACGAA GACCTGGGAC GATGCAGACA ACCAGGACGG CGTGCGCCCC GACAGCATAA CCGTGCGCCT GTTGGCCGAC GGTGTTGAGG CGCAGGTTTT AACCGTGACG GCCGATGACG ACTGGGCATG GTCGTTCCCG AACCTGCGCG TGTACGACGC CGAAGACGGC CACGAGATCA TATATGCCGT TACTGAGGAT ACGGTGCCGG GGTATTCGAC AAGTTATGAC GGATTCGATA TTGTGAATAC CCATGAACCT GGGAAGACGA GCCTTACCGT AACCAAGGCT TGGGACGACA AGTCCGACAA GGATGGCATT CGTCCTGATA GCGTTACCAT TCGCCTGTTT GCGGACGGTG CGGACACCGG ACAAACCCTT GTGCTGAGCG CCGAAAACGG CTGGACGGGA AGCTTCGAGA ACCTGGACGA AATGAAGTCC GGCGCCAAGA TCAGCTACAC CGTTGAAGAG GAGTCCGTCC CGGGTTACAC GGCGTCGATT ACCGGCGATG CCGAAACAGG CTTTGTGGTA ACCAATTCCC ACGACCCGAA GGACGACACG CCTAAGGACA GCGTGACCAA ATCCAAGTCC AAGCCCAAGC CTGCGAATCC CGCACCGAAG TCCAAGCCTT CGGTTCCCAA GACCGGAGAC GGGACGTTGC CTGTCATCTT GCTGGCTGGC GGGCTGGCGG TTGCGGCTAT TGCGGCCCTG ATTATCGCTT TGAGGGCCAA GCGCAAGAAG AAGGCGTAA
|
Protein sequence | MDVSKRTLNI VSILMSFLLV LNCFGATGIA FAEDGDVDAV GAAEEQTVAA LADGEAESAE EPQVEASDPA VGEDPATEQE PSAQTEGFET VQAPRAQEAT DTAAAEAVEP QPDQEVQPLA EPETEAEATR ANAGDVRSQT STDLENFLVN VTIDAPTDDN GAYIIKPNST YEIEMRFAEN EDLQFDDDAV LTYDFPAGMA VADAAATTFS IAVTDSTGTA TVEGNTFEIV DGQLRVRFNQ SDPNFDKLSA TSNVKFDINV ASTFEQVEGQ LEFAPSIIKD FVFDTTADVT IDKSVVYDAD SDTARYTLRI ASTGTNENVV IEDRLTGTAL VFNQDVSVVS SVAGALSVTP DYGSVPNGFR VEIPSMVDGE ELTLTYTAAV DNTKITANGT VAQTNNTATV DTDQIPDPKS DSADFSGQVK FNRIDKEPAG KPVQIGEGLY EQTWTITVNG DHKMPMGGTY ISDWIVQNSR PFMQFTGDGI SVAVTMENGT TETRNVTWDD LRLYTSEYGT YGWGYLTPAS DGKASYVITC KTIINTEGAL GDLTLRNGAQ VYSAYDEASV TMEGIGEGTF DIDKTAEGTT AEQTDWKITV TVPGSGLPDL RVVDDLPRLT YEGQEYVDTY IEDSMTVEGL LEGESWSLYV GTEKKSYTLT IYQDEAQTQP GARPTANGEP RDIVVRFKTA VNQDWLSLAT ADGYNSSTLR THTNVANARS GSYRTDNAQA SVVPLKPDFE KGFLERTESQ VDGVTYPAFT YKLQLLGVYE DGAAIQDSFD TTYLKYDEAS GIVVRGGMSS GATNLVTGGS ATATPNAEGM QINVASFPKQ ANGNFYPYYE IEYTLMVKDE AALAALNEAA ASAQGGVYLD NTATWNDLTS DESVNYTYFP YVDKELTQRP SSDNGYVAEF KVLINQYAED LDPTSETLTI LDELSPNLRF LPDSLTITPA NDSIGVQHDS ATNTLTFTNV PDNTAYEITY QARVLGAGNV SYSNTIKFGK YEKTVEETTT VSHSGGGTAS NPSITLVKRD AEDQTATLAG ATFELYYMRG DVRVPVTDSN GNAVSFTTDG AGQVLIAGNQ QSLGWTLWTD RTYCLVETAA PAGYEINAEP VYFVLTETPT SQMDFDIVGD TLNVNNERIK TQVAVTKEWK GPAVSSARVN LLANGEIVDS ATLDDANDWT YVFEGLDAYD RDGIEIAYTV EEEPGDAFRL ISIEGNATEG FTVTNLNADT VNVPVQKEWV GPAAESVTMN LLADGTIADS VVLDEAGGWS HTFEGLPKYD ASDGHEIVYT VEEDSLEGYS SEISGDAETG FVVTNTNDAV TEINGTKTWD DADNQDGVRP DSITVRLLAD GVEAQVLTVT ADDDWAWSFP NLRVYDAEDG HEIIYAVTED TVPGYSTSYD GFDIVNTHEP GKTSLTVTKA WDDKSDKDGI RPDSVTIRLF ADGADTGQTL VLSAENGWTG SFENLDEMKS GAKISYTVEE ESVPGYTASI TGDAETGFVV TNSHDPKDDT PKDSVTKSKS KPKPANPAPK SKPSVPKTGD GTLPVILLAG GLAVAAIAAL IIALRAKRKK KA
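
The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before use. The following sketch strips the whitespace and computes a GC percentage comparable to the GC content field. The helper names are hypothetical, and the short input string is a made-up example, not taken from this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Made-up example: 8 bases, 6 of which are G or C.
example = "ATGC GGCC"
print(len(clean_sequence(example)), gc_percent(example))  # 8 75.0
```

Applied to the full gene sequence, the cleaned length would be expected to match the Gene Length field if the sequence was copied completely.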