Gene Shel_12040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShel_12040 
Symbol 
ID8395095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSlackia heliotrinireducens DSM 20476 
KingdomBacteria 
Replicon accessionNC_013165 
Strand
Start bp1383891 
End bp1388579 
Gene Length4689 bp 
Protein Length1562 aa 
Translation table11 
GC content60% 
IMG OID644985961 
Productputative collagen-binding protein 
Protein accessionYP_003143581 
Protein GI257063909 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.750329 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTTA GCAAAAGAAC GCTCAACATC GTTTCGATAC TCATGAGTTT TCTGCTGGTT 
CTGAATTGCT TCGGAGCAAC AGGCATTGCC TTTGCAGAAG ACGGCGATGT GGACGCGGTT
GGCGCCGCTG AGGAGCAAAC TGTGGCGGCC CTTGCCGATG GCGAGGCCGA ATCCGCCGAA
GAACCCCAGG TAGAAGCATC CGATCCCGCA GTCGGGGAGG ATCCTGCCAC CGAGCAAGAG
CCCTCTGCCC AAACAGAAGG TTTCGAAACC GTCCAGGCTC CGCGGGCACA AGAGGCAACG
GACACAGCCG CAGCCGAAGC GGTCGAGCCT CAGCCTGACC AAGAGGTCCA ACCCTTGGCC
GAGCCTGAAA CGGAGGCCGA GGCTACCAGG GCCAACGCGG GTGACGTGCG ATCCCAGACG
AGCACCGATC TGGAAAACTT CCTGGTCAAC GTTACAATCG ATGCGCCCAC GGATGATAAT
GGCGCATATA TCATCAAGCC TAACAGCACC TACGAAATAG AGATGCGCTT CGCCGAGAAC
GAGGACCTTC AGTTCGACGA CGATGCCGTG CTGACATACG ACTTTCCCGC GGGCATGGCT
GTTGCCGACG CGGCTGCCAC CACCTTCTCC ATCGCAGTGA CCGACAGCAC CGGCACCGCC
ACCGTTGAAG GGAACACCTT CGAAATCGTC GACGGGCAGC TCCGCGTGCG GTTCAACCAG
AGCGACCCGA ACTTCGACAA GCTTTCCGCC ACGTCGAACG TTAAATTCGA CATCAACGTC
GCTTCCACCT TCGAGCAGGT GGAAGGGCAG CTTGAATTCG CACCGTCCAT CATCAAGGAC
TTCGTGTTCG ACACCACTGC CGACGTCACC ATCGACAAGA GCGTCGTGTA CGATGCCGAC
TCGGACACCG CGCGCTACAC GCTGCGCATC GCATCCACGG GCACGAATGA GAACGTGGTC
ATCGAAGACA GGCTGACAGG CACGGCGCTC GTCTTCAACC AGGACGTCTC CGTGGTGTCG
AGCGTCGCGG GCGCACTGTC GGTTACCCCG GACTACGGCT CGGTGCCTAA CGGGTTCCGC
GTCGAAATCC CCAGCATGGT CGACGGCGAG GAGCTGACGC TCACCTATAC TGCCGCGGTC
GACAACACGA AGATCACCGC CAACGGCACC GTTGCCCAGA CGAACAACAC GGCCACTGTG
GATACCGACC AGATTCCCGA CCCCAAGTCC GACAGCGCTG ATTTCTCCGG ACAGGTCAAG
TTCAACCGTA TCGACAAGGA GCCGGCCGGG AAACCGGTCC AGATCGGGGA AGGGCTTTAC
GAGCAGACCT GGACCATCAC GGTGAACGGC GATCACAAGA TGCCGATGGG TGGAACGTAC
ATTTCGGACT GGATCGTCCA GAACAGCCGC CCGTTCATGC AGTTCACGGG CGACGGCATC
AGCGTGGCCG TGACCATGGA GAACGGAACC ACCGAAACCA GGAACGTGAC CTGGGATGAC
TTGAGGTTGT ACACCAGCGA ATACGGCACC TATGGTTGGG GATACCTGAC GCCCGCATCC
GACGGCAAGG CGTCCTACGT CATCACGTGC AAGACGATCA TCAACACCGA GGGCGCGCTG
GGCGACCTTA CCCTGCGCAA CGGCGCCCAG GTTTACAGCG CATACGACGA AGCCAGCGTC
ACCATGGAAG GCATCGGCGA AGGTACCTTC GACATCGACA AGACCGCCGA AGGAACGACC
GCCGAACAGA CCGACTGGAA GATCACCGTT ACGGTGCCAG GCTCCGGCCT GCCTGACCTG
CGGGTGGTCG ACGACCTGCC TAGGCTTACC TACGAAGGCC AGGAGTACGT GGACACATAC
ATCGAGGATT CGATGACCGT CGAAGGCCTG CTGGAAGGCG AGTCCTGGTC GCTGTACGTG
GGTACGGAAA AGAAGAGCTA CACCCTGACC ATCTACCAGG ACGAAGCGCA GACCCAGCCG
GGCGCGCGGC CGACGGCCAA CGGCGAACCG CGCGACATCG TGGTGCGCTT CAAGACCGCC
GTGAACCAGG ATTGGCTGTC GCTTGCCACG GCGGACGGCT ACAACAGCAG CACGCTTCGG
ACCCACACCA ACGTGGCGAA CGCCAGGTCG GGCTCGTACC GCACGGATAA CGCGCAGGCC
TCCGTGGTCC CCTTGAAGCC GGACTTCGAG AAGGGCTTCC TCGAGCGCAC GGAGTCGCAG
GTGGACGGCG TGACCTACCC TGCGTTCACC TATAAGCTGC AGCTGCTGGG CGTCTACGAA
GACGGCGCCG CGATCCAGGA CAGCTTCGAC ACCACGTATC TCAAGTACGA CGAAGCGTCC
GGCATCGTGG TGCGCGGCGG CATGAGCTCC GGCGCGACCA ACCTGGTCAC GGGCGGATCG
GCAACGGCAA CGCCGAACGC CGAGGGCATG CAGATCAACG TGGCGTCGTT CCCCAAGCAG
GCCAACGGGA ATTTCTACCC CTACTATGAG ATCGAATACA CCTTGATGGT GAAGGATGAA
GCCGCGCTTG CCGCGTTGAA CGAGGCGGCA GCGTCTGCCC AGGGCGGCGT GTATCTTGAC
AACACCGCTA CCTGGAACGA CCTGACCTCG GACGAATCGG TCAACTACAC CTATTTCCCC
TATGTGGATA AGGAGCTCAC GCAAAGGCCT TCTTCCGACA ACGGATACGT GGCCGAATTC
AAGGTGCTGA TCAACCAGTA CGCCGAGGAT CTGGACCCGA CGTCCGAAAC GCTGACCATT
CTGGACGAGC TGTCGCCCAA CCTGCGGTTC CTGCCGGATT CGCTGACCAT TACCCCTGCG
AACGACTCGA TCGGCGTGCA GCACGACAGC GCCACGAACA CGTTGACCTT TACGAACGTT
CCCGACAACA CGGCCTACGA GATTACCTAT CAGGCCAGGG TGCTGGGTGC TGGTAACGTG
TCGTATTCGA ACACGATCAA GTTCGGCAAA TACGAGAAGA CGGTCGAAGA GACCACGACC
GTATCCCATT CGGGCGGCGG CACGGCAAGC AACCCGAGCA TCACGCTGGT CAAACGCGAT
GCTGAAGACC AGACGGCGAC CCTTGCAGGC GCCACGTTCG AGCTCTATTA CATGCGGGGC
GACGTGCGCG TGCCGGTGAC GGACAGCAAC GGCAACGCGG TCTCGTTCAC CACGGATGGC
GCGGGCCAGG TCCTGATTGC CGGCAACCAG CAGAGCCTTG GTTGGACGCT GTGGACCGAC
AGGACGTATT GCCTGGTAGA AACGGCGGCG CCGGCCGGTT ACGAAATCAA CGCCGAACCG
GTGTACTTCG TGCTTACCGA AACCCCGACA AGCCAGATGG ATTTCGACAT CGTGGGCGAC
ACGCTGAACG TGAACAACGA GCGCATCAAG ACCCAGGTCG CGGTCACCAA AGAATGGAAG
GGCCCTGCTG TTTCCAGCGC GCGTGTGAAC CTGCTGGCCA ACGGTGAAAT CGTCGACAGC
GCAACGCTGG ATGACGCCAA TGATTGGACC TATGTTTTCG AAGGTCTGGA CGCATACGAC
CGCGACGGCA TCGAGATCGC ATACACGGTG GAAGAGGAGC CCGGCGACGC ATTCCGCCTG
ATTTCCATCG AAGGTAACGC GACCGAGGGC TTTACTGTGA CCAACCTGAA CGCGGATACG
GTGAACGTGC CCGTGCAGAA GGAATGGGTC GGTCCTGCGG CCGAATCCGT CACCATGAAC
CTGCTGGCGG ACGGGACCAT TGCGGATTCC GTGGTGCTGG ATGAGGCAGG CGGCTGGAGC
CACACGTTCG AAGGCCTTCC CAAATACGAC GCTTCCGATG GCCATGAGAT TGTGTACACG
GTGGAAGAAG ACTCCTTGGA GGGCTACTCG TCCGAAATTT CGGGCGATGC GGAAACCGGA
TTCGTGGTGA CGAACACCAA CGACGCCGTG ACGGAAATTA ACGGAACGAA GACCTGGGAC
GATGCAGACA ACCAGGACGG CGTGCGCCCC GACAGCATAA CCGTGCGCCT GTTGGCCGAC
GGTGTTGAGG CGCAGGTTTT AACCGTGACG GCCGATGACG ACTGGGCATG GTCGTTCCCG
AACCTGCGCG TGTACGACGC CGAAGACGGC CACGAGATCA TATATGCCGT TACTGAGGAT
ACGGTGCCGG GGTATTCGAC AAGTTATGAC GGATTCGATA TTGTGAATAC CCATGAACCT
GGGAAGACGA GCCTTACCGT AACCAAGGCT TGGGACGACA AGTCCGACAA GGATGGCATT
CGTCCTGATA GCGTTACCAT TCGCCTGTTT GCGGACGGTG CGGACACCGG ACAAACCCTT
GTGCTGAGCG CCGAAAACGG CTGGACGGGA AGCTTCGAGA ACCTGGACGA AATGAAGTCC
GGCGCCAAGA TCAGCTACAC CGTTGAAGAG GAGTCCGTCC CGGGTTACAC GGCGTCGATT
ACCGGCGATG CCGAAACAGG CTTTGTGGTA ACCAATTCCC ACGACCCGAA GGACGACACG
CCTAAGGACA GCGTGACCAA ATCCAAGTCC AAGCCCAAGC CTGCGAATCC CGCACCGAAG
TCCAAGCCTT CGGTTCCCAA GACCGGAGAC GGGACGTTGC CTGTCATCTT GCTGGCTGGC
GGGCTGGCGG TTGCGGCTAT TGCGGCCCTG ATTATCGCTT TGAGGGCCAA GCGCAAGAAG
AAGGCGTAA
 
Protein sequence
MDVSKRTLNI VSILMSFLLV LNCFGATGIA FAEDGDVDAV GAAEEQTVAA LADGEAESAE 
EPQVEASDPA VGEDPATEQE PSAQTEGFET VQAPRAQEAT DTAAAEAVEP QPDQEVQPLA
EPETEAEATR ANAGDVRSQT STDLENFLVN VTIDAPTDDN GAYIIKPNST YEIEMRFAEN
EDLQFDDDAV LTYDFPAGMA VADAAATTFS IAVTDSTGTA TVEGNTFEIV DGQLRVRFNQ
SDPNFDKLSA TSNVKFDINV ASTFEQVEGQ LEFAPSIIKD FVFDTTADVT IDKSVVYDAD
SDTARYTLRI ASTGTNENVV IEDRLTGTAL VFNQDVSVVS SVAGALSVTP DYGSVPNGFR
VEIPSMVDGE ELTLTYTAAV DNTKITANGT VAQTNNTATV DTDQIPDPKS DSADFSGQVK
FNRIDKEPAG KPVQIGEGLY EQTWTITVNG DHKMPMGGTY ISDWIVQNSR PFMQFTGDGI
SVAVTMENGT TETRNVTWDD LRLYTSEYGT YGWGYLTPAS DGKASYVITC KTIINTEGAL
GDLTLRNGAQ VYSAYDEASV TMEGIGEGTF DIDKTAEGTT AEQTDWKITV TVPGSGLPDL
RVVDDLPRLT YEGQEYVDTY IEDSMTVEGL LEGESWSLYV GTEKKSYTLT IYQDEAQTQP
GARPTANGEP RDIVVRFKTA VNQDWLSLAT ADGYNSSTLR THTNVANARS GSYRTDNAQA
SVVPLKPDFE KGFLERTESQ VDGVTYPAFT YKLQLLGVYE DGAAIQDSFD TTYLKYDEAS
GIVVRGGMSS GATNLVTGGS ATATPNAEGM QINVASFPKQ ANGNFYPYYE IEYTLMVKDE
AALAALNEAA ASAQGGVYLD NTATWNDLTS DESVNYTYFP YVDKELTQRP SSDNGYVAEF
KVLINQYAED LDPTSETLTI LDELSPNLRF LPDSLTITPA NDSIGVQHDS ATNTLTFTNV
PDNTAYEITY QARVLGAGNV SYSNTIKFGK YEKTVEETTT VSHSGGGTAS NPSITLVKRD
AEDQTATLAG ATFELYYMRG DVRVPVTDSN GNAVSFTTDG AGQVLIAGNQ QSLGWTLWTD
RTYCLVETAA PAGYEINAEP VYFVLTETPT SQMDFDIVGD TLNVNNERIK TQVAVTKEWK
GPAVSSARVN LLANGEIVDS ATLDDANDWT YVFEGLDAYD RDGIEIAYTV EEEPGDAFRL
ISIEGNATEG FTVTNLNADT VNVPVQKEWV GPAAESVTMN LLADGTIADS VVLDEAGGWS
HTFEGLPKYD ASDGHEIVYT VEEDSLEGYS SEISGDAETG FVVTNTNDAV TEINGTKTWD
DADNQDGVRP DSITVRLLAD GVEAQVLTVT ADDDWAWSFP NLRVYDAEDG HEIIYAVTED
TVPGYSTSYD GFDIVNTHEP GKTSLTVTKA WDDKSDKDGI RPDSVTIRLF ADGADTGQTL
VLSAENGWTG SFENLDEMKS GAKISYTVEE ESVPGYTASI TGDAETGFVV TNSHDPKDDT
PKDSVTKSKS KPKPANPAPK SKPSVPKTGD GTLPVILLAG GLAVAAIAAL IIALRAKRKK
KA