Gene Information |
Locus tag | Shel_18990 |
Symbol | |
ID | 8395788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | + |
Start bp | 2129285 |
End bp | 2133085 |
Gene Length | 3801 bp |
Protein Length | 1266 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644986650 |
Product | putative collagen-binding protein |
Protein accession | YP_003144264 |
Protein GI | 257064592 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
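The coordinates and lengths in the table above are mutually consistent, which can be checked with a short calculation. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the gene sequence in this record ends in TAA). The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated by the record itself):
    - coordinates are 1-based and inclusive on both ends
    - the final codon is a stop codon and is not counted in the protein
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length_aa = gene_length_bp // 3 - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


if __name__ == "__main__":
    # Start bp and End bp copied from the Gene Information table.
    gene_bp, protein_aa = cds_lengths(2129285, 2133085)
    print(f"Gene length: {gene_bp} bp, protein length: {protein_aa} aa")
```

With the values from this record, the sketch gives 3801 bp and 1266 aa, matching the Gene Length and Protein Length fields.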
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.133816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGGACAG CGGCTCGTGA GGCCTCCACC TCCAAGCTCT TCAGGGTCGT CATGGCGGCC GTGCTCGCGG TATCCATGTG CATCCCCGCA ACCGCGCTGC AGGCGCAGCG CGCATTCGCG GCCTCGGACT CGCTCTCCCA GGTCACCTGG GAGGAGGCCA AGCCGCAGAT CGAGCAGTAC ATGGGCGTGC CCTACGTGTG GGGCGGCCGC TCGACCGACG GCTGGGACTG CTCGGGCTTC GTGTCGTTCG TGATGCACGA CATCTACGGC ACCGACTGGC CGGGCGGCAC CTGGGGCGAC TCGGGCACGG GCTCGATCTA CAACTACTGC GCCGACTACC AGGTGTTCTC GGGCGACTCG GCGGAGGACT ACAACTCGGC GTTCGACAAC GGCACCGTGA AGCCCGGCGA CATCATCATC TTCCTGAACT CGCAGGTCGT GACGGTCCAC GCCGCAATCG CAGGCGAGGA CGCCACGGTC TACCACGCCT GGCACGAGGG CTTCGGAACC GGCAACTGCC GCTTCGACTA CATGTGGGGC ATCAACGGCG GGCACGGCAA GGTCTACAGC TCCTTCGTCG TCTACCGCGG CCTCACCGAG GGCGGCTACG TGACCATGGA CAAGTCGTCC GCAGACGTGA AGGTCACGGG TGGCAACGCG CAGTACTCGC TCGCGGGGGC CGTCTTCGGC GTGTACCGCG ACGGCGCGCA GGTGGCCACC ATCACCACCG ACGGGGACGG CCACGGCTCC ACATCCGACA AGCTCCCGAA CGGCTCCTAC ACCGTGAGGG AGATCGAGGC GCCGGCAGGC TACGCGCTCG ACGGCCGCAC GTTCCCCGTC ACCGTCTCCG GCTCCGACGC CGGGGCGGAG ATGGCCGAGC GGCCCGTCAC CGTCACCCTC ACCGTCGCGA AGTTCGACGA CGGCACCGGG CGGCGTGAGG CGCAGGGCGA CGCCACGCTC GACGGGGCGG TCTACGAGGC AAGCTACGCC TACGACGGTG GCACGAAGAC CGTCACGGGC ACGACGAGCA ACGGCACCGT GACCTTCGAA GGCATCCCGC TCGGCGACGT CACCGTGCGC GAGCTTGAGG CTCCCGAGGG CTACCTTCCC GACGCCGAGA AGCACACCCT GACCGTGAGC GCCGGCATGG CCGGCCACGA CCCCGTCTTC TACTACGAGC CGGCCGACGA GTTTGCGGAG ACCGTCTTTC GCGGCGGCAT CACCGTCGGC AAGGCCGACT CCGAGGAGTA CGACCACAGG GACGGCGACT ACGCCAACTA CGCCCAGGGC GACGCGACCT TCGCGGGCGC CGAGTTCGCG GTCGTGAACC GCTCCGAGGG ATCGATCTGG TACGACGCGA ACAAGGACGG GAAGCACCAG GATTCCGAGG ACTTCGCCCC CGGCGCCGAG GTGCTTCGCA TGACCACCTC CTACGACGCG GCCCTCGACG CCTACGTCGC AAAGACCGGC GAGAGGGCGC TGCCGTACGG CACCTACGGG GTCGTCGAGA CCAAGTCGCC CGAGGGCTAC ACCGAGCGCG GCAAGGTCGA CGTCACCTGC GAGATCCGCG AGGACGGCCA GGTGGTGGAA CTCACCCGAG ACTCCGGCAT CGAGAACGAG GTGATCCGCG GCGGCGTGCA AGTGGAGAAG GACGACTGGG AGCTCGGCAA GTCCGAGGCC ATCGGCGGCG CGGGCCACTC CGCGCTCGAC GCGCAGGGCC ACCTCGGGAC GTCGCTCGAG GGCATCGAGT TCACGGTCAC GAACGCCTCC AGGCACGGCG TGATGCTCGA CGGCGAGTAC 
GTTGGCAAGG GCGAGGTCGC GGCGACCATC TACACGTCCT GGAACGAGGA GCTCGGCGCC TACACGGCCC AGACCGGGCC CGACGCGCTC CCGTACGGCA CCTACACCAT CATGGAGACG GCCACGAACG ACTCATACAT GCTCACCGAC GGGAAGGCCC GCACCTTCGA GGTCCGCACC GACGGCGAGA CCGTGACCTT CGACAAATCC GGCGCGGACC TCACGTGGCG CGACCGCGTC GTGCGCAACG ACGTCCACCT GCAGAAGAAG GGCGCCGACG ACTCGCACAA GTTCGCCTAC GTGCCGTTCC TCATCACCAA CGTCACCACG GGCGAGGCGC ACGTGGCAGT GACCGACCGC AACGGGGCGC TGAACACCTC CGCCGGCTGG AACAGCCACA TCCGCGACGC TAACGCCAAC GACTCGCTCA TCGGCAAGGA AACCATCACG GCCTCCGACG TCGACGAGTC CTGCGGCGTG TGGTTCGGCC TTGCCGAGGA CGGCTCGGTC TCCGAGCCCG ACGACCGGTT CGGCGCGCTG CCGTACGGCG AGTACACCAT CGAGGAGCTG CGCTGCGAGT CCAACGAGGG CTACGGCCTT TGGAGCGACA GCTTCAACGT ATCGCGCGAC TCCACGACGA CCAAGTTCGA CGTCGACCTC GGCACGGTCG ACGACCAGCC CGGCCCGCGC ATCGGCACGA CTGCGACCGA CGCCGCAGAC GGCGACCACG AGGCCTACGC CGGCAAGGTC GAGATCGTCG ACACCGTCGC GTACCGCAAC CTCGAGCCCG GACGTGAGTA CACCGTCACG GGCACGCTCA TGGACAAGGC CACGGGCGAG TCCGTCAAGG ACGGCGGCAA AGAGGTGACC GCCAACGCGA CCTTCACCGC CAAGGACGCC AACGGCACAG TAGACGTCAC CTTCTCGTTC GACGCGTCCG CGCTCGCCGG CCACGACGTC GTAGCCTTCG AGACGCTCAC GCATGAGGGC CGCGAGGTCG CCGTGCACGC GGAGATCGAG GACGAGGGCC AGACCGTGAG CATCGTCCCC AAGCCCGAAA TCGGCACCAC GGCGGTCGAC GCCGACGACC TTGACCACGA GGCGGGGGCC GACTCCAAGG TCGAGATCCG CGACGTGGTG GCCTACAAGG GGCTCACCCC CGGCAAGGCC TACACCCTCA CCGGCAGCCT CATGGACAAG GAGACCGGCG AGCCCGTGCA GTCCGGAAGC AAGGACGTGA CCTCGACCGT GACGTTCACG CCCGAAAAGG CCGACGGGTC GGCCGCGGTG ACGTTCTCGT TCGACGGCAG AGCCCTAGCG GGCCACGACG TCGTCGCATT CGAGTCGCTT AAGTCCGGTG GCGCAGAAAT CGCCTCGCAC ATGGACATCG AGGACGGCGG CCAAACTGTC AAGCTGGTCA AGCCCGAGGA GCCGCAGAAG CCCGAAGAGC CCTCCATCGG CACCATGGCC ACCGACGCCG ACGACGGCGA CCACGAGGCC GTCGCGGACG CAGAGGTGAC CATCGTCGAC GAGGTCGCCT ACAAGGGCCT CACCCCCGGG GAGGAGTACA CCGTCACCGG CACGCTCATG GACAAGGAGA CCGGCAAAGC CGTCCAGTCG GGCGGAAAGG ATGTGACAGC CACGGCCTCG TTCGTCCCCG ACAAGGCGTC CGGTTCCGTG AGCCTCTCCT TCACGTTCGA CGGGAGCGCC CTCAAGGGCC ACGACATCGT AGCGTTCGAG ACGGTCTCCA AGGACGGGGC GGAGGTCGCG GTGCATGCCG ACATCGACGA CGCCGGCCAG ACCGTAAGCC 
TGGTTGAAGA GCCCGCGACG CCTTCGAGCC CGACGCCCAA GGGCAGCCTG CCCAAGACCG GCGACGCGGT GCCCTGGATC CCGCTCGCAT GCCTGGCTGC CGCGGCGGCA TGCGGAATCG CCATCCTCGT GTTGATGCGC AGGAAGGGCA ACTGGATCGA CGAGGACGAT GGCGACATTA TCGAAGAGTA A
Protein sequence | MRTAAREAST SKLFRVVMAA VLAVSMCIPA TALQAQRAFA ASDSLSQVTW EEAKPQIEQY MGVPYVWGGR STDGWDCSGF VSFVMHDIYG TDWPGGTWGD SGTGSIYNYC ADYQVFSGDS AEDYNSAFDN GTVKPGDIII FLNSQVVTVH AAIAGEDATV YHAWHEGFGT GNCRFDYMWG INGGHGKVYS SFVVYRGLTE GGYVTMDKSS ADVKVTGGNA QYSLAGAVFG VYRDGAQVAT ITTDGDGHGS TSDKLPNGSY TVREIEAPAG YALDGRTFPV TVSGSDAGAE MAERPVTVTL TVAKFDDGTG RREAQGDATL DGAVYEASYA YDGGTKTVTG TTSNGTVTFE GIPLGDVTVR ELEAPEGYLP DAEKHTLTVS AGMAGHDPVF YYEPADEFAE TVFRGGITVG KADSEEYDHR DGDYANYAQG DATFAGAEFA VVNRSEGSIW YDANKDGKHQ DSEDFAPGAE VLRMTTSYDA ALDAYVAKTG ERALPYGTYG VVETKSPEGY TERGKVDVTC EIREDGQVVE LTRDSGIENE VIRGGVQVEK DDWELGKSEA IGGAGHSALD AQGHLGTSLE GIEFTVTNAS RHGVMLDGEY VGKGEVAATI YTSWNEELGA YTAQTGPDAL PYGTYTIMET ATNDSYMLTD GKARTFEVRT DGETVTFDKS GADLTWRDRV VRNDVHLQKK GADDSHKFAY VPFLITNVTT GEAHVAVTDR NGALNTSAGW NSHIRDANAN DSLIGKETIT ASDVDESCGV WFGLAEDGSV SEPDDRFGAL PYGEYTIEEL RCESNEGYGL WSDSFNVSRD STTTKFDVDL GTVDDQPGPR IGTTATDAAD GDHEAYAGKV EIVDTVAYRN LEPGREYTVT GTLMDKATGE SVKDGGKEVT ANATFTAKDA NGTVDVTFSF DASALAGHDV VAFETLTHEG REVAVHAEIE DEGQTVSIVP KPEIGTTAVD ADDLDHEAGA DSKVEIRDVV AYKGLTPGKA YTLTGSLMDK ETGEPVQSGS KDVTSTVTFT PEKADGSAAV TFSFDGRALA GHDVVAFESL KSGGAEIASH MDIEDGGQTV KLVKPEEPQK PEEPSIGTMA TDADDGDHEA VADAEVTIVD EVAYKGLTPG EEYTVTGTLM DKETGKAVQS GGKDVTATAS FVPDKASGSV SLSFTFDGSA LKGHDIVAFE TVSKDGAEVA VHADIDDAGQ TVSLVEEPAT PSSPTPKGSL PKTGDAVPWI PLACLAAAAA CGIAILVLMR RKGNWIDEDD GDIIEE