Gene Shel_18990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShel_18990 
Symbol 
ID8395788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSlackia heliotrinireducens DSM 20476 
KingdomBacteria 
Replicon accessionNC_013165 
Strand
Start bp2129285 
End bp2133085 
Gene Length3801 bp 
Protein Length1266 aa 
Translation table11 
GC content68% 
IMG OID644986650 
Productputative collagen-binding protein 
Protein accessionYP_003144264 
Protein GI257064592 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.133816 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACAG CGGCTCGTGA GGCCTCCACC TCCAAGCTCT TCAGGGTCGT CATGGCGGCC 
GTGCTCGCGG TATCCATGTG CATCCCCGCA ACCGCGCTGC AGGCGCAGCG CGCATTCGCG
GCCTCGGACT CGCTCTCCCA GGTCACCTGG GAGGAGGCCA AGCCGCAGAT CGAGCAGTAC
ATGGGCGTGC CCTACGTGTG GGGCGGCCGC TCGACCGACG GCTGGGACTG CTCGGGCTTC
GTGTCGTTCG TGATGCACGA CATCTACGGC ACCGACTGGC CGGGCGGCAC CTGGGGCGAC
TCGGGCACGG GCTCGATCTA CAACTACTGC GCCGACTACC AGGTGTTCTC GGGCGACTCG
GCGGAGGACT ACAACTCGGC GTTCGACAAC GGCACCGTGA AGCCCGGCGA CATCATCATC
TTCCTGAACT CGCAGGTCGT GACGGTCCAC GCCGCAATCG CAGGCGAGGA CGCCACGGTC
TACCACGCCT GGCACGAGGG CTTCGGAACC GGCAACTGCC GCTTCGACTA CATGTGGGGC
ATCAACGGCG GGCACGGCAA GGTCTACAGC TCCTTCGTCG TCTACCGCGG CCTCACCGAG
GGCGGCTACG TGACCATGGA CAAGTCGTCC GCAGACGTGA AGGTCACGGG TGGCAACGCG
CAGTACTCGC TCGCGGGGGC CGTCTTCGGC GTGTACCGCG ACGGCGCGCA GGTGGCCACC
ATCACCACCG ACGGGGACGG CCACGGCTCC ACATCCGACA AGCTCCCGAA CGGCTCCTAC
ACCGTGAGGG AGATCGAGGC GCCGGCAGGC TACGCGCTCG ACGGCCGCAC GTTCCCCGTC
ACCGTCTCCG GCTCCGACGC CGGGGCGGAG ATGGCCGAGC GGCCCGTCAC CGTCACCCTC
ACCGTCGCGA AGTTCGACGA CGGCACCGGG CGGCGTGAGG CGCAGGGCGA CGCCACGCTC
GACGGGGCGG TCTACGAGGC AAGCTACGCC TACGACGGTG GCACGAAGAC CGTCACGGGC
ACGACGAGCA ACGGCACCGT GACCTTCGAA GGCATCCCGC TCGGCGACGT CACCGTGCGC
GAGCTTGAGG CTCCCGAGGG CTACCTTCCC GACGCCGAGA AGCACACCCT GACCGTGAGC
GCCGGCATGG CCGGCCACGA CCCCGTCTTC TACTACGAGC CGGCCGACGA GTTTGCGGAG
ACCGTCTTTC GCGGCGGCAT CACCGTCGGC AAGGCCGACT CCGAGGAGTA CGACCACAGG
GACGGCGACT ACGCCAACTA CGCCCAGGGC GACGCGACCT TCGCGGGCGC CGAGTTCGCG
GTCGTGAACC GCTCCGAGGG ATCGATCTGG TACGACGCGA ACAAGGACGG GAAGCACCAG
GATTCCGAGG ACTTCGCCCC CGGCGCCGAG GTGCTTCGCA TGACCACCTC CTACGACGCG
GCCCTCGACG CCTACGTCGC AAAGACCGGC GAGAGGGCGC TGCCGTACGG CACCTACGGG
GTCGTCGAGA CCAAGTCGCC CGAGGGCTAC ACCGAGCGCG GCAAGGTCGA CGTCACCTGC
GAGATCCGCG AGGACGGCCA GGTGGTGGAA CTCACCCGAG ACTCCGGCAT CGAGAACGAG
GTGATCCGCG GCGGCGTGCA AGTGGAGAAG GACGACTGGG AGCTCGGCAA GTCCGAGGCC
ATCGGCGGCG CGGGCCACTC CGCGCTCGAC GCGCAGGGCC ACCTCGGGAC GTCGCTCGAG
GGCATCGAGT TCACGGTCAC GAACGCCTCC AGGCACGGCG TGATGCTCGA CGGCGAGTAC
GTTGGCAAGG GCGAGGTCGC GGCGACCATC TACACGTCCT GGAACGAGGA GCTCGGCGCC
TACACGGCCC AGACCGGGCC CGACGCGCTC CCGTACGGCA CCTACACCAT CATGGAGACG
GCCACGAACG ACTCATACAT GCTCACCGAC GGGAAGGCCC GCACCTTCGA GGTCCGCACC
GACGGCGAGA CCGTGACCTT CGACAAATCC GGCGCGGACC TCACGTGGCG CGACCGCGTC
GTGCGCAACG ACGTCCACCT GCAGAAGAAG GGCGCCGACG ACTCGCACAA GTTCGCCTAC
GTGCCGTTCC TCATCACCAA CGTCACCACG GGCGAGGCGC ACGTGGCAGT GACCGACCGC
AACGGGGCGC TGAACACCTC CGCCGGCTGG AACAGCCACA TCCGCGACGC TAACGCCAAC
GACTCGCTCA TCGGCAAGGA AACCATCACG GCCTCCGACG TCGACGAGTC CTGCGGCGTG
TGGTTCGGCC TTGCCGAGGA CGGCTCGGTC TCCGAGCCCG ACGACCGGTT CGGCGCGCTG
CCGTACGGCG AGTACACCAT CGAGGAGCTG CGCTGCGAGT CCAACGAGGG CTACGGCCTT
TGGAGCGACA GCTTCAACGT ATCGCGCGAC TCCACGACGA CCAAGTTCGA CGTCGACCTC
GGCACGGTCG ACGACCAGCC CGGCCCGCGC ATCGGCACGA CTGCGACCGA CGCCGCAGAC
GGCGACCACG AGGCCTACGC CGGCAAGGTC GAGATCGTCG ACACCGTCGC GTACCGCAAC
CTCGAGCCCG GACGTGAGTA CACCGTCACG GGCACGCTCA TGGACAAGGC CACGGGCGAG
TCCGTCAAGG ACGGCGGCAA AGAGGTGACC GCCAACGCGA CCTTCACCGC CAAGGACGCC
AACGGCACAG TAGACGTCAC CTTCTCGTTC GACGCGTCCG CGCTCGCCGG CCACGACGTC
GTAGCCTTCG AGACGCTCAC GCATGAGGGC CGCGAGGTCG CCGTGCACGC GGAGATCGAG
GACGAGGGCC AGACCGTGAG CATCGTCCCC AAGCCCGAAA TCGGCACCAC GGCGGTCGAC
GCCGACGACC TTGACCACGA GGCGGGGGCC GACTCCAAGG TCGAGATCCG CGACGTGGTG
GCCTACAAGG GGCTCACCCC CGGCAAGGCC TACACCCTCA CCGGCAGCCT CATGGACAAG
GAGACCGGCG AGCCCGTGCA GTCCGGAAGC AAGGACGTGA CCTCGACCGT GACGTTCACG
CCCGAAAAGG CCGACGGGTC GGCCGCGGTG ACGTTCTCGT TCGACGGCAG AGCCCTAGCG
GGCCACGACG TCGTCGCATT CGAGTCGCTT AAGTCCGGTG GCGCAGAAAT CGCCTCGCAC
ATGGACATCG AGGACGGCGG CCAAACTGTC AAGCTGGTCA AGCCCGAGGA GCCGCAGAAG
CCCGAAGAGC CCTCCATCGG CACCATGGCC ACCGACGCCG ACGACGGCGA CCACGAGGCC
GTCGCGGACG CAGAGGTGAC CATCGTCGAC GAGGTCGCCT ACAAGGGCCT CACCCCCGGG
GAGGAGTACA CCGTCACCGG CACGCTCATG GACAAGGAGA CCGGCAAAGC CGTCCAGTCG
GGCGGAAAGG ATGTGACAGC CACGGCCTCG TTCGTCCCCG ACAAGGCGTC CGGTTCCGTG
AGCCTCTCCT TCACGTTCGA CGGGAGCGCC CTCAAGGGCC ACGACATCGT AGCGTTCGAG
ACGGTCTCCA AGGACGGGGC GGAGGTCGCG GTGCATGCCG ACATCGACGA CGCCGGCCAG
ACCGTAAGCC TGGTTGAAGA GCCCGCGACG CCTTCGAGCC CGACGCCCAA GGGCAGCCTG
CCCAAGACCG GCGACGCGGT GCCCTGGATC CCGCTCGCAT GCCTGGCTGC CGCGGCGGCA
TGCGGAATCG CCATCCTCGT GTTGATGCGC AGGAAGGGCA ACTGGATCGA CGAGGACGAT
GGCGACATTA TCGAAGAGTA A
 
Protein sequence
MRTAAREAST SKLFRVVMAA VLAVSMCIPA TALQAQRAFA ASDSLSQVTW EEAKPQIEQY 
MGVPYVWGGR STDGWDCSGF VSFVMHDIYG TDWPGGTWGD SGTGSIYNYC ADYQVFSGDS
AEDYNSAFDN GTVKPGDIII FLNSQVVTVH AAIAGEDATV YHAWHEGFGT GNCRFDYMWG
INGGHGKVYS SFVVYRGLTE GGYVTMDKSS ADVKVTGGNA QYSLAGAVFG VYRDGAQVAT
ITTDGDGHGS TSDKLPNGSY TVREIEAPAG YALDGRTFPV TVSGSDAGAE MAERPVTVTL
TVAKFDDGTG RREAQGDATL DGAVYEASYA YDGGTKTVTG TTSNGTVTFE GIPLGDVTVR
ELEAPEGYLP DAEKHTLTVS AGMAGHDPVF YYEPADEFAE TVFRGGITVG KADSEEYDHR
DGDYANYAQG DATFAGAEFA VVNRSEGSIW YDANKDGKHQ DSEDFAPGAE VLRMTTSYDA
ALDAYVAKTG ERALPYGTYG VVETKSPEGY TERGKVDVTC EIREDGQVVE LTRDSGIENE
VIRGGVQVEK DDWELGKSEA IGGAGHSALD AQGHLGTSLE GIEFTVTNAS RHGVMLDGEY
VGKGEVAATI YTSWNEELGA YTAQTGPDAL PYGTYTIMET ATNDSYMLTD GKARTFEVRT
DGETVTFDKS GADLTWRDRV VRNDVHLQKK GADDSHKFAY VPFLITNVTT GEAHVAVTDR
NGALNTSAGW NSHIRDANAN DSLIGKETIT ASDVDESCGV WFGLAEDGSV SEPDDRFGAL
PYGEYTIEEL RCESNEGYGL WSDSFNVSRD STTTKFDVDL GTVDDQPGPR IGTTATDAAD
GDHEAYAGKV EIVDTVAYRN LEPGREYTVT GTLMDKATGE SVKDGGKEVT ANATFTAKDA
NGTVDVTFSF DASALAGHDV VAFETLTHEG REVAVHAEIE DEGQTVSIVP KPEIGTTAVD
ADDLDHEAGA DSKVEIRDVV AYKGLTPGKA YTLTGSLMDK ETGEPVQSGS KDVTSTVTFT
PEKADGSAAV TFSFDGRALA GHDVVAFESL KSGGAEIASH MDIEDGGQTV KLVKPEEPQK
PEEPSIGTMA TDADDGDHEA VADAEVTIVD EVAYKGLTPG EEYTVTGTLM DKETGKAVQS
GGKDVTATAS FVPDKASGSV SLSFTFDGSA LKGHDIVAFE TVSKDGAEVA VHADIDDAGQ
TVSLVEEPAT PSSPTPKGSL PKTGDAVPWI PLACLAAAAA CGIAILVLMR RKGNWIDEDD
GDIIEE