Gene Shel_04070 details


Gene Information

Locus tag: Shel_04070
Symbol:
ID: 8394299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Slackia heliotrinireducens DSM 20476
Kingdom: Bacteria
Replicon accession: NC_013165
Strand:
Start bp: 483545
End bp: 486643
Gene Length: 3099 bp
Protein Length: 1032 aa
Translation table: 11
GC content: 64%
IMG OID: 644985171
Product: TPR repeat-containing protein
Protein accession: YP_003142817
Protein GI: 257063145
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
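
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 483545
END_BP = 486643


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 3099 1032
```

Both values agree with the Gene Length and Protein Length fields in the record.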


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.350939
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGATG CAGCCCATCT GTTCATAATG GAGGCCGCGC CGGAATCTCC GGTTTCGGAC 
CTCTTGGGCC GCCGCGTGTC CGTATCATCT GGCCGTATCG GCGAAGACGC TCCGCTGGAA
GAAATGCCCT CGTTTGAATC AGACGGCGTC GAGCTGCGCG TGAAGGCGTA CCGCGTCCGC
GTGTCGAGCG TCGGAGCCGA TCCGTATGCT CTAGCTATGA GCACGGCACG GGCCGAATAC
GAGATGCTCG CGGCGCTCCC GACGACGGAC GGGCTTGCGG CCTGCGCGCT CGGCGTGCTG
CAGACGGCCA AGGGCGACCA CCCCTGCGTG GTGTTCCAGG TCGCCGGGGC GTCGGACGGC
GGAACGGGCG AAACGACCGA AGCCGGGGCC GCAGCTCAGG CTCAGGCCGA AACGGGAGCT
CAAGCTGAAG CCGCTGCCGG CGCCGAGACA GAAGCTGCGA CCGAAACCCA ACCCTCTGCC
CGGGACCACT TCGAAGCGGG CCTTCGCGCC GAAATCGGCG AAGGAGTTGA GCGCAACATC
GGCACGGCCC TCTCCCATTA TCTTAAAGGC GCCGACCTGG GCGACCCCGA TTGCCAATAC
CGTGCGGCGT ACCTTTTGGA CACGGAAGAA TCCCTCGCGT TCCAAAGGGG TCGGTGCGCC
GCGCTGTACG AAAGCGCCGC ATCGTCCGGC AATGCGGACG CCCTGTCCAA CCTGGCTCTT
TTCTATCTGT CGGGCGACTT GGTCGGACGC AACCCGGTCC GGGCAGCCGA ACTTATGGAA
TCCGCCGCGA AGCTTGGCAA CGCAGCGGCC CAATACAACC TGGCGATCAT CTACCGCGAC
GGCGAAGACG GCGTTCCCGC CGATCTGAAT CGGGCGATTC CCCTGTTCAA GGCGGCAGCC
GAGCAAGGCG ATGCAGATGC CGCCCTGGCG GTGGCCGATG CATGCGCGCA AGGGGAAGGC
GCCGTCAAGA ATCCGAAGGA AGCGGCGCGG TGGTACCGCA AGGCCGCCGA GGCGGGCCGC
ATGGATGCCA TGTACGAGCT GGGTCTTCTG TACGAGCGGG GCAACGGCGT GACCGAAAAC
CGTCGGGAAG CGGTCAGTTG GTACCGCAAG GCCGCAGACG CGGGCAACGC GGATGCCATG
TTCCGGCTGG CGTCCATACG CCTGCACGGC AACGGCGCGA AAAAGGATCT CGCCGAGGCC
TTCGACCTGT TCAAAAGGGC CGCCGAGGCA GGCCATCCCC AGGCCATGTT CAACACAGGC
GTCATGTACG CCCATGGCGA CGGCGTCAAA AAGGACGCGA CGGAAGCCGC AAGCTGGTAC
CGCAAGGCGG CGGATGCGGG CGTCACAGGC GCGATGTGCA ATCTGGGCAT CATGCATGAA
CGCGGCGACG GCGTGGCCAA GGATCCGCAG GAAGCGGCCA GCCTGTACCG GAAGGCGTCG
GACCTGGACA ATGCGCTGGG CGCATACAAC CTGGGCATCA TGCTCCTCAA CGGTTCCGGC
GTGGCGAAAA ACCCGCAGGA AGCCGCACTC CACCTTCGCC GGGCGGCGGC GTTGGGCAAT
ACGGAAGCCA TGATCAAGAT GGGCGAAGCG TATGAGTCGG GTGAAGGGGT TCGCAAGAAC
AAGAAATCGG CGGTCAAGTT CTACAGGGAT GCGGCGTCGC AGGGCAACAC CGAAGCGATG
TGCAAACTGG GCGCACTGTA CGAAGAAGGC AGCGGCGTGG ATCGCAACCG GCAGGAAGCC
GCCGAGTGGT ACCGGAAGGC CGCGAAGCTC GGCAGCACCG AGGCGACGTG CGCCCTTGGA
AAGCTGTGCC GCAAACATGA CGCGTCGACG GCATTCGGGC TGTTCGAGTC CGCCGCGAAA
GAGGGCAACG CCGAAGCGAT GGGCATCCTG GCCGACATGC TTTCCCAAGG AGAAGGGACC
GGGGCGGACC GCCAGACGGC CCTGCTCTGG TACTGCAAAG CAGCCGACGC CGGCAACGCC
GAGGCCATGT ACAACCTGGG CGTCAAGTGC GCCAACGGCA TAGACGTCGA AAAAGACCAG
CAGAAGGCGA TAGGCTGGTA CCGCAAGGCG GCCGATGCGG GCCATGCAGC CGCCATGTGC
AGCCTGGGCA CCATATGCGA ATACGGCAAC GGCGTGACAA AGAACCTGGC CCAGGCGGTG
AAATGGTACC GCGACGCTGC GAACCTGGGC AACCCGAACG CCATGTACAA CCTGGCCGTC
AGGCTCGCGA ACGGCGGAGG CGTGAAGAAG AACGCCAAGC AGGCAGCGAA CTGGTACCGC
AAGGCGGCGG ATGCGGGGCA TGCACCGGCC ATGAACAGCC TGGGCCTCAT GTACGAGCAG
GGCGAAGGCG TGGCCAAGAA CCACGCCGAA GCGATGCGCT GGTTCCGCAA GGCGGCCGAT
GCGGGCAACG TCATGGCGAT GTGCAATATG GGACGCATGC TCTCAACCGG CAAGGAAGCT
TCGAAAAACC TGATGGAAGC GGCACAGTGG TACCGCAAGG CCGCCGAATT CGGCGAGACG
GAATCCATGT ACAACCTTGG GCGCATGCTT GCCAACGGCC AAGGGACCGG GAAGAACCCT
TTGGAGGCTG CGCAGTGGTT CCGCAGGGCG GCCGAAGACG GGCACGAGCT TGCCATGTAC
CACCTGGGCG TCATGTATGC CAACGGCGAA GGCGTGGCAA GAAACCCCCA CGAGGCTTTG
ACCTGGTATA GAAAGGCCGC AGACCTTGGA AACGCCAACG CCATGTACAA CCTGGGCGTC
ATGCTTGCAG GCGGCATAGG CGTGGAGAGG AATCCGCAGC AAGCGGCGCG TTGGTACCGC
AAGGCCATCG GCAAGGGACA TGTGGCCGCC ATGAACAACC TGGCGCTCAT GTACGAGCGC
GGCGAAGGCG TCGAGAAGAA CCTCAAAGAG GCGGTCAGCT GGTGGAAGAT CGCCGCCAAG
AAGGGGTCGC CGAACGCCAT GTACAACCTG GCCCGCATGT ACGAATCGGG CCAAGGCGTG
GCGAAGGACA AGAAGGAAGC CCAGAACTGG TATAGGAAAG CCGCAAGCTA CGGGCAGGCA
GGCGCGCAAC TCTGGATGAA GAAGCACCGG CTGGTTTAG
 
Protein sequence
MRDAAHLFIM EAAPESPVSD LLGRRVSVSS GRIGEDAPLE EMPSFESDGV ELRVKAYRVR 
VSSVGADPYA LAMSTARAEY EMLAALPTTD GLAACALGVL QTAKGDHPCV VFQVAGASDG
GTGETTEAGA AAQAQAETGA QAEAAAGAET EAATETQPSA RDHFEAGLRA EIGEGVERNI
GTALSHYLKG ADLGDPDCQY RAAYLLDTEE SLAFQRGRCA ALYESAASSG NADALSNLAL
FYLSGDLVGR NPVRAAELME SAAKLGNAAA QYNLAIIYRD GEDGVPADLN RAIPLFKAAA
EQGDADAALA VADACAQGEG AVKNPKEAAR WYRKAAEAGR MDAMYELGLL YERGNGVTEN
RREAVSWYRK AADAGNADAM FRLASIRLHG NGAKKDLAEA FDLFKRAAEA GHPQAMFNTG
VMYAHGDGVK KDATEAASWY RKAADAGVTG AMCNLGIMHE RGDGVAKDPQ EAASLYRKAS
DLDNALGAYN LGIMLLNGSG VAKNPQEAAL HLRRAAALGN TEAMIKMGEA YESGEGVRKN
KKSAVKFYRD AASQGNTEAM CKLGALYEEG SGVDRNRQEA AEWYRKAAKL GSTEATCALG
KLCRKHDAST AFGLFESAAK EGNAEAMGIL ADMLSQGEGT GADRQTALLW YCKAADAGNA
EAMYNLGVKC ANGIDVEKDQ QKAIGWYRKA ADAGHAAAMC SLGTICEYGN GVTKNLAQAV
KWYRDAANLG NPNAMYNLAV RLANGGGVKK NAKQAANWYR KAADAGHAPA MNSLGLMYEQ
GEGVAKNHAE AMRWFRKAAD AGNVMAMCNM GRMLSTGKEA SKNLMEAAQW YRKAAEFGET
ESMYNLGRML ANGQGTGKNP LEAAQWFRRA AEDGHELAMY HLGVMYANGE GVARNPHEAL
TWYRKAADLG NANAMYNLGV MLAGGIGVER NPQQAARWYR KAIGKGHVAA MNNLALMYER
GEGVEKNLKE AVSWWKIAAK KGSPNAMYNL ARMYESGQGV AKDKKEAQNW YRKAASYGQA
GAQLWMKKHR LV
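
The protein sequence above can be checked against the gene sequence by translating it with the codon assignments of translation table 11. The following is a minimal standard-library sketch, not the tool used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the code does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene starts with ATG.

```python
# Codon-to-residue string in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGCGGGATG CAGCCCATCT GTTCATAATG GAGGCCGCGC CGGAATCTCC GGTTTCGGAC"
print(translate(first_line))  # prints: MRDAAHLFIMEAAPESPVSD
```

The first line of the gene sequence translates to the first 20 residues of the listed protein. Passing the full gene sequence to `translate` and `gc_fraction` would let a reader compare the result with the Protein sequence and GC content fields.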