Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shel_04070 |
Symbol | |
ID | 8394299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | - |
Start bp | 483545 |
End bp | 486643 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644985171 |
Product | TPR repeat-containing protein |
Protein accession | YP_003142817 |
Protein GI | 257063145 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.350939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGATG CAGCCCATCT GTTCATAATG GAGGCCGCGC CGGAATCTCC GGTTTCGGAC CTCTTGGGCC GCCGCGTGTC CGTATCATCT GGCCGTATCG GCGAAGACGC TCCGCTGGAA GAAATGCCCT CGTTTGAATC AGACGGCGTC GAGCTGCGCG TGAAGGCGTA CCGCGTCCGC GTGTCGAGCG TCGGAGCCGA TCCGTATGCT CTAGCTATGA GCACGGCACG GGCCGAATAC GAGATGCTCG CGGCGCTCCC GACGACGGAC GGGCTTGCGG CCTGCGCGCT CGGCGTGCTG CAGACGGCCA AGGGCGACCA CCCCTGCGTG GTGTTCCAGG TCGCCGGGGC GTCGGACGGC GGAACGGGCG AAACGACCGA AGCCGGGGCC GCAGCTCAGG CTCAGGCCGA AACGGGAGCT CAAGCTGAAG CCGCTGCCGG CGCCGAGACA GAAGCTGCGA CCGAAACCCA ACCCTCTGCC CGGGACCACT TCGAAGCGGG CCTTCGCGCC GAAATCGGCG AAGGAGTTGA GCGCAACATC GGCACGGCCC TCTCCCATTA TCTTAAAGGC GCCGACCTGG GCGACCCCGA TTGCCAATAC CGTGCGGCGT ACCTTTTGGA CACGGAAGAA TCCCTCGCGT TCCAAAGGGG TCGGTGCGCC GCGCTGTACG AAAGCGCCGC ATCGTCCGGC AATGCGGACG CCCTGTCCAA CCTGGCTCTT TTCTATCTGT CGGGCGACTT GGTCGGACGC AACCCGGTCC GGGCAGCCGA ACTTATGGAA TCCGCCGCGA AGCTTGGCAA CGCAGCGGCC CAATACAACC TGGCGATCAT CTACCGCGAC GGCGAAGACG GCGTTCCCGC CGATCTGAAT CGGGCGATTC CCCTGTTCAA GGCGGCAGCC GAGCAAGGCG ATGCAGATGC CGCCCTGGCG GTGGCCGATG CATGCGCGCA AGGGGAAGGC GCCGTCAAGA ATCCGAAGGA AGCGGCGCGG TGGTACCGCA AGGCCGCCGA GGCGGGCCGC ATGGATGCCA TGTACGAGCT GGGTCTTCTG TACGAGCGGG GCAACGGCGT GACCGAAAAC CGTCGGGAAG CGGTCAGTTG GTACCGCAAG GCCGCAGACG CGGGCAACGC GGATGCCATG TTCCGGCTGG CGTCCATACG CCTGCACGGC AACGGCGCGA AAAAGGATCT CGCCGAGGCC TTCGACCTGT TCAAAAGGGC CGCCGAGGCA GGCCATCCCC AGGCCATGTT CAACACAGGC GTCATGTACG CCCATGGCGA CGGCGTCAAA AAGGACGCGA CGGAAGCCGC AAGCTGGTAC CGCAAGGCGG CGGATGCGGG CGTCACAGGC GCGATGTGCA ATCTGGGCAT CATGCATGAA CGCGGCGACG GCGTGGCCAA GGATCCGCAG GAAGCGGCCA GCCTGTACCG GAAGGCGTCG GACCTGGACA ATGCGCTGGG CGCATACAAC CTGGGCATCA TGCTCCTCAA CGGTTCCGGC GTGGCGAAAA ACCCGCAGGA AGCCGCACTC CACCTTCGCC GGGCGGCGGC GTTGGGCAAT ACGGAAGCCA TGATCAAGAT GGGCGAAGCG TATGAGTCGG GTGAAGGGGT TCGCAAGAAC AAGAAATCGG CGGTCAAGTT CTACAGGGAT GCGGCGTCGC AGGGCAACAC CGAAGCGATG TGCAAACTGG GCGCACTGTA CGAAGAAGGC AGCGGCGTGG ATCGCAACCG GCAGGAAGCC GCCGAGTGGT ACCGGAAGGC CGCGAAGCTC GGCAGCACCG AGGCGACGTG CGCCCTTGGA AAGCTGTGCC GCAAACATGA CGCGTCGACG GCATTCGGGC TGTTCGAGTC CGCCGCGAAA GAGGGCAACG CCGAAGCGAT GGGCATCCTG GCCGACATGC TTTCCCAAGG AGAAGGGACC GGGGCGGACC GCCAGACGGC CCTGCTCTGG TACTGCAAAG CAGCCGACGC CGGCAACGCC GAGGCCATGT ACAACCTGGG CGTCAAGTGC GCCAACGGCA TAGACGTCGA AAAAGACCAG CAGAAGGCGA TAGGCTGGTA CCGCAAGGCG GCCGATGCGG GCCATGCAGC CGCCATGTGC AGCCTGGGCA CCATATGCGA ATACGGCAAC GGCGTGACAA AGAACCTGGC CCAGGCGGTG AAATGGTACC GCGACGCTGC GAACCTGGGC AACCCGAACG CCATGTACAA CCTGGCCGTC AGGCTCGCGA ACGGCGGAGG CGTGAAGAAG AACGCCAAGC AGGCAGCGAA CTGGTACCGC AAGGCGGCGG ATGCGGGGCA TGCACCGGCC ATGAACAGCC TGGGCCTCAT GTACGAGCAG GGCGAAGGCG TGGCCAAGAA CCACGCCGAA GCGATGCGCT GGTTCCGCAA GGCGGCCGAT GCGGGCAACG TCATGGCGAT GTGCAATATG GGACGCATGC TCTCAACCGG CAAGGAAGCT TCGAAAAACC TGATGGAAGC GGCACAGTGG TACCGCAAGG CCGCCGAATT CGGCGAGACG GAATCCATGT ACAACCTTGG GCGCATGCTT GCCAACGGCC AAGGGACCGG GAAGAACCCT TTGGAGGCTG CGCAGTGGTT CCGCAGGGCG GCCGAAGACG GGCACGAGCT TGCCATGTAC CACCTGGGCG TCATGTATGC CAACGGCGAA GGCGTGGCAA GAAACCCCCA CGAGGCTTTG ACCTGGTATA GAAAGGCCGC AGACCTTGGA AACGCCAACG CCATGTACAA CCTGGGCGTC ATGCTTGCAG GCGGCATAGG CGTGGAGAGG AATCCGCAGC AAGCGGCGCG TTGGTACCGC AAGGCCATCG GCAAGGGACA TGTGGCCGCC ATGAACAACC TGGCGCTCAT GTACGAGCGC GGCGAAGGCG TCGAGAAGAA CCTCAAAGAG GCGGTCAGCT GGTGGAAGAT CGCCGCCAAG AAGGGGTCGC CGAACGCCAT GTACAACCTG GCCCGCATGT ACGAATCGGG CCAAGGCGTG GCGAAGGACA AGAAGGAAGC CCAGAACTGG TATAGGAAAG CCGCAAGCTA CGGGCAGGCA GGCGCGCAAC TCTGGATGAA GAAGCACCGG CTGGTTTAG
|
Protein sequence | MRDAAHLFIM EAAPESPVSD LLGRRVSVSS GRIGEDAPLE EMPSFESDGV ELRVKAYRVR VSSVGADPYA LAMSTARAEY EMLAALPTTD GLAACALGVL QTAKGDHPCV VFQVAGASDG GTGETTEAGA AAQAQAETGA QAEAAAGAET EAATETQPSA RDHFEAGLRA EIGEGVERNI GTALSHYLKG ADLGDPDCQY RAAYLLDTEE SLAFQRGRCA ALYESAASSG NADALSNLAL FYLSGDLVGR NPVRAAELME SAAKLGNAAA QYNLAIIYRD GEDGVPADLN RAIPLFKAAA EQGDADAALA VADACAQGEG AVKNPKEAAR WYRKAAEAGR MDAMYELGLL YERGNGVTEN RREAVSWYRK AADAGNADAM FRLASIRLHG NGAKKDLAEA FDLFKRAAEA GHPQAMFNTG VMYAHGDGVK KDATEAASWY RKAADAGVTG AMCNLGIMHE RGDGVAKDPQ EAASLYRKAS DLDNALGAYN LGIMLLNGSG VAKNPQEAAL HLRRAAALGN TEAMIKMGEA YESGEGVRKN KKSAVKFYRD AASQGNTEAM CKLGALYEEG SGVDRNRQEA AEWYRKAAKL GSTEATCALG KLCRKHDAST AFGLFESAAK EGNAEAMGIL ADMLSQGEGT GADRQTALLW YCKAADAGNA EAMYNLGVKC ANGIDVEKDQ QKAIGWYRKA ADAGHAAAMC SLGTICEYGN GVTKNLAQAV KWYRDAANLG NPNAMYNLAV RLANGGGVKK NAKQAANWYR KAADAGHAPA MNSLGLMYEQ GEGVAKNHAE AMRWFRKAAD AGNVMAMCNM GRMLSTGKEA SKNLMEAAQW YRKAAEFGET ESMYNLGRML ANGQGTGKNP LEAAQWFRRA AEDGHELAMY HLGVMYANGE GVARNPHEAL TWYRKAADLG NANAMYNLGV MLAGGIGVER NPQQAARWYR KAIGKGHVAA MNNLALMYER GEGVEKNLKE AVSWWKIAAK KGSPNAMYNL ARMYESGQGV AKDKKEAQNW YRKAASYGQA GAQLWMKKHR LV
|
| |