Gene Information |
Locus tag | Dshi_1334 |
Symbol | |
ID | 5711892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1386187 |
End bp | 1387725 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641267246 |
Product | tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_001532677 |
Protein GI | 159043883 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
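The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (which matches the reported gene length) and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1386187, 1387725)
print(length_bp)                  # 1539
print(protein_length(length_bp))  # 512
```

Both results agree with the Gene Length (1539 bp) and Protein Length (512 aa) rows of the record.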
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.278248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0416734 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGGTT CCTGGCTTTC AAAGGTATTG TTTCCTATAA CGATCTGCAC GCTGATTGGT GGGCTTTCAT TGCAGGATGC CTTTGCCCGG TCATTACAGA TTGACGATCT GACCAGGGAG CGGGAACAGA GTAGCATCCT GCGCAGTCTG CAACTTGACG CCTTCAAGGA TCCGGAGCTT CTCCAACTTT TCAAAGACGC CTCCAGCGCC TTGCGGGAGG AGCGGTACGA GGACGCAGAG GCCGCAGCCC TCAGCCTGAC AGAAAAGGCA CCAGACGCGC CACAAGGCTG GCACATGTTG GGAATGGCCC GCGTGAAAAA CGGGTCCACT TCCGGTGCGC TTCACGCCCT AGACGCCGCG GCAGCGCGTT ACAAGAACAA CTCGGACCCT TTGGTCATCA AGGGAGATGT GCTGCGCGGG CTGGGTCGGT TTGACGATGC CGAGGCCGCT TATCGTCGCG CCGTCGAGAT TAATCCGGAC GATCTGCGCG GGGTCGAGGG GCTTGGTGCG ATGTTAGAGG CTGGGGGCAA AGCGGACGAA GCGATCACCG TCTATCAATC TGCTCTTGCA ACGAACCCTT CGGATATCGC CTTCGCGTTG AATACCGCGC GCCTCCAGGT GGCACTGGAC GATCTTTCCG CCGCACGGGA GACCCTAAAG AGGTTCGAGG CCTCAAATCC CGAAAACTCC CAGGTCAAAA TTGCACTCGG GCGAGTTGAA TTCCTTTCTG GAAACAATGA AGCCGCGACG GCCTATTTCG ATCAGGCGAT CCGCATCGCG CCCGAGAATT CGACCCAGTA TCTGCTAAAG GCACGCGCCC AGCTGAATGC AACCGAGTTT GCAGCGGCAG AAGAAACATT GCGCGGCGCG CGTGAGATCT TTCCGGATGA TCCGCAAGTG CCCTTTGAAC TCGCCAGTCT GTACGGAGTC ACGCGCGACT ACGCCCGGGC TGCCGAAGTG TTCGAGGCAG GCGCCGAAAG GTGGCCTGAT ATCATGGGAT TCCACTCTGG ACTCCTCAGG GCCTCTTATC GCCTGAATGA TTTTGATGCG GCCCTGTCCG CTGCAAGAGT GCTTTCAAGC CAGCCCGGCG CCTCTGCCGT CGACCACATC TGGCATGCAC TCGTCCTCGA GAGGCTCGAG CGGGTCGATA GCGCGATCAC CTCGTATGAT ACTGCACTGG AGCTTGATCC AACAAACTGG CTTGCCGCCA ACAATCTGGC AAACCTTTTG TTTGACCGTG CCCCAGACAG GGCATTGGAG CTTGCACGGC TGGCTCATCG GACAGCGTCC GAAAACATCT CGGTAAACAG GACACTCGCC TGGGCAGAAT TTTCCGCCGG CAACACCGAG GCCGCCCTCG CGCTCTACGA CTCTCTGACG CCAGAAGCTT CCGATGATCC GATCCAACTC TTCCGGCACG GGCAGGTTCT GATTGAAGCC GAAGAGGCCG AAGCAGGCCG CATCCTCATC CAAAAAGCTC TTGCACTTGA TGCTGAATTC AGGGGAGCAG AAGATGCTCG GAAGCTGCTC GCGGAATGA
|
Protein sequence | MFGSWLSKVL FPITICTLIG GLSLQDAFAR SLQIDDLTRE REQSSILRSL QLDAFKDPEL LQLFKDASSA LREERYEDAE AAALSLTEKA PDAPQGWHML GMARVKNGST SGALHALDAA AARYKNNSDP LVIKGDVLRG LGRFDDAEAA YRRAVEINPD DLRGVEGLGA MLEAGGKADE AITVYQSALA TNPSDIAFAL NTARLQVALD DLSAARETLK RFEASNPENS QVKIALGRVE FLSGNNEAAT AYFDQAIRIA PENSTQYLLK ARAQLNATEF AAAEETLRGA REIFPDDPQV PFELASLYGV TRDYARAAEV FEAGAERWPD IMGFHSGLLR ASYRLNDFDA ALSAARVLSS QPGASAVDHI WHALVLERLE RVDSAITSYD TALELDPTNW LAANNLANLL FDRAPDRALE LARLAHRTAS ENISVNRTLA WAEFSAGNTE AALALYDSLT PEASDDPIQL FRHGQVLIEA EEAEAGRILI QKALALDAEF RGAEDARKLL AE
|
| |
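The sequence fields can be processed with a few lines of code. The sketch below is a minimal illustration, not the database's own method: it strips the 10-character grouping spaces, computes a GC fraction, and translates a coding-strand sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on translation table 11 using the standard codon assignments, differing only in which codons may act as starts. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the grouping whitespace used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the gene sequence, copied from the record above.
first_30_nt = "ATGTTCGGTT CCTGGCTTTC AAAGGTATTG"
print(translate(first_30_nt))  # MFGSWLSKVL
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length rows to be checked the same way.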