Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4071 |
Symbol | |
ID | 5714623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009958 |
Strand | + |
Start bp | 3168 |
End bp | 6497 |
Gene Length | 3330 bp |
Protein Length | 1109 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641276980 |
Product | parallel beta-helix repeat-containing protein |
Protein accession | YP_001542276 |
Protein GI | 159046607 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01965] VCBS repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.041867 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGTT CCTCCTACTT CAAGGATTAC ACTGCCCATC AAAACGCCAC CATCCTTGTT TCCAGCACAG AAGAACTCAA TGCCGCTGTG GCGGAGCTTG CTGGCACGAC CGGGGGTACG GTTCTGCTCT CGGGTGAGGC TGGTCCCTAC CGATTGGATG CGCGCAACCT CGGCGATAGT GCCGAGAGCG CGGTTCTGAT CACCTCCCAA GACTCGAACA ACCCCGCGCA ATTAAAGCAG GTCTATATCG ACAACAGTGT CTATCTCAGC CTGACCGATG TGACGATCGA TAGCGGGGAT TACGGATTCG AGCGGTCAGG CCATCTCCAT GACATCTATG TAAAGTCCGG AGCGCATCTG CAATTCGTCG ATCTGACCAT GACCAGCACT GCCGAAGCCC CCCTCGGCTT CGATGACGAC ACCGTGAAGG CCGAGGATGC CGTCTATCTG CGCAATGCCT CGGACGTTCT ATTCGCCGAC AGCACGATCT CGAATTACTA CCACGGTATC AGTTTGGCAG GGGTACGCGA TACGATCGTA ACCGGCAACG ACATCTCCGC CTTGCAGGGG GACGGTATTC GCGGCGGCGG CGTGAACAAT GTTCTGGTCT CGGACAATCA CATCCATGAT TTCTTGGGTG CGACCAACGA GTACAACCAC CCAGACATGA TCCAGTTCTG GACCGGGTTC CCAGAGGGGA TGACCAACAC CGACCTGACG ATTACCGGTA ACCTGCTCAA TGCGGGCGAA GGCTCCGCTG CTCAGGGCAT TCTCGTACAG AACGAGGCCT TCGATGACGC GGGAGACCCT TTGGAGGGAG TTTATGGTCA AAACCTGACG ATCACGGACA ATGTCATCCA TTCGGGCATG TCTAATGGCA TCGGGATCAT CGGATACCAA GGGGTCGAGG TTGCCAACAA TTCGGTGCTT TGGAACGAGG GCGCCGTGCT GTCCCAGACG GCTGATGCCG TACCGGCCTC CAACCCTCCC TGGATCTTGG TACGCAACAG CTTGGAAGTC GAGACCGCTG GCAATGTCGC TCACTTGGTC CGGATCGAAA CCGAGGACCA GGCCGAGACC AATTACTTTC TGGACTACGA TAATCCGAGT GCGCCGAATT ATGCCTATGC CCATGTGATC GGCCTTGCCG CCGATGGCCA GACGGACGTC TACGACTTGC AATTCCGGCC CGATAGCCCA CTCAACGGGG TCTCTGGCGC TGAGGCCAGC ACTTGGACGC CCAGCGAGGC CCCGGTGAAC GCCGTGATCT CCCAGCAGCC CAGTCTCGAG GACCGCGCAG CGGTGGAGTT GTCGGCCCGC TATTCGACTG TGGATGGACA ACTGGTTGAT CCCGAGACTG CACAAGTCTT CTGGACCTTG GCCGATGGCA CCGTCCTGGA GGGCCTGGAG GTCACGGTAA CTTTCGACAC GCCGGGGCTG CACGAGGTCA CGCTCGATGT CCTTGCGCCT GATGGCAGTT CCGACAGCGC CACGAGATCC GTCGACATCC GCGGAAACGG CCTGTTCGAG GCGGATTTCG CCCGTTCCGA AACCTTTGGC GACGGTGTCG AGATTGTCGA TCCCAATGGC GAAGCGTTCT CCAAGAAGGG CGTGTTCACT CTGGATGGCG ACAGTGTTTT CGAGGTGACC CGCGGAAACC CAGAGTTCGA GACGCTTCAT ACCCTGTCCA TGGACGTCTC ATTCAAACCG GACGCGTTCG AAAAAAACGG CTGGCTGGTG AAATGGCTCA AGGCGGTCGA TCTGCGGGTG ACAGACGAGA AGGGGCTATG GCTCAAAATC GAGACCGATG CCGACGTTTA CGAGATCCGC ACCGAGGGTA ACCTTCTTGA GAAGGGCGAA TGGGCTGACA TCGCGGTCGA TTTCGACAGC CATGCGGGGC AGATGCGCTT GTCTCTGGAC GGCGAGGAAC TCGGCCGGAC CGACGTCGAA GGCACCATCT CGGGGTCGCA ATACGACCTC ACCTTGGGAG AGGGCCGGGG GCGCAGCGCC GAAGGGGAGA TCGATTACTT CTCGATGACC ACCCCGCCTG CAGGATCGGT GCTTGAGATC GGAGACCGGT TTCCGGAGGA ACCAGAAGAG CCCGAAGACG AAACACCCCC GCCGGTGGGT GTGGACGATA GCTTCATCAC GGACGAGGAT GTAAAACTCT CGGGCGATGT CTTGGATAAC GATATCAATG GCCAAGGCGC TATCCTAACG GTGAGCCTAC TCAGCGATGT GTCCAACGGC ATACTACTGC TGAACAACGA CGGTACCTTT GATTACACCG CCGCACCGGA CTTTAACGGC TCCGATAGCT TTACCTACAC AGTGTCGGAT GGGTCGTTCA CTGATACGGC CACGGTCTCG ATCACAGTCA ACTATGTGAA CGACAATCCT GTCCTGACCG CAGACACGGT GACGACGAAG GAGGACATCT CGGTTAATAT CGACGTGGTG GTAAATGACA CAGATGTCGA TGGCGACACG TTGAGCGTGT CGGTGGTCGG CGCGGCTTCG AACGGAACTA CATCAATAAA CCTCGACGGA ACAGTATCTT ATACTCCTAA TCTCCATTTC TTCGGCACCG ACAGCTTTAT TTACACAGCC TACGACGGAC ATGGCGGCGT CAGGTCGTCA TCGGTTACCG TCACTGTCGA TGCGGTTAAC GACGCGCCGG TGATCGAAAT CTCCAACTTC GCGGTCGACG AAGAACAGAC AACTGTCGGT AAGATAATCG CTTCTGACGT CGAAGACCAC GATTTAGACT TTGCCATCTC GGGCGGCGCG GATGCCGACC TCTTCGACAT TACGGAAGAC GGCGAACTCA GCTTCCTCGT TGCTCCGGAT TTTGAAGCGC CATTAGATGT TGGTGGTACC TCTGGCGACA ATATCTACGA AGTCGAGGTA TCGGCCTTTG ACACTAGGGG TGCGTTGAGC AGTGCACTCT TCAACGTCGC AGTAAAGGAT GTAGATGAGG GCGTCAGGCC GATCGAGGTC CTCGGGACGG GGGACAACGA CGTGCTGACC GGAACCGTGG CAGACGAGCG CCTCATCTCA CATGGCGGCA GAATAGACCG GATGACCGGC GGTGGGGGTG CAGATGAATT TGTCTTCGGC CCTGAACTCG ACAACAACCA ACGCGAACTC GATATTATCT ACGACTACGA CACCGATGAT ACGATCATAC TGGGCTCTGA TGACTATCGA TTAATTTCTC TCGGAAATAG TGTATTGATC CATCACAACG ACGACGGCGA CTTGATCTAT GTGCTAGGAA CAGAAGCAGA GAGTTTAATG ATCGAAATTG AAAACTCCGC CGTTCTATAA
|
Protein sequence | MSGSSYFKDY TAHQNATILV SSTEELNAAV AELAGTTGGT VLLSGEAGPY RLDARNLGDS AESAVLITSQ DSNNPAQLKQ VYIDNSVYLS LTDVTIDSGD YGFERSGHLH DIYVKSGAHL QFVDLTMTST AEAPLGFDDD TVKAEDAVYL RNASDVLFAD STISNYYHGI SLAGVRDTIV TGNDISALQG DGIRGGGVNN VLVSDNHIHD FLGATNEYNH PDMIQFWTGF PEGMTNTDLT ITGNLLNAGE GSAAQGILVQ NEAFDDAGDP LEGVYGQNLT ITDNVIHSGM SNGIGIIGYQ GVEVANNSVL WNEGAVLSQT ADAVPASNPP WILVRNSLEV ETAGNVAHLV RIETEDQAET NYFLDYDNPS APNYAYAHVI GLAADGQTDV YDLQFRPDSP LNGVSGAEAS TWTPSEAPVN AVISQQPSLE DRAAVELSAR YSTVDGQLVD PETAQVFWTL ADGTVLEGLE VTVTFDTPGL HEVTLDVLAP DGSSDSATRS VDIRGNGLFE ADFARSETFG DGVEIVDPNG EAFSKKGVFT LDGDSVFEVT RGNPEFETLH TLSMDVSFKP DAFEKNGWLV KWLKAVDLRV TDEKGLWLKI ETDADVYEIR TEGNLLEKGE WADIAVDFDS HAGQMRLSLD GEELGRTDVE GTISGSQYDL TLGEGRGRSA EGEIDYFSMT TPPAGSVLEI GDRFPEEPEE PEDETPPPVG VDDSFITDED VKLSGDVLDN DINGQGAILT VSLLSDVSNG ILLLNNDGTF DYTAAPDFNG SDSFTYTVSD GSFTDTATVS ITVNYVNDNP VLTADTVTTK EDISVNIDVV VNDTDVDGDT LSVSVVGAAS NGTTSINLDG TVSYTPNLHF FGTDSFIYTA YDGHGGVRSS SVTVTVDAVN DAPVIEISNF AVDEEQTTVG KIIASDVEDH DLDFAISGGA DADLFDITED GELSFLVAPD FEAPLDVGGT SGDNIYEVEV SAFDTRGALS SALFNVAVKD VDEGVRPIEV LGTGDNDVLT GTVADERLIS HGGRIDRMTG GGGADEFVFG PELDNNQREL DIIYDYDTDD TIILGSDDYR LISLGNSVLI HHNDDGDLIY VLGTEAESLM IEIENSAVL
|
| |