Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1919 |
Symbol | |
ID | 5712912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2002691 |
End bp | 2004421 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641267844 |
Product | hypothetical protein |
Protein accession | YP_001533262 |
Protein GI | 159044468 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000840766 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 7.47349e-16 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGCAGTTA ACTCGCAGTT CCTGTTTTTC TATATCGATG AACTTGTTAA ACACCACTGG GTAGACTGGT TGGAAGGCCC GTTGGCCAAA GTGCCATCCG AAGAGCACAG AGGCGAGAGA GGATTCCAGC TGGCGGTCGG GGACACTGGA GCAGGTCCTC GAACTCGTGC TGGAGAAGCT TATCAACTCG GTATTCTCTC CGGTCGCATT GTTCGTCCTC GTATTGCGGG GGCGTCTTTC CCATCTGGTT GGCAGGCGGT GATTGAGGGT CGCGTTCCCT CCACTAAGCT TGGCTACTAT TCCTATGTCA TCGGAAGGGC GCTTCGCGAG AATCCTTCTG ATCCGAATCT TGATCGAGCA CTAGACTGCC TCGCAAGTAT TCTTATCCTA CACATAGGCT TTCAATCTGA TGGTCGCTGG TTCAATACAG TACTGCTTTT CCAGTATGCC AATAATCAAC TCTCAAGCTA CTTGGGACGA CAGCTGAACG ATGATGAGGT CAGATATCTA GCTGCTGCGG TAATCAATAC ACTCGACAGT GAGGAGAATG ATCTACTTTT GGATCGCTAT GTTGCTGACC TACCAATAGA GAGCCTCCGC TTCGATTTGA ACGATGGAAA GTCTTTGCAT TTTGACCTAA ACGGACTTCC CGATATTGTT TCAGACAAGA AAGTAGGGAT AAACGATTTT CAGAAAACTA AAGCAGCTCT CTTCTGGGCG GATCCTTCTC GCGGAATCTA TATGCGCGCG GCGAGATCGC GAGGAATTCG CCTAATTCAA GAATTGATCA AAGTAGATCA GAGGCTTGCT CAATCAGGCG AGAGCAATTT ACTCAACGAA GTTTTGGATT CCATTCAAAA CAACGAAAAT AATCCGGGCA TTTTTTCGGT TGCAGAAGAC GCATTTACAG ATTTTCCGGG TCTGCTGCGC GAATGGCAAG ATCTTGAGGC CGAGCTTTTT GACGCAACCA ACAAAGCCGT TGAAGCGGTC GACTTGGACA CAGCGCCCCC GATCTGGCTA GCTTCGCAGC ATGAGATACG TGGGTCGGTG CCAGCTGGGT CCGACGACGA AAGCGGAAGC GATGAAGTAG ATGAGGTTGA GGCAGAAACT GCCGCAATTG AAGGCTCAAC TACTGCTACT GACGAAGTTT CCATAGCTAG CGGTACTCAA GAAAAGAAAC CAAGCAAGAA AATTGAATTG CCGACGAAAG AAGCTCTTCG AGAATTTATC GAGAGCGAGC TGGACGAAAT TTCGGAAGCT CCCCCAGTGA CTGCTGCACC TACAGAAGTC TCAGGAAAAA AAGGAAAGCC TCGAGCGACA CAAACAGACT TTGCTGCGAA AGAAGCGCGA AACCGCAAGC TCGGTGAAGC AGGCGAGTAC TTTGTATTCC AATATGAAGT TATGAAACTC ACCGCCGCGG GCAGAGTGGA TCTGGCCAAG CGCGTCAAGT GGGTGTCAAA GGATATAGGC GATGGCCTTG GCTATGATAT TCGATCTTTC GATCAAGATG GCAATGAGGT TTTTCTTGAG GTGAAGACAA CGAATAGCGG AAGAGCAACA CCATTTTTTG TATCTAACAA CGAAGTTGCT GTTTCGGAAG AAAAAGGAGA CTCCTACCGT CTAGTAAGAG TGTTTAATTT TTCGAAGAAA CCGAGGTTCT TTTCGTTAAC AGGAAGCTTG TCCGAAGTGC TTCAGCTCGA AGCAACGTCA TATCGAGCTC GAGTGGTATA G
|
Protein sequence | MAVNSQFLFF YIDELVKHHW VDWLEGPLAK VPSEEHRGER GFQLAVGDTG AGPRTRAGEA YQLGILSGRI VRPRIAGASF PSGWQAVIEG RVPSTKLGYY SYVIGRALRE NPSDPNLDRA LDCLASILIL HIGFQSDGRW FNTVLLFQYA NNQLSSYLGR QLNDDEVRYL AAAVINTLDS EENDLLLDRY VADLPIESLR FDLNDGKSLH FDLNGLPDIV SDKKVGINDF QKTKAALFWA DPSRGIYMRA ARSRGIRLIQ ELIKVDQRLA QSGESNLLNE VLDSIQNNEN NPGIFSVAED AFTDFPGLLR EWQDLEAELF DATNKAVEAV DLDTAPPIWL ASQHEIRGSV PAGSDDESGS DEVDEVEAET AAIEGSTTAT DEVSIASGTQ EKKPSKKIEL PTKEALREFI ESELDEISEA PPVTAAPTEV SGKKGKPRAT QTDFAAKEAR NRKLGEAGEY FVFQYEVMKL TAAGRVDLAK RVKWVSKDIG DGLGYDIRSF DQDGNEVFLE VKTTNSGRAT PFFVSNNEVA VSEEKGDSYR LVRVFNFSKK PRFFSLTGSL SEVLQLEATS YRARVV
|
| |