Gene Information |
Locus tag | Dshi_3417 |
Symbol | |
ID | 5712475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3594961 |
End bp | 3596388 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641269346 |
Product | hypothetical protein |
Protein accession | YP_001534751 |
Protein GI | 159045957 |
COG category | [S] Function unknown |
COG ID | [COG4223] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
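The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes the stop codon; the record does not state either convention explicitly, although its numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3594961
END_BP = 3596388
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, ASSUMING its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def record_is_consistent() -> bool:
    """True when coordinates, gene length and protein length agree."""
    return (
        span_length(START_BP, END_BP) == GENE_LENGTH_BP
        and expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    )


if __name__ == "__main__":
    print(record_is_consistent())
```

Under these assumptions the span 3594961 to 3596388 covers 1428 bp, which is 476 codons, or 475 residues plus a stop codon.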
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCAGATC CGAAGACCCC AAAAGACACC CCGGAAAATC AGGTTTCGGA CACAGATCAA CCTCAGGAGA GCCCGAAGCC CGACACGGAC ACCCCCGAGA CAGACCAGAT TGAAGACGCC GAAATCCTGG ACGAACAGCC CGGCGAAGTG GACACCGCCA AAGACAGGAA TAGCGAGACT GCGCAGGACC CGGTCGAGCC TGTCGGCGAG ACCAGCGCCA GCGACGAGAC CACGTCCGAG GCTGCGCGCA ATGACACGGA CGCGGACACC GCGGCAACCA TGGAAGAGAC ACGAGCCGAC GAGATAGATC GGAGCCATGA GACCCCGTCC TCAGACCCCG AACCCGCCTC CGAGCCGCAA GCAGCCGAAC CGGTGGAGCG AGTCGTCGAA AAGAAGGGGG GCTTCATGGG CCCATTCCTC GGCGGGGTCG TCGCGGCCGG CATAGGGTTC GGTCTGTGCT ATTACCTCGT GGATCAGGGC ATTCTCGCGT CCGGTGACCC AGACCCCTTC GCAGCGGAAC GCGCGCAAAT CAGCAACCTC GAAAACCAGA TCGCGGCGAT GCAGAGCGAG ATCGCAGCGG CAATAGAGGC TGGCAGCGAG GATCCTCGGA TGGACGCAGT GGTCGGGTCG GTTGAATCAG TCGAAACAGC CCTGGCGGAG ATTCAGGGCG AAGTCGGCGC GGTTCAGTCC GAGATTTCGG CGCAGTCGGA CATTCTCGCA AATCTCGAAT CGCAAATGGA GGCGATCGCT GCCCTGCCCG AAGGCACCGG CTCTGCCGAC ACCGCTGCCA TGGCGGCGTT GCAGGCGACT TTGGCGCAAC AGCAAGCCGA GAACGAGGCA ATGCAGGCCC AGCTGGCCGA GATGGCCGCC GCCGCAGAGG CGGAAATGGA GCAGGTCCGC GCCCAAGCGG GTGCCCTGCA AAACGAAACG CAAGCGGCAG TCGATGCAGC CACCAACCGC GCGGCGCTGG CGAATATCGC CGCCGCACTG GAAAACGGTG CACCCCTTGC CGCTTCGCTG GACAACCTCA CGGTGGAAGC GCCCGAGGCG CTCTCGGCAG TATCAGCTTC CGGGGTCGAA ACGCTTCTGG ACCTGCAACG GCAATTCCCG GCAGCGGCGC GGGCTGGCTT GGCGGAGTCG CTGAAAGCCA CGGTCAGTGA CGATCCGGTT GACCGTGCCG TAGCGTTCCT TCGGGCACAG GTCGGTGCTC GGTCGCTTGA GCCGCGGGAG GGCGATGACC CGGATGCCGT CCTGTCACGG GCACAAGAAG CCGTCAGCGC CGGGCAGCTC GAAGCCGCGC TTGCCGAGAT CTCCACACTT CCCGACGCCG GTCAGGCCGC AATGGCGCCT TGGATCGGGG CCGCCGAGGC CCGTGTCGCC GCGCTCGCCG CGTTCGACAC GCTGGCCGCT GAACTGAACT CCAACTGA
|
Protein sequence | MSDPKTPKDT PENQVSDTDQ PQESPKPDTD TPETDQIEDA EILDEQPGEV DTAKDRNSET AQDPVEPVGE TSASDETTSE AARNDTDADT AATMEETRAD EIDRSHETPS SDPEPASEPQ AAEPVERVVE KKGGFMGPFL GGVVAAGIGF GLCYYLVDQG ILASGDPDPF AAERAQISNL ENQIAAMQSE IAAAIEAGSE DPRMDAVVGS VESVETALAE IQGEVGAVQS EISAQSDILA NLESQMEAIA ALPEGTGSAD TAAMAALQAT LAQQQAENEA MQAQLAEMAA AAEAEMEQVR AQAGALQNET QAAVDAATNR AALANIAAAL ENGAPLAASL DNLTVEAPEA LSAVSASGVE TLLDLQRQFP AAARAGLAES LKATVSDDPV DRAVAFLRAQ VGARSLEPRE GDDPDAVLSR AQEAVSAGQL EAALAEISTL PDAGQAAMAP WIGAAEARVA ALAAFDTLAA ELNSN
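The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the widely used compact 64-letter encoding (bases ordered T, C, A, G); translation table 11 uses the same codon-to-residue assignments as the standard code and differs in which codons may act as initiators. Reading the first codon as methionine is an assumption made here to reflect that a GTG initiator is translated as Met; `translate_cds` and its `is_start` parameter are hypothetical names for this example only.

```python
BASES = "TCAG"
# Compact encoding of the codon assignments, indexed by 16*i + 4*j + k
# where i, j, k are the positions of the three bases in BASES.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    ASSUMPTION: when is_start is True, the first codon is an initiator and
    is reported as 'M' regardless of its normal assignment.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if is_start and residues:
        residues[0] = "M"
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGTCAGATC CGAAGACCCC AAAAGACACC"))
    # Last 18 bases, which are in frame because 1428 is divisible by 3.
    print(translate_cds("GAACTGAACT CCAACTGA", is_start=False))
```

The first call reproduces the opening residues of the protein sequence (MSDPKTPKDT) and the second reproduces its final residues followed by the stop codon (ELNSN*).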