Gene Information |
Locus tag | Dshi_1552 |
Symbol | |
ID | 5712696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1614497 |
End bp | 1615753 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641267467 |
Product | hypothetical protein |
Protein accession | YP_001532895 |
Protein GI | 159044101 |
COG category | [S] Function unknown |
COG ID | [COG3395] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
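The length fields in this record follow from the coordinates. The sketch below recomputes them, assuming the start and end positions are 1-based and inclusive and that the CDS span includes the stop codon; the record does not state either convention explicitly. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the final codon of the CDS is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length_bp(1614497, 1615753)
print(length, protein_length_aa(length))
```

Under these assumptions the results agree with the Gene Length (1257 bp) and Protein Length (418 aa) rows.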
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.531117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGG TACTGGGCTG TATTGCGGAT GATTTCACGG GCGCGACCGA TCTGGCCGGG TTGCTCGCAC GCAGCGGTGC CGCGGTGCGG CTGCGCATGG GCGTGCCCGA CGGCCCGCCG CAGGACACCG CCGGGATCGA GGTGATCGCG CTGAAGATCC GCACGGCGCC GGTGGCCGAG GCGGTGGCGC AGGCCCGCGC GGCGCTGGCG TGGTTGCGGG CGGCGGGGGC GGAGCGGGTA TTCTGGAAAT ACTGCTCCAC CTTCGATTCG ACGCCCCAAG GCAATATCGG ACCCGTGGCC GAGGCGCTGA TGGCGGATCT GGGGGTGCAG CAGACGATTT ACTGCCCGGC CTTTCCAGAA AACGGGCGCG CGGTTTTCAT GGGCAACCTG TTTGTCGGGC GCGACCCGCT CGACGAGAGC CCGATGAAGG ATCATCCGCT GACGCCGATG CGCGATGCCA ACCTGATGCG GCTGCTCGCG CCGCAGGTCA CGCGGCCCGT GGGGCTGGTG GACCGGCTTT GCGTGGCGCG GGGGGCAGGG GCCTTGCGCG CGGAGCTGGG TCGGCTGGAC GCGGCGGGCG TGGCCCATGT GGTGGTGGAT GCGGTGGCGG ATGCGGATCT GGGCGTGATC GCCGAGGCGT GCCACGACAT GCCGCTGATG ACCGGGGGCA GCGCAGTGGC CGCGCCCTTG CCCGGGCTGC TGAGCGGTGG GGCGGCGGAG GCGCGGGCCG CCGCGCCGGA CCTGGCCCCT GGGGCGGTGG TGCTGTCGGG GAGTTGTTCG GCCATGACCC GCGCGCAGGT GTCGGCGTAT CTTCGGCGTG CGCCGGGCTA CAAGCTCGAC CCGCTGGTGC TGCGCACCGA GGGGGCAGGG GCCGCGCTTG CCTGGTTGGA GGCGCAGGCG CTCCAGGATG CGCCGTTGGT CTACGCCACC GCCGAGCCCG GCGAGGTCCG CGCAGCCCAG CAGGCCCTGG GCGTGGCCGA GGCGGGCGCG CTGGTCGAGG ACGCGCTGGC CCGGATCGCC GTGGCCGCGC GGGACCGGGG CGCGCGGCGC TTCGTGGTGG CGGGCGGCGA GACCTCGGGG GCGGTGACCC AGGCGCTGGG GGTTGTGCAG CTCGATGTGC GCCGCGAGAT CGCGCCGGGC GTGCCCTGGT GTTTCGCAGA GAGCGGCGGT GTGGACATCG CACTGACCCT GAAATCCGGC AATTTCGGCG CCGAGAGCTT CTTTGCCGAT GCCTTGGCGC TGGTGGACAC CCTATGA
Protein sequence | MATVLGCIAD DFTGATDLAG LLARSGAAVR LRMGVPDGPP QDTAGIEVIA LKIRTAPVAE AVAQARAALA WLRAAGAERV FWKYCSTFDS TPQGNIGPVA EALMADLGVQ QTIYCPAFPE NGRAVFMGNL FVGRDPLDES PMKDHPLTPM RDANLMRLLA PQVTRPVGLV DRLCVARGAG ALRAELGRLD AAGVAHVVVD AVADADLGVI AEACHDMPLM TGGSAVAAPL PGLLSGGAAE ARAAAPDLAP GAVVLSGSCS AMTRAQVSAY LRRAPGYKLD PLVLRTEGAG AALAWLEAQA LQDAPLVYAT AEPGEVRAAQ QALGVAEAGA LVEDALARIA VAARDRGARR FVVAGGETSG AVTQALGVVQ LDVRREIAPG VPWCFAESGG VDIALTLKSG NFGAESFFAD ALALVDTL
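The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own procedure: it uses the standard codon assignments, which translation table 11 shares for internal codons, and it ignores the table's alternative start codons. That simplification is harmless here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments; "*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
prefix = "ATGGCGACGG TACTGGGCTG TATTGCGGAT"
print(translate(prefix))
```

Passing the full Gene sequence value to `translate` and `gc_percent` is one way to compare against the Protein sequence and GC content rows of this record.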