Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2794 |
Symbol | |
ID | 5713694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2953140 |
End bp | 2954270 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641268720 |
Product | hypothetical protein |
Protein accession | YP_001534128 |
Protein GI | 159045334 |
COG category | [S] Function unknown |
COG ID | [COG4260] Putative virion core protein (lumpy skin disease virus) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.10969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.809001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATCT TCGATTTCCT CTCCGGTCAG TTCATCGACG TGATCCACTG GACCGACGAC ACCCGCGACA CGATGGTGTG GCGGTTCGAG CGGCAGGGCC ACGAGATCAA GTACGGCGCC AAGCTGACCG TGCGCGAGGG CCAGTCGGCG GTGTTCGTGC ACGAGGGGCA GCTGGCGGAT GTGTTCACGC CGGGGCTCTA CATGCTCGAG ACCAACAACA TGCCGATCAT GACCAGTCTG CAGCACTGGG ATCACGGCTT TCGCAGCCCG TTCAAGTCCG AGATCTATTT CGTCAACACG ACGCGGTTCA ACAACCTCAA ATGGGGCACC AAGAACCCGA TCATGCTGCG CGATCCGGAG TTCGGGCCGG TGCGGATCCG GGCCTTCGGG ACCTATTCGG TGCGCGTGGT GGACCCGGCG CGGTTCCTGT CGGAGATCGT GGGCACCGAT GGCGAGTTCA CCATGGATGA GATCAGCTTC CAGATCCGCA ACATCATCGT GCAGGAATTC AGCCGGGTGA TCGCGGGCGC GGGCATTCCG GTGCTGGACA TGGCCGCCAA CACCGCCGAG CTGGGCAAGG GGGTGGCCAC GGCGATCTCC GAGACCATCG CGGGCTACGG GCTGTCCCTG CCGGAGCTTT ACATCGAGAA CATCTCGCTA CCCCCGGCGG TGGAGACGGC GCTCGACAAG CGGACCTCGA TGGGTCTGGT GGGCGATCTG GGCAAGTTCA CGCAATACTC GGCCGCCGAG GCGATGACGG CCGCCGGCAA GGCCGGGGGC GACAGTGGCA TGGGCGCGGG CCTGGGGGCC GGCATGGGCA TGGCCATGGG CGCGCAGATG GCCCAGGCCG GGCCCTGGGG CGCGCGCCCT GCCCCGGCAC CTGCGGCGGC CACCCCGGTC GCGCCGCCGC CCCCGCCGGT GGAGCATGTC TGGCACATCG CCGAGGGCGG CCAGACCAAG GGCCCGTTTT CCAAGGCCGA TCTGGGGCGC ATGGCCGCCG AGGGCGGGCT GACCCGCCAG ACCCATGTCT GGACGCCCGG GCAGGACGGC TGGATGCGGG CAGAGGATGT CACCGAGCTG GCGCAGCTTT TCACCATTCT GCCGCCCCCG CCCCCGCCGC CGGGCGCGTA A
|
Protein sequence | MSIFDFLSGQ FIDVIHWTDD TRDTMVWRFE RQGHEIKYGA KLTVREGQSA VFVHEGQLAD VFTPGLYMLE TNNMPIMTSL QHWDHGFRSP FKSEIYFVNT TRFNNLKWGT KNPIMLRDPE FGPVRIRAFG TYSVRVVDPA RFLSEIVGTD GEFTMDEISF QIRNIIVQEF SRVIAGAGIP VLDMAANTAE LGKGVATAIS ETIAGYGLSL PELYIENISL PPAVETALDK RTSMGLVGDL GKFTQYSAAE AMTAAGKAGG DSGMGAGLGA GMGMAMGAQM AQAGPWGARP APAPAAATPV APPPPPVEHV WHIAEGGQTK GPFSKADLGR MAAEGGLTRQ THVWTPGQDG WMRAEDVTEL AQLFTILPPP PPPPGA
|
| |