Gene Information |
Locus tag | Dshi_1457 |
Symbol | |
ID | 5712634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1514658 |
End bp | 1516022 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641267370 |
Product | type III restriction protein res subunit |
Protein accession | YP_001532800 |
Protein GI | 159044006 |
COG category | [S] Function unknown |
COG ID | [COG3421] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
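The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino acid count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Dshi_1457
length_bp = gene_length(1514658, 1516022)
print(length_bp)                  # 1365
print(protein_length(length_bp))  # 454
```

Both results agree with the Gene Length (1365 bp) and Protein Length (454 aa) fields in the table.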
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0747485 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTCAAGC CCGACATCTC TAACGACATC ACCGGCAACC TTGCGCCCCG GATCGAGTTG CGTCCCTATC AGCGCACGGC GCTGGAACGC TGGCTGTTCT ACATCGACAA GTACGACGGA CGGCCCAAGG CCCCGCACCT GCTGTTCCAC ATGGCCACGG GCAGCGGCAA GACGGTCCTG ATGGCGGCAC TGATCCTGGA CCTGTACCGG CGCGGCTACC GCAACTTCCT GTTCTTCGTG AACTCGGCCC AGATCATCGA GAAGACCAAG GAAAACTTCC TTAATGCCGC ATCGGCCAAG CACCTGTTCG CGCCCACGAT CCGCATTGAC GACAAGCCCG TGGACATCCG CGCGGTGGAC ACCTTCGACG CGGTGTCAGG CGATGCGATC AACATCCACT TTACCACCAT CCAAGGGCTG CACACGCGGA TGCAGGCCCC GAAAGAGAAC GCCGTCACCA TCGAGGATTT CCGCGACTAC AAGGTGGTGA TGATCTCGGA CGAAGCACAC CACCTGAACG CAGAGACCAA GAAGACGCTT ACGGAAGGCG AGAAGGCAGA GAAGGCCAGT TGGGAAGGCA CGGTATCCGA GATTTTCCGC CAGCACCCCG AGAACATGCT GTTGGAGCTT ACGGCAACCG TGGACCTGAG CCACGATGCG ATCCGCGCGA AGTACGCCGA CAAGATACTC TACGACTATT CCCTGCGCCA GATTCGCGAA GACGGCTATT CCAAGGACAT CGAGTTGCGG CAGGCCGATT TACCACCGGC GGAACGGATG ATGCAGGCAA TGGTTCTGAG CCAGTACCGC CGCAAGGTGG CCGAGGCGCA TGGGCTGCAT TGCAAGCCGG TGATCCTGAT CAAGTCCAAG ACGATCAAGG ACAGCGCCGA TAACGAGGCC GCGTTCACGG CGATGGTGGC CGGGCTGACG GGCGAGGCGC TGGACGCGCT CAGGGCGGCC TCTGAGGGCG ATGAGACCCT TTCTCGGGCC TTCACCTTCA TCATGGATGA AAGGGCCATG AGCGGTGCGG ATTTCGCCCG TGAGCTACAG GGCGATTTTG CCCCCGAGAA GGTGGTGAAC GTTAACAACC CCAAGGATTT GGAAAACAGA CAGATAGAAC TCAATGCCTT GGAGGACCGC GACAACGAGA TACGGGTGAT CTTTGCCGTC GATAAGCTGA ACGAGGGCTG GGACGCGCTG AACCTGTTCG ACATCGTGCG GCTGTACGAT ACGCGCGACG GCAAGGCCAA CAAAGTGGGC AAGACCACAA TGGCCGAGGC TCAGTTGATC GGACGCGGTG CCCGGTACTT CCCCTTCATG GCTCCGGACC AGCCCGACGC GGCGCGGGAA AAAGCGCAAG TATGA
Protein sequence | MLKPDISNDI TGNLAPRIEL RPYQRTALER WLFYIDKYDG RPKAPHLLFH MATGSGKTVL MAALILDLYR RGYRNFLFFV NSAQIIEKTK ENFLNAASAK HLFAPTIRID DKPVDIRAVD TFDAVSGDAI NIHFTTIQGL HTRMQAPKEN AVTIEDFRDY KVVMISDEAH HLNAETKKTL TEGEKAEKAS WEGTVSEIFR QHPENMLLEL TATVDLSHDA IRAKYADKIL YDYSLRQIRE DGYSKDIELR QADLPPAERM MQAMVLSQYR RKVAEAHGLH CKPVILIKSK TIKDSADNEA AFTAMVAGLT GEALDALRAA SEGDETLSRA FTFIMDERAM SGADFARELQ GDFAPEKVVN VNNPKDLENR QIELNALEDR DNEIRVIFAV DKLNEGWDAL NLFDIVRLYD TRDGKANKVG KTTMAEAQLI GRGARYFPFM APDQPDAARE KAQV
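The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that whitespace, compute a GC fraction, and translate a CDS on the + strand. The helper names are hypothetical. The codon table is the standard genetic code written in NCBI "TCAG" order; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons.

```python
# Standard genetic code in NCBI "TCAG" order. Translation table 11 shares
# these codon-to-amino-acid assignments; only its permitted start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence in the record
print(translate("ATGCTCAAGC CCGACATCTC TAACGACATC"))  # MLKPDISNDI
print(translate("AAAGCGCAAG TATGA"))                  # KAQV
```

The two printed fragments match the start (MLKPDISNDI) and end (KAQV) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.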