Gene Information |
Locus tag | Dshi_0992 |
Symbol | |
ID | 5710507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1017620 |
End bp | 1020436 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641266902 |
Product | type III restriction protein res subunit |
Protein accession | YP_001532335 |
Protein GI | 159043541 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
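The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Dshi_0992.
length = gene_length_bp(1017620, 1020436)
print(length, protein_length_aa(length))  # prints "2817 938"
```

Both results match the Gene Length (2817 bp) and Protein Length (938 aa) fields, which supports the inclusive-coordinate reading.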
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.942343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCGTA AGGCCGGATT GATCCCCGCC CCTCGGCATG TGCCGGCGTG GGATGACCTG ACCGGACCGG AACAGATGGC CATGTTGACC GAGGTGGGTG CGGCACTTGG CCGAGGCACA GCCGGTTTCA TCGCCTCATC CGATCACGAT CATTTCCACC TGCGGGAAGC CAGTGGCGCT CGCCTGACAA CCGGCGGTCG CGATCCGCTC CTGCCGTTGC TCGCCGAACG GCTGGACACG GCGCAATCAG TCGACCTTGC GATTGCTTTC GCAATGGACA GCGGCGTGGC CTTGCTTGAG CCCTGGTTCC GGGAGCTGCT AGCGCGTGGT GGACGGCTCA GAATCGTGGT GGGCGACTAT CTCGACACGA CCGACCCGAC GGCGTTGGCG CGCCTGCTGG ACCTGGAAGG CGCCGAGCTC TTCGTGTTCG AAACGGGAGG CCTCAGCTTT CACCCGAAGG CCTGGCTGTT CCGTGCCGCC GATGCGCGCG GGGCGGCGAT TGTCGGCAGC TCGAACCTGT CTCAATCGGC GCTGACCGAT GGCGTGGAAT GGAACCTGCA TTCCGAGGAT GCAGCAGACA GCGTCGGAGC GGCCTTCAAG GACCTGCTGG CGCGACCTGA GGTTAAACCT TTGACGGCCG CATGGGTCGA TGCCTACGCG AAACGCCGCA GGGCGCGGCC GATGCCCGAG TTCACCGCCC GCGTCGTCGC CGAAGAAGGC CCGCCGCCAG AGCCGCACAC GATCCAGCAG GAGGCGCTGG CGGCGCTTGC ACAAAGCCGA TCAAAGGGCC ACCGCGCCGG GCTTGTCGTG CTGGCCACCG GGCTGGGCAA GACCTGGCTG GCCGCCTTTG ACACGATCCG TGCGATGGCG GGCCGCATCC TTTTCGTCGC CCATCGCGAC GAAATTCTGA CCCAAGCGAT GGCGGCCTTT CGCAAGGTCC GCCCAGACGC CAAGCTCGGC CGCTACACGG GTGCCGAGAA AGAAGCGGAC GCGGAAATCC TCTTCGCCTC GGTCCAGACG CTGGGCCGGA TCGGTCACCT GCGCCAATTC GACCGGGATC ATTTCGACTA CATCGTGGTC GATGAATTCC ACCATGCCGC CGCCCGCACC TATCAGAAGC TGATCGAGCA TTTCACGCCC GGCTTCCTGT TGGGCCTCAC CGCAACGCCA GACCGGACGG ATGGCGCCGA CTTGCTTGGC CTTTGCAGCG AAAACCTCGT CTACGAATGC GACCTGTTTC GCGGGATCGA CGCTGGGCAC CTGTCGCCCT TCCACTACTT CGGTGTGCCG GATGACGTGG ATTACGCACA GATCCCCTGG CGCTCCGGCC AGTTCGATCC GACCGCACTA GACGCAGCTC TAGCCACCGA GGCGCGAGCG CTTAACGCGC TGGACCAATA CCGCAAACGT GCAAGCGGCC CCGCGATCGG CTTTTGCTGT TCCGTGCGCC ACGCCGACTA CATGGCCGAG TACTTTCGCA CCGCCGGGTT GAACGCAATC GCGGTCCATT CGGGCCAAAG CTCCGCGCCG CGGGCAAGTT CCCTGGCAGC GCTCGGTCGC GGCGAGATCG ACATCCTCTT CGCGGTCGAC ATGTTCAACG AAGGCGTGGA CGTGCCCGAG ATCGGCACCG TCCTGATGCT ACGCCCGACC GAAAGCGCGA TTATCTGGCT GCAGCAACTC GGCCGCGGCC TGCGCCGCGT CGAAGGCAAG GTGCTGCAGG TGATCGACTA TATCGGCAAC CACCGCAGCT TCCTGACTAA AGTCGCGACC CTTCTGCGGG CCGGCGCAGG GGATCGCTCG 
ATCTCGACCA AACTGGACGC GCTCCAGGCC GGTGAATTCC AGATGCCCAA GGGCTGCGAG ATCACCTATG AGCTGCAGGT CATCGACATC TTGCGCAATC TCCTCCGCCC GAAGGAAGGC GCGGCGGAGC TCGAAGCCTT CTTTGTCGAC TTCCGGGACC GGACCGGCAT CCGCCCCACG GCTGTCGAAG TCTTCCGCAA CGGCTTCGAT CCCCGCACAT CGGGCCATGG CGGCTGGTTC GACTTCGTCC GGGACATGGG CGACGCCTTG CCCGAACGTC CTTTCGCAAC TCACGGTCGT CTCTTGGGTG AATTGGAACG GCCCAGGGGT TTGTCCCGAT CCGCCTTGAC CGGCTTGCGC GACGTTGTAG CCGGGCGGCA ACCGAGCGAT GGCGCGTCAG AGCTGGCCTT CAACCCGCAC CTGGCCCAAA GCAGCGACGG CTTTCGGCTG GCGCGACCTG ACGCTTCCGG AGAGCTGGAA CCGCTTGTGC GCGAGTTGGT CGAATGGCGC TTGGCGGAAA CGCCAATCGT CCGCGAGGCC GCCGAGGCGT CCGCGACCTT TGTCGGCAAG CCACAGGCTC TGGAGCTTTG GCAGCGCTAC TACGTGCCGG ACATCGCCGA GTTCTTTGGC GAAGAGTACA AGCAAAACGT CTGGCGCACC GGTATGAAGG CGCTGCCCGA CCGGAAGATA ATGATCCTTC TGGCAAATGT GTCGACCCAG GACCTGACCT ACGAGAACGC GTTTCTCAGC CCGTCACGGC TGAAATGGTT CAGCCAGAAC CAGACGCGCC AGGACAGCAA GCATGGCCGG GTCATCAGCG GTGAGGAAGG CCACGAGGTC CACATGTTCA TCCGCCGCGG GAACAAGGTG AACGGCAAGG TCAACCCATT CATCTACTGC GGGCAACCCA GGTTCGCCGG ATGGCAGGGT GAGAAGCCAA TTGAAGTACA ATGGGAATTG GCTTCACCAG CACCGCGCGA GCTTTGGGCG GAGCTCGGAA TTCCAAATCA AGACTGA
|
Protein sequence | MHRKAGLIPA PRHVPAWDDL TGPEQMAMLT EVGAALGRGT AGFIASSDHD HFHLREASGA RLTTGGRDPL LPLLAERLDT AQSVDLAIAF AMDSGVALLE PWFRELLARG GRLRIVVGDY LDTTDPTALA RLLDLEGAEL FVFETGGLSF HPKAWLFRAA DARGAAIVGS SNLSQSALTD GVEWNLHSED AADSVGAAFK DLLARPEVKP LTAAWVDAYA KRRRARPMPE FTARVVAEEG PPPEPHTIQQ EALAALAQSR SKGHRAGLVV LATGLGKTWL AAFDTIRAMA GRILFVAHRD EILTQAMAAF RKVRPDAKLG RYTGAEKEAD AEILFASVQT LGRIGHLRQF DRDHFDYIVV DEFHHAAART YQKLIEHFTP GFLLGLTATP DRTDGADLLG LCSENLVYEC DLFRGIDAGH LSPFHYFGVP DDVDYAQIPW RSGQFDPTAL DAALATEARA LNALDQYRKR ASGPAIGFCC SVRHADYMAE YFRTAGLNAI AVHSGQSSAP RASSLAALGR GEIDILFAVD MFNEGVDVPE IGTVLMLRPT ESAIIWLQQL GRGLRRVEGK VLQVIDYIGN HRSFLTKVAT LLRAGAGDRS ISTKLDALQA GEFQMPKGCE ITYELQVIDI LRNLLRPKEG AAELEAFFVD FRDRTGIRPT AVEVFRNGFD PRTSGHGGWF DFVRDMGDAL PERPFATHGR LLGELERPRG LSRSALTGLR DVVAGRQPSD GASELAFNPH LAQSSDGFRL ARPDASGELE PLVRELVEWR LAETPIVREA AEASATFVGK PQALELWQRY YVPDIAEFFG EEYKQNVWRT GMKALPDRKI MILLANVSTQ DLTYENAFLS PSRLKWFSQN QTRQDSKHGR VISGEEGHEV HMFIRRGNKV NGKVNPFIYC GQPRFAGWQG EKPIEVQWEL ASPAPRELWA ELGIPNQD
|
| |
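The gene sequence is displayed in space-separated blocks of ten bases and already reads in coding orientation (it begins with ATG even though the gene lies on the minus strand). As a rough sketch of how the two sequence fields relate, the snippet below strips the block spacing from the first three blocks, computes their GC fraction, and translates them. It assumes that translation table 11 assigns the same amino acids to codons as the standard code; the helper names are hypothetical, and only a short excerpt of the sequence is used.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed here to apply to translation table 11 as well).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def normalize(blocked: str) -> str:
    """Remove the whitespace between 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence from the record.
head = normalize("ATGCACCGTA AGGCCGGATT GATCCCCGCC")
print(translate(head))  # prints "MHRKAGLIPA"
print(round(gc_fraction(head), 3))
```

The translated excerpt matches the first ten residues of the protein sequence field. Running the same functions on the full 2817-base sequence would be the natural way to compare against the reported 64% GC content.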