Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4007 |
Symbol | |
ID | 5714536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009957 |
Strand | - |
Start bp | 71315 |
End bp | 73756 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641276919 |
Product | type III restriction protein res subunit |
Protein accession | YP_001542215 |
Protein GI | 159046545 |
COG category | [S] Function unknown |
COG ID | [COG4951] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.527137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGACA GAAGCGACAT TGAGGCGGAG TTGGCGCGGG TTCGAGCCCG TCTGGCCGAT CTGGATGTCG AACGAGCGCA GCTTCAGCGT GAAGTTGCAG CGCTTGAGGC TCGCCTTGCT GCAGAGCACG TGCCGGCAGT GAGACAACCC TCATTCGAGA ACGCCCCGGT TACGAATGCT TCACCGTCGC ACCAAAAGGT CGAGCTGTTC CGCCGCCTGT TCGCCGGTCG GCCTGACGTG TTCCCAGTGC GGTGGGAAAA CAGAAAAACT GACCGCTCGG GGTATTCGCC CTCCTGCGCC AACGAGTGGG TGAAGGGGAT TTGTGGCAAG CCAAAGGTCA AATGCGGCCA GTGTCCCCAT CAGAAGTTTA TCCCGCCGGA TGAAGGTGTC ATTGAGAGAC ACCTGCGCGG CGACGATGGT CGGGGCGGGG ATTTTGTCGC CGGCGTCTAT CCGCTCCTGC TCGGTGACAC ATGCTGGTTC CTGGCGGCGG ATTTTGACAA GGCATCCTGG GCGGAGGACG CCAACGCGCT GCTCGAAACC TGCCGAGTGA AGGGAGTGCC CGCAGCGTTG GAACGGTCGA GGTCGGGCAA CGGGGGGCAT GTCTGGATAT TCTTCTCCGA ACCGGTCTCG GCCCGTCTGG CGCGCCAGCT TGGATCGGTC CTGATCACGG AAACGATGGA ACGGCGGCCA GAAATCGGCT TTGCTTCCTA TGACCGGTTG TTTCCAAATC AGGATATCAT GCCGCTCGGC GGCTTCGGCA ACCTGATCGC ACTGCCGCTT CAGAACACAG CGCGCAAAGC TGAAAACAGC GTCTTTGTCG ATGCTAGCCT GCGGCCATAT GACGATCAGT GGGCCTATTT GTCTTCCTTG CCGCGATTGT CGGCGGCAGC AGTGACCCAG CTCGTCGAAG CCGCCGAGCT TTCCGGGCGA GTGCTGGGTG TGCGCATGCC GGTAGAAGAT GAGCAGGCGG ACGAACCGTG GAAAATGCCT CCGTCACGCC GCAGTACGCC GCGACGCCTC GATGTACCTG TTCCGACAAC CATCAAGGTG ACGGTCGCAG ACCAAATCTA TATCGACCGT TCGGACTTAC CTTCGGCCAT GATTGCGCAA TTGGTGCGGT TGGCGGCGTT CCAGAATCCC GAATTCTATC GCGCGCAGGC CATGCGACTG CCAACATTCG GCAAGCCACG CATCGTGTCC TGCGCCGAAC TGCATCCCCG ACACGTTGCC CTGCCCCGCG GTAGCTTCGA CGAAGCGATC AGATTCCTGT CTGATCACGG TGCGACAGCC GATCTGGACG ATTTGCGTGT AGACGGAGCT CGTTTGCCGG AGACGGTCTG CTTCGATGGC CAACTTCGCC AGCAGCAATC ACGAGCGTTT GACGCATTGG CCGAACACGA TACCGGCGTG CTTGCCGCCA CGACTGCATT CGGCAAGACA GTGGTAGCCT CGGCACTGAT TGCGCACCGC GCTCGCAATA CGCTGGTCTT GGTTCACCGC CGGGAATTGC TGAACCAATG GGTCGAACGG CTTGGCTCAT TTTTGCAGAT CGATCCCAAG CTGATAGGCA CCATCGGCGC CGGAAAACGC AAACTCACCG GTGTGATCGA TGTAGCGTTG ATTCAGAGTC TGGTTCGGAA GGGCGAAGTT GACGATATCG TTGCCGATTA TGGCCATCTT GTCGTCGATG AATGCCATCA CCTGTCCGCT GCGAGCTTTG AGCTTGTCGC CCGCAGATCG AAAGCGCGCT ATGTCGCCGG GTTGTCGGCG ACGGTCGCTC GAAAGGACGG ACATCATCCG ATCATCTTCA TGCAATGTGG CCCGGTGCGC CATCAGGTGA GCGCCAAATC GCAGGCAGCC GAAAGCGGAC TGCGCCATCG AGCGCGGGAA CGTCACACGA GATTCCGGCT GCCAGAACCC CTCGCCATGG CCGAGCGCCC GTCAATGCCC GCGATCTATG CCGCTCTGGC GGAGGACGAG GCCCGAAACG ATCTGATCTT CGACGACGTG CTGAAATCAT TGGAGGCCAA ACGCTCACCG ATCATACTAA CCGAGCGGAA GGATCACCTC GAGCACCTTC ATCAGCGGTT CTCCCGATTT GCGAAGAACA TCGTCGTGCT CCGTGGCGGC ATGTCCGCAA AGGACCGGAA GGCCGCGCAT GCGGCGCTGA ATGTGGATGA CGATGAGGAA CGGCTGATCC TCGCGACAGG GCGCTATATC GGCGAAGGCT TCGATGACGC GCGGCTCGAC ACCCTGTTCC TGACGATGCC GATCGCATGG AAGGGAACGC TGGCGCAATA TGTCGGCAGG TTGCACCGCC GACATGACGA CAAGAAGGAC GTGTTGGTGG TCGACTATGT AGACAGTTCG GTCCCGGTCC TCGCCAGAAT GGCGGCCAAA AGAAGAACCG GTTACCGGGC TCTCGGCTAT GTGATGGAAT AG
|
Protein sequence | MADRSDIEAE LARVRARLAD LDVERAQLQR EVAALEARLA AEHVPAVRQP SFENAPVTNA SPSHQKVELF RRLFAGRPDV FPVRWENRKT DRSGYSPSCA NEWVKGICGK PKVKCGQCPH QKFIPPDEGV IERHLRGDDG RGGDFVAGVY PLLLGDTCWF LAADFDKASW AEDANALLET CRVKGVPAAL ERSRSGNGGH VWIFFSEPVS ARLARQLGSV LITETMERRP EIGFASYDRL FPNQDIMPLG GFGNLIALPL QNTARKAENS VFVDASLRPY DDQWAYLSSL PRLSAAAVTQ LVEAAELSGR VLGVRMPVED EQADEPWKMP PSRRSTPRRL DVPVPTTIKV TVADQIYIDR SDLPSAMIAQ LVRLAAFQNP EFYRAQAMRL PTFGKPRIVS CAELHPRHVA LPRGSFDEAI RFLSDHGATA DLDDLRVDGA RLPETVCFDG QLRQQQSRAF DALAEHDTGV LAATTAFGKT VVASALIAHR ARNTLVLVHR RELLNQWVER LGSFLQIDPK LIGTIGAGKR KLTGVIDVAL IQSLVRKGEV DDIVADYGHL VVDECHHLSA ASFELVARRS KARYVAGLSA TVARKDGHHP IIFMQCGPVR HQVSAKSQAA ESGLRHRARE RHTRFRLPEP LAMAERPSMP AIYAALAEDE ARNDLIFDDV LKSLEAKRSP IILTERKDHL EHLHQRFSRF AKNIVVLRGG MSAKDRKAAH AALNVDDDEE RLILATGRYI GEGFDDARLD TLFLTMPIAW KGTLAQYVGR LHRRHDDKKD VLVVDYVDSS VPVLARMAAK RRTGYRALGY VME
|
| |