Gene Information |
Locus tag | Tcur_1956 |
Symbol | |
ID | 8603283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 2311763 |
End bp | 2314930 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | putative type II DNA modification enzyme |
Protein accession | YP_003299561 |
Protein GI | 269126191 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
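The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon, which is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table.
START_BP = 2311763
END_BP = 2314930
GENE_LENGTH_BP = 3168
PROTEIN_LENGTH_AA = 1055


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches record:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 2314930 - 2311763 + 1 = 3168 bp, and 3168 / 3 - 1 = 1055 aa.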
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.129503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGGC CCCACGGACC GGAGGACCCC CAGGACCCTC CCTGGCACCG GCTGCGCGGC GGCGCCCGCA AGGCCGTCGA GACACTGGGC ACCGGGCTGC TGCGGCACCC GGCCAACGCC GGCCTCCTGG ACCGCCACAG CCCGCAGGAC CTTCACCGGG CACTGCTGTG GACCGTCCAG CGGCTGCTGG TGCTGTTCCT GGCCGAAGAC CGCGGCATGC TGCTCGACCC GGACGCACCC GAGCAGGCCC GGGAACGCTA CCTTGCGCAC CACTCCACAG CCGTGCTGCG CCGCCGGGCC GCCGAAGCGG GCGGCGGGCA CGGCCGGCTG TGGCGGTCGC TGCGCACCGT CATCGACGGC CTCGGCGGCC ACGACGGACT GCCGGAGCTG GCACTGCCCG CCCTCGGCGG CATCTTCGCC CGCACCGACG CCGACCGGGT CCTGGAGGAC ACCGAACTGG ACGACGCCGA CCTGTGCGCG GCGGTCCGGG CGCTGTCCCA GATCCGGCAC GGACGCCCCG CCCGCCCCCG GCCCATCGAC TTTTGCCGCC TGGACGTCGC GGAACTGGGC GCGCTGCACG AGTCCCTGCT GAAGCACCGG CCGTGCCGGC AGAGTGCGGC CCGGCGGCGG GCGTTCGTCC TGCTGCCGGC CGGCACCGGA CGCAAAAGCG GCGGCTCCTA CTACACGCCC CCCGCCCTGG TCGACTGCCT GCTGGACTCG GCGCTGGACC CGCTGCTGGA CGAGGCCGTC AAAACCGGCC GCACCCGCCG CGAACAGGAA CGGGCCCTGC TGGCGCTGAC CGTCTGCGAC CCGGCCTGCG GCAGCGGCCG CTTCCTGGTC GCCGCCGCGG GCCGGATCGC CCGCCGCCTG GCCTTCGTCC GCACCGGCGA CCCGCAGCCG CCCCCCGCGG CGCTGCGGCG GGCCCTGCGC GAGACCCTCA CCACCTGCGT GTACGGCGTG GACCTCGACC CGCTGGCGGT CGAGCTGGCC AAGGTCGCGC TCTGGCTGGA GACCGGCGAG CCCGGCCGGC CGCTGCGCTC GTGGGACGAG CGCATCAAAG TCGGCAACGC CCTGCTCGGC GCCACCCCCG CCCTGCTGGC CGGCGGCATC CCCGACGCGG CCTTCCGGCC GCTGGAGGAG GACGACCGCG CCCTGACCGC CGCCCTGCGC GCCCGCAACA GGGCAGAACG CCGCAACCAC CCGCCCCTGC GCGCCTGGAC CAAGACGCAC GCCGACGCCT GGTGCGCGGC CTTCGTCTGG CCCAAGACCC CCACCGCCCC GCCGGCCATC ACCACCGCGA CCCTGCACGG CGACCTGCGG CGCCTCGCCC CGCGCACCCG CGCCGAACTG GAGGAGATCA CCGCCCGGTA CCGCTTCTTC CACTGGCCCC TGGAATTCCC GCAGATCTTC ACCGGCGAGG CGGGCGCCGG CGGTTTCGCC TGCGTGCTGG GCAACCCGCC CTGGGAGCGC GTCAAACTGC ACGAACGGGA GTTCTTCGCC GCCCGCGACG AGCAGATCGC CGCCGCCCCC GACGCCGCCG CCCGCCGCCG CCTGATCGCC GCCCTGCCGG AGCGGAACCC CGCGCTGCAC GCCGCCTTCA CCCGCGCCCG CCGGCAGGCC GAGGGAACCG CCCACTTCCT GCGCGCCTGC GGACGCTACC CGCTGACCGG ACACGGCGAC CTCAACACCT ACGCCGTCTT CGCCGAAGCG GGCCGCTCGC TGCTGAACCC CCGCGGCAGG ATGGGCCTCA TCGTCCCCAC CGGCATCGCC ACCGACGCCA CCACCCGCCG CTTCTTCCGG 
GACGTGGTGG AAAGCGGCTC GCTGGTCTCG CTGCTGGACT TCGAAAACCG CCGCCGCCTG TTCCGCGACG TCGATAGCCG CTTCCGCTTC ACCCTGCTGA CCCTGGCCGG CCCAGACCGC CGCGAACCGG CCGCGCAGTT CGCCTTCTTC CTGCACGACC CCGCCCAGGC CCAGGACCCG CGGCGGCGGT TTTCCCTCAC CCCCGGCCAG ATCGCCCTGC TCAACCCCAA CACCGGCACC TGCCCGTCCT TTCGCGGACG CCGGGACGCG CAACTGGTGC TGGAGATCTA CCGGCGCGTC CCGGTGCTGC GCGGCCCAGG CTGCGACCCG TGGGGGCTGT CGTTCCGGCG GATGTTCGAC ATGTCCAACG ACGCGCACCT GTTCTGGACC CGCGACCGCC TGGAGGACCC CGCCCAAGAA GGCGGCCCCT GGCGGCGGGA GGGCAACTGC TATGTGCGCG GCGAAGAGGT CATGGTGCCC CTCTACCAAG GGGTGATGGC CGACCTCTAC CAGCACCGCG CCGCGGATGT GGCCTGCAGC GACACCGCCG CCAAACGCAG GCACCGGCCC GTCCGCCTGA CCGAGAAGGA ATGGGCCGAC CCCGCACGGT TCGCCCAGCC CGCCTACTGG GTGCACGCCG CCGACCTCCC CGGCGACCTG CCGCCGTGGC TGCTGGGCTT CTCCAACGTC ACCAGCCCCA CCAACGAGCG GACCTTCCTG GCCTGCGCGC TGCCCCCCGT CGCCGTGGGC AACGCCGTCC CCCTCATCGG CACCGCCCGC CACCGGCTCC GCCCCGCCCT GCTGGCCTGC CTGTCGTCGC TGGTGTTCGA CTACTGCGCC CGCCAGAAGA TCGGCGGCAC CAACCTCAAC TACTTCTACG TCGAGCAGTT CCCGGTGCCG TCGCCGCAGC GGTTCTGCGA GCCGTTCGCC GCCGACCCCG CCACCGCCTT CCACGTCTGG GCGGAACGCC GGGTGCTGGA ACTGGTCTAC ACGGCCTGGG ACATGGCCCC CTTCGCCCGC GACCTGGGCG ATGACGGGCC GCCGTTCCGC TGGGACGCCG CCCGGCGCCG CCTGCTGCGG GCCGAGCTGG ACGCGGCGTT CCTGCACCTG TACGGCATCG GCCGCGACGA CGCCGAGCAC ATCCTGGCGT CCTTCCCCAT CGTGAAACGC AAGGACGAGG CCGCCTGCGG CTCCTACCGC ACCAGGGACC TGGTCATGGC GGCCTACGAC GCCATGGCCG CGGCCGCCGC CACCGGCCGC CCCTACCGGA CGGTCCTGGA CCCGCCGCCG GGACTCGGCC CCAGGCACCC TGCGCCCGCT GAAGATCCAC CGCCGTGCCG CTTGCCGCTA CGGTGGCGTC GTGAGTGA
|
Protein sequence | MSRPHGPEDP QDPPWHRLRG GARKAVETLG TGLLRHPANA GLLDRHSPQD LHRALLWTVQ RLLVLFLAED RGMLLDPDAP EQARERYLAH HSTAVLRRRA AEAGGGHGRL WRSLRTVIDG LGGHDGLPEL ALPALGGIFA RTDADRVLED TELDDADLCA AVRALSQIRH GRPARPRPID FCRLDVAELG ALHESLLKHR PCRQSAARRR AFVLLPAGTG RKSGGSYYTP PALVDCLLDS ALDPLLDEAV KTGRTRREQE RALLALTVCD PACGSGRFLV AAAGRIARRL AFVRTGDPQP PPAALRRALR ETLTTCVYGV DLDPLAVELA KVALWLETGE PGRPLRSWDE RIKVGNALLG ATPALLAGGI PDAAFRPLEE DDRALTAALR ARNRAERRNH PPLRAWTKTH ADAWCAAFVW PKTPTAPPAI TTATLHGDLR RLAPRTRAEL EEITARYRFF HWPLEFPQIF TGEAGAGGFA CVLGNPPWER VKLHEREFFA ARDEQIAAAP DAAARRRLIA ALPERNPALH AAFTRARRQA EGTAHFLRAC GRYPLTGHGD LNTYAVFAEA GRSLLNPRGR MGLIVPTGIA TDATTRRFFR DVVESGSLVS LLDFENRRRL FRDVDSRFRF TLLTLAGPDR REPAAQFAFF LHDPAQAQDP RRRFSLTPGQ IALLNPNTGT CPSFRGRRDA QLVLEIYRRV PVLRGPGCDP WGLSFRRMFD MSNDAHLFWT RDRLEDPAQE GGPWRREGNC YVRGEEVMVP LYQGVMADLY QHRAADVACS DTAAKRRHRP VRLTEKEWAD PARFAQPAYW VHAADLPGDL PPWLLGFSNV TSPTNERTFL ACALPPVAVG NAVPLIGTAR HRLRPALLAC LSSLVFDYCA RQKIGGTNLN YFYVEQFPVP SPQRFCEPFA ADPATAFHVW AERRVLELVY TAWDMAPFAR DLGDDGPPFR WDAARRRLLR AELDAAFLHL YGIGRDDAEH ILASFPIVKR KDEAACGSYR TRDLVMAAYD AMAAAAATGR PYRTVLDPPP GLGPRHPAPA EDPPPCRLPL RWRRE
|
| |
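The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation and a stop-codon check. The helper names `clean_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical, and the stop-codon set is the standard one for translation table 11 (TAA, TAG, TGA). Only the first two blocks and the final block of the gene sequence are used here, so the printed value describes that excerpt and not the whole gene.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence, copied from the record.
head = clean_sequence("ATGAGCCGGC CCCACGGACC")
# Final display block of the gene sequence, copied from the record.
tail = clean_sequence("GTGAGTGA")

print("GC fraction of excerpt:", gc_fraction(head))
print("ends with stop codon:", ends_with_stop(tail))
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field in the record.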