Gene Tcur_1956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_1956 
Symbol 
ID8603283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp2311763 
End bp2314930 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content74% 
IMG OID 
Productputative type II DNA modification enzyme 
Protein accessionYP_003299561 
Protein GI269126191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.129503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGGC CCCACGGACC GGAGGACCCC CAGGACCCTC CCTGGCACCG GCTGCGCGGC 
GGCGCCCGCA AGGCCGTCGA GACACTGGGC ACCGGGCTGC TGCGGCACCC GGCCAACGCC
GGCCTCCTGG ACCGCCACAG CCCGCAGGAC CTTCACCGGG CACTGCTGTG GACCGTCCAG
CGGCTGCTGG TGCTGTTCCT GGCCGAAGAC CGCGGCATGC TGCTCGACCC GGACGCACCC
GAGCAGGCCC GGGAACGCTA CCTTGCGCAC CACTCCACAG CCGTGCTGCG CCGCCGGGCC
GCCGAAGCGG GCGGCGGGCA CGGCCGGCTG TGGCGGTCGC TGCGCACCGT CATCGACGGC
CTCGGCGGCC ACGACGGACT GCCGGAGCTG GCACTGCCCG CCCTCGGCGG CATCTTCGCC
CGCACCGACG CCGACCGGGT CCTGGAGGAC ACCGAACTGG ACGACGCCGA CCTGTGCGCG
GCGGTCCGGG CGCTGTCCCA GATCCGGCAC GGACGCCCCG CCCGCCCCCG GCCCATCGAC
TTTTGCCGCC TGGACGTCGC GGAACTGGGC GCGCTGCACG AGTCCCTGCT GAAGCACCGG
CCGTGCCGGC AGAGTGCGGC CCGGCGGCGG GCGTTCGTCC TGCTGCCGGC CGGCACCGGA
CGCAAAAGCG GCGGCTCCTA CTACACGCCC CCCGCCCTGG TCGACTGCCT GCTGGACTCG
GCGCTGGACC CGCTGCTGGA CGAGGCCGTC AAAACCGGCC GCACCCGCCG CGAACAGGAA
CGGGCCCTGC TGGCGCTGAC CGTCTGCGAC CCGGCCTGCG GCAGCGGCCG CTTCCTGGTC
GCCGCCGCGG GCCGGATCGC CCGCCGCCTG GCCTTCGTCC GCACCGGCGA CCCGCAGCCG
CCCCCCGCGG CGCTGCGGCG GGCCCTGCGC GAGACCCTCA CCACCTGCGT GTACGGCGTG
GACCTCGACC CGCTGGCGGT CGAGCTGGCC AAGGTCGCGC TCTGGCTGGA GACCGGCGAG
CCCGGCCGGC CGCTGCGCTC GTGGGACGAG CGCATCAAAG TCGGCAACGC CCTGCTCGGC
GCCACCCCCG CCCTGCTGGC CGGCGGCATC CCCGACGCGG CCTTCCGGCC GCTGGAGGAG
GACGACCGCG CCCTGACCGC CGCCCTGCGC GCCCGCAACA GGGCAGAACG CCGCAACCAC
CCGCCCCTGC GCGCCTGGAC CAAGACGCAC GCCGACGCCT GGTGCGCGGC CTTCGTCTGG
CCCAAGACCC CCACCGCCCC GCCGGCCATC ACCACCGCGA CCCTGCACGG CGACCTGCGG
CGCCTCGCCC CGCGCACCCG CGCCGAACTG GAGGAGATCA CCGCCCGGTA CCGCTTCTTC
CACTGGCCCC TGGAATTCCC GCAGATCTTC ACCGGCGAGG CGGGCGCCGG CGGTTTCGCC
TGCGTGCTGG GCAACCCGCC CTGGGAGCGC GTCAAACTGC ACGAACGGGA GTTCTTCGCC
GCCCGCGACG AGCAGATCGC CGCCGCCCCC GACGCCGCCG CCCGCCGCCG CCTGATCGCC
GCCCTGCCGG AGCGGAACCC CGCGCTGCAC GCCGCCTTCA CCCGCGCCCG CCGGCAGGCC
GAGGGAACCG CCCACTTCCT GCGCGCCTGC GGACGCTACC CGCTGACCGG ACACGGCGAC
CTCAACACCT ACGCCGTCTT CGCCGAAGCG GGCCGCTCGC TGCTGAACCC CCGCGGCAGG
ATGGGCCTCA TCGTCCCCAC CGGCATCGCC ACCGACGCCA CCACCCGCCG CTTCTTCCGG
GACGTGGTGG AAAGCGGCTC GCTGGTCTCG CTGCTGGACT TCGAAAACCG CCGCCGCCTG
TTCCGCGACG TCGATAGCCG CTTCCGCTTC ACCCTGCTGA CCCTGGCCGG CCCAGACCGC
CGCGAACCGG CCGCGCAGTT CGCCTTCTTC CTGCACGACC CCGCCCAGGC CCAGGACCCG
CGGCGGCGGT TTTCCCTCAC CCCCGGCCAG ATCGCCCTGC TCAACCCCAA CACCGGCACC
TGCCCGTCCT TTCGCGGACG CCGGGACGCG CAACTGGTGC TGGAGATCTA CCGGCGCGTC
CCGGTGCTGC GCGGCCCAGG CTGCGACCCG TGGGGGCTGT CGTTCCGGCG GATGTTCGAC
ATGTCCAACG ACGCGCACCT GTTCTGGACC CGCGACCGCC TGGAGGACCC CGCCCAAGAA
GGCGGCCCCT GGCGGCGGGA GGGCAACTGC TATGTGCGCG GCGAAGAGGT CATGGTGCCC
CTCTACCAAG GGGTGATGGC CGACCTCTAC CAGCACCGCG CCGCGGATGT GGCCTGCAGC
GACACCGCCG CCAAACGCAG GCACCGGCCC GTCCGCCTGA CCGAGAAGGA ATGGGCCGAC
CCCGCACGGT TCGCCCAGCC CGCCTACTGG GTGCACGCCG CCGACCTCCC CGGCGACCTG
CCGCCGTGGC TGCTGGGCTT CTCCAACGTC ACCAGCCCCA CCAACGAGCG GACCTTCCTG
GCCTGCGCGC TGCCCCCCGT CGCCGTGGGC AACGCCGTCC CCCTCATCGG CACCGCCCGC
CACCGGCTCC GCCCCGCCCT GCTGGCCTGC CTGTCGTCGC TGGTGTTCGA CTACTGCGCC
CGCCAGAAGA TCGGCGGCAC CAACCTCAAC TACTTCTACG TCGAGCAGTT CCCGGTGCCG
TCGCCGCAGC GGTTCTGCGA GCCGTTCGCC GCCGACCCCG CCACCGCCTT CCACGTCTGG
GCGGAACGCC GGGTGCTGGA ACTGGTCTAC ACGGCCTGGG ACATGGCCCC CTTCGCCCGC
GACCTGGGCG ATGACGGGCC GCCGTTCCGC TGGGACGCCG CCCGGCGCCG CCTGCTGCGG
GCCGAGCTGG ACGCGGCGTT CCTGCACCTG TACGGCATCG GCCGCGACGA CGCCGAGCAC
ATCCTGGCGT CCTTCCCCAT CGTGAAACGC AAGGACGAGG CCGCCTGCGG CTCCTACCGC
ACCAGGGACC TGGTCATGGC GGCCTACGAC GCCATGGCCG CGGCCGCCGC CACCGGCCGC
CCCTACCGGA CGGTCCTGGA CCCGCCGCCG GGACTCGGCC CCAGGCACCC TGCGCCCGCT
GAAGATCCAC CGCCGTGCCG CTTGCCGCTA CGGTGGCGTC GTGAGTGA
 
Protein sequence
MSRPHGPEDP QDPPWHRLRG GARKAVETLG TGLLRHPANA GLLDRHSPQD LHRALLWTVQ 
RLLVLFLAED RGMLLDPDAP EQARERYLAH HSTAVLRRRA AEAGGGHGRL WRSLRTVIDG
LGGHDGLPEL ALPALGGIFA RTDADRVLED TELDDADLCA AVRALSQIRH GRPARPRPID
FCRLDVAELG ALHESLLKHR PCRQSAARRR AFVLLPAGTG RKSGGSYYTP PALVDCLLDS
ALDPLLDEAV KTGRTRREQE RALLALTVCD PACGSGRFLV AAAGRIARRL AFVRTGDPQP
PPAALRRALR ETLTTCVYGV DLDPLAVELA KVALWLETGE PGRPLRSWDE RIKVGNALLG
ATPALLAGGI PDAAFRPLEE DDRALTAALR ARNRAERRNH PPLRAWTKTH ADAWCAAFVW
PKTPTAPPAI TTATLHGDLR RLAPRTRAEL EEITARYRFF HWPLEFPQIF TGEAGAGGFA
CVLGNPPWER VKLHEREFFA ARDEQIAAAP DAAARRRLIA ALPERNPALH AAFTRARRQA
EGTAHFLRAC GRYPLTGHGD LNTYAVFAEA GRSLLNPRGR MGLIVPTGIA TDATTRRFFR
DVVESGSLVS LLDFENRRRL FRDVDSRFRF TLLTLAGPDR REPAAQFAFF LHDPAQAQDP
RRRFSLTPGQ IALLNPNTGT CPSFRGRRDA QLVLEIYRRV PVLRGPGCDP WGLSFRRMFD
MSNDAHLFWT RDRLEDPAQE GGPWRREGNC YVRGEEVMVP LYQGVMADLY QHRAADVACS
DTAAKRRHRP VRLTEKEWAD PARFAQPAYW VHAADLPGDL PPWLLGFSNV TSPTNERTFL
ACALPPVAVG NAVPLIGTAR HRLRPALLAC LSSLVFDYCA RQKIGGTNLN YFYVEQFPVP
SPQRFCEPFA ADPATAFHVW AERRVLELVY TAWDMAPFAR DLGDDGPPFR WDAARRRLLR
AELDAAFLHL YGIGRDDAEH ILASFPIVKR KDEAACGSYR TRDLVMAAYD AMAAAAATGR
PYRTVLDPPP GLGPRHPAPA EDPPPCRLPL RWRRE