Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_3182 |
Symbol | |
ID | 8604527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3682111 |
End bp | 3685200 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003300761 |
Protein GI | 269127391 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGT TGGCGCCTGG GGTTTACGAG CATGTGGTGA CCGGTGAGCT GCGGAGGCGG CTGCGGACGG TCGATGCGGA GCTGGTGCAC CGGGACGGAC TGGAGGCCGT GGACGCCGAG GAGCTGCTCG TCCGGCATCT GGCGGTGCTG ACGCGGCGGG CGTTGCGGAT CGTCGGGGAG CAGGGGGAAG GGGCGCTCGG GCGGCAGGTC GAGACCGCCA ACCGGATCGC GGAGGCGATC GGGCGGCTGG TGCCGGGGGC CGTGGTCGAA GGCGACCGGG TGGCCGAAGA GGGACAGGTG CTGCTCGGGG TCGCCGGCGG ACGGCTGCCC GATGGGACGG TGGCGTTTCC GCGGCGGCCC GAGATCCCGC TGTCGATGGG AGCGCTGCTG GTCAACGGGC GGGATCAGCC GCAGCTCGGC CGGGAGATCG CCAAAGAGCT GGCCTCGGCG GACCAGGTGG ATCTGATCTG CGCGTTCATC AGGTGGCATG GGCTGCGGCT GATCGAGGAG GACCTGCGCG CCTTCACCGC CCGGGGCGGG CGGCTGCGCG TCATCACCAC CACCTACCTG GGGGCCACCG AGCAGCGGGC CGTGGACCGC CTGGTGGAAC TGGGAGCCGA GGTCAAGATC TCCTATGACG TGCAGCGGAC CCGCTTGCAC GCCAAAGCGT GGCTGTTCCA CCGCCGCTCG CGGCTCGACA CGGCCTATGT GGGCTCTTCG AACCTGTCGC GCAGCGCCAT GCTGGACGGG CTGGAGTGGA ACGTGCGGAT CGCCCGGGCG GAGCAGCCGC ACGTGATCGA GACCTTCGCG GCGACCTTCG AGGAGTACTG GAACGACCCG TCTTTCGAGG TCTACGACGA TCCCGGGCGG CTGGCGCGGG CACTGGCCGC CGCGCGAGGA GAGCGGGCAC CCGTCCCGCT GGAACTCGGA GGCGGGCATG TCGAGCCCTA CCCCTACCAG CAGGAGATCC TGGACGAGCT GGAGGCGGCC CGGCAGGTGC ACGGGCACTG GCGGAACCTG GTCGTCATGG CCACCGGGAC CGGCAAGACC GTCGTGGCCG CGCTGGACTA CCGGCGGCTG CGGCGGCAGG GCAAGGTCGA CTCGCTGCTG TTCGTGGCGC ACCGCAAGGA GATCCTGGAG CAGAGCCTGC GCACCTTCCG GCACGTCCTC GGGGACGGCG CGTTCGGGGA GCTGCTGGTC GACGGCCACC GGCCCGTCCA GTGGCGGCAC GTGTTCGGGT CCATCCAGTC GCTGTCCCAG CTTCCCCACC TGGAGCCGGA CCGCTTCAGC ATGCTCATCG TGGACGAGTT CCACCATGCG GCCGCCGCCT CCTACGCCAG GCTGCTGGAC CGGCTCGAAC CGAATGTGCT GGTGGGGCTG ACCGCCACCC CCGAACGCCC GGACGGCGAG GACATCCTGC GCTGGTTCGA CGCAGGCCGT TTCACCGTCG AGCTGCGGCT GTGGGAGGCG CTGGAACGGG GACTGCTCGT CCCCTTCCAC TACTTCGGCA TCCATGACGG CACCGACCTG TCAAAGGTCG GCTGGCGGCG CGGCGCCGGT TACGACACCG AGGAGCTGAC CAACCTCTAC ACCGGCCTGG ACTCCCGGGT GGCCATGGTC GTCGAGGCGC TGCAGCGCAA GGTCGGCGAT CTGAGCTGCA TGCGGGCGGT CGGCTTTTGC GTGAGCATCG CGCACGCCGA GTACATGGCC GAAGAGTTCA ACCGCCGCGG CATCCCCTCC CGGCCGGTCA CCTCCAAGAC CCCGCGCGAG GAGCGCGAGG AGTCCTTGTG CCTGCTCCAG GACGGCGAGC TGAAGGCCGT CTTCACCGTC GACCTGTTCA ACGAGGGTGT GGACGTGCCG CAGATCGACA CCGTGCTGTT CCTGCGTCCC ACCGAAAGCG CCACGGTCTT CCTGCAGCAG CTCGGCCGCG GCCTGCGGCC GGCGCCGGAT AAGGCCGTCC TCACCGTGCT GGACTTCGTG GGGCACCAGC GCAAGGAGTT CCGTTTCGAC CGGCGCTTCA CGGCGCTGAC CGGCATTCCC CGCAGCCGGC TGCTCAAGGA GGCCGAGGCG GGCTTCCCCA CGCTGCCGCC CGGATGCGCC ATCGACCTGG ACCGCGAGGT CAGCAAGATC GTCCTGGCGA ACATCAGGCA GGCCCTGGGC CGGCGCCGCG CTGAGCTGGT CGGCGAACTG CGCGGAATGG GCGACCCGTC GCTGCCGGAG TTCCTCGCCG AGACCGGGCT GGAGCTGGAG GACCTCTACC GCGGCGGGCG GGGCGGCTGG GCCCGGCTGC GCCGGGACGC GGGCCTGGAC GCCCGCCCGG TCACCTCGGC CAAGGACGAC CAGGCGCTGG GCAGGGCGAT CGGCCGCATG CTGCACGTGG ACGACCCCGA ACGCTTGCAG TTCCTGCGGG ACCTGCTGAG CCGCCCGCAG CCGCCCCGGC CGTCCGGCAC CGACCTGCGG CGCACCCGGC TGCTGTCCAT GCTGAACACC TTCTTCGACG CCTCCAGGCC GCTCAGCGCC TTGGAAGGGC ACCTGGAGCG GCTGTGGGCC AACCCGGCGC GGCGGGAGGA GATGCTCGCC GTCATCGATG TGCTGTGGGA CCGGATCCGC CGGGTCACCC CGGTCGTCCC CGAGGTCGCG CACCTGCCGT TGCGGCTGCA CGCCCACTAC ACCCGGGGCG AGGCCCTGGC CGCCTTCGGG ATGGAGGTCA CCTCGTCCAT GGTCGCCGGG GTGCAGTGGC TTCCCGAGGA GAAGGCCGAT GTCTTCCTGG TGACCATCGA CAAGGACGAA AAAGAGTTCT CACCCACCAC GATGTACAAC GACCGGGCGG TCGACCCGAC GCTCTTCCAC TGGGAGTCCC AGTCCCGCAC CCGGGAGGAC TCCGAGACCG GCCAGCGCTA CATCCACCAC GTGGAGTGGG GCACCAGCGT CCACCTCTTC CTGCGCAAGC AGAAGGGCGA CCCCTACACC TACGCCGGGC CGATGACCTA CCAGGACCAT GAGGGTGAAC GCCCCATGCG CATCCACTGG CGTCTTGCCC ATCCGCTGCC GCCGGAGGTG TTCCACTACG CCAAGGTGGG TGTCGGCTGA
|
Protein sequence | MTELAPGVYE HVVTGELRRR LRTVDAELVH RDGLEAVDAE ELLVRHLAVL TRRALRIVGE QGEGALGRQV ETANRIAEAI GRLVPGAVVE GDRVAEEGQV LLGVAGGRLP DGTVAFPRRP EIPLSMGALL VNGRDQPQLG REIAKELASA DQVDLICAFI RWHGLRLIEE DLRAFTARGG RLRVITTTYL GATEQRAVDR LVELGAEVKI SYDVQRTRLH AKAWLFHRRS RLDTAYVGSS NLSRSAMLDG LEWNVRIARA EQPHVIETFA ATFEEYWNDP SFEVYDDPGR LARALAAARG ERAPVPLELG GGHVEPYPYQ QEILDELEAA RQVHGHWRNL VVMATGTGKT VVAALDYRRL RRQGKVDSLL FVAHRKEILE QSLRTFRHVL GDGAFGELLV DGHRPVQWRH VFGSIQSLSQ LPHLEPDRFS MLIVDEFHHA AAASYARLLD RLEPNVLVGL TATPERPDGE DILRWFDAGR FTVELRLWEA LERGLLVPFH YFGIHDGTDL SKVGWRRGAG YDTEELTNLY TGLDSRVAMV VEALQRKVGD LSCMRAVGFC VSIAHAEYMA EEFNRRGIPS RPVTSKTPRE EREESLCLLQ DGELKAVFTV DLFNEGVDVP QIDTVLFLRP TESATVFLQQ LGRGLRPAPD KAVLTVLDFV GHQRKEFRFD RRFTALTGIP RSRLLKEAEA GFPTLPPGCA IDLDREVSKI VLANIRQALG RRRAELVGEL RGMGDPSLPE FLAETGLELE DLYRGGRGGW ARLRRDAGLD ARPVTSAKDD QALGRAIGRM LHVDDPERLQ FLRDLLSRPQ PPRPSGTDLR RTRLLSMLNT FFDASRPLSA LEGHLERLWA NPARREEMLA VIDVLWDRIR RVTPVVPEVA HLPLRLHAHY TRGEALAAFG MEVTSSMVAG VQWLPEEKAD VFLVTIDKDE KEFSPTTMYN DRAVDPTLFH WESQSRTRED SETGQRYIHH VEWGTSVHLF LRKQKGDPYT YAGPMTYQDH EGERPMRIHW RLAHPLPPEV FHYAKVGVG
|
| |