Gene Information |
Locus tag | Tcur_4558 |
Symbol | |
ID | 8605920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 5165200 |
End bp | 5168577 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | WD-40 repeat protein |
Protein accession | YP_003302121 |
Protein GI | 269128751 |
COG category | |
COG ID | |
TIGRFAM ID | |
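
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 5165200
END_BP = 5168577
GENE_LENGTH_BP = 3378
PROTEIN_LENGTH_AA = 1125


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5168577 - 5165200 + 1 gives 3378 bp, and 3378 / 3 - 1 gives 1125 aa, matching the Gene Length and Protein Length fields.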
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGCGAGG TGATCGATTA CGGGCGATTT GCGGAGCGGA TGCGGCGGGC CGCCGAGCGG GGGCGGTCGG CGTTCTGGGA GTTGCTGCGC GATTTCCAGC GCGAATGGGG TTATCAGGAC CCCGAGGGGC CGCCGCGCCG GTCGCGGCGG GCCGAGGACG AGTTCGATGA GGACGGCTTC GAAGAGGATG ACTTCGATGA GGACGACGCC GCCCGGGTGG ACGTCTCCCT GCCGGTCCCG GAGGCGTTGA AAGAGTGGTG GGATCTGCCG TTCAACTCGT TCGTCCACTC GCCGGAGCTG TATTACACGC ATCCGGAGTG GCCGCCCACG ATCCGGCCCG ACCCCACCGG ATACGGCTCT GGCGGCGCGC TGCCCGTCCG CAACCCGTTC GTCGGTCCCG ATGAGGACCG GCGGGTGTGC GTGTTCATGG CCGAGTACGA GTACTGCAAC GAGTGGGGGT ACCTGGCCGC CGAAGCCGGC CGGCCCGATC CCAGAGTGCT GGTCAGCATC GAGGACGACT GGGTGCTGCA GGCCCGGTCG ATCTCGGAGT TCTTCCTGCA GCTGGCCGTC GAACGCATCC CTTACCGCCA CGGCTGGCGG CTGTGCCTGG GACTCCACGA GGTCGTCGAA CACCCGGGAT TGCTGGACCG GCTCCGCGCC ACCTACCCGG AGATGGGGCT GCTGCCCTGG CGGGAGCTGG GAGTCGACAG CCTCGCCTAC GGGGCACCGG ACGCGATCAT CTATCACGCC CGCGGCGAGG GCGCGGACTT CCCGCTGATG ATCTGCGCCC GCGCCCGTGA GGCGCTGCTG CGGGTGGCGC GCACGCTGCG GATCGACGTG GTCGGCTACG CGATCGAATC CCCCGAGCAC GGGCCGGACA GCGGCCAAGA AGCCGATATC GGGCCGGTGG CGCTCGTCGA AGGCGATGCC GATGGGTTGG GCCGCTGGAC GGTGACCGCG GTGCACCCGG CGGAGCCCGC TTCCTCGTTC GCCGAGCCGT CCGGACAGGT CGTGTCGGCC ACTGACCGGG ACGGGACGAT CACGGTCACC GCAGATGCCG GCGGCCGGGT GCATGTGCGG CTCGACGGTG CCGAGCCGCA GGTCCGGAAA CTGCACGAAG CCGAGGTGAC CGCCGTCGCC TGCGTGCGTC CGGACGGCGG CCCGCTGGTG GTGACCGGGG ACGCCGAGGG AACCGTTCGC CTGTGGGAGC CGCCCGGCCA GCCGCCGAAA CGCCCTTTCG CCCACGGTGA GGGGGCGGTG TGCGCGCTGG CGGCGGCGGT GCTGCCGACC GGGCCGGCGG TCGCCGCCGC CTGGACCGAC GGGCAGGTCC GAGTACAGGA TGTGCGCTCG GGGGTGAGCG CGGACTTTTG GCTGGGCACC GGCATCACCG GCCTGGAGCT GCGCCCGGAA GACGAGCTGC GGGTAAAAGC AGGCAACGGC ACCACCGTGC TGCGGCTGGA CCCGGAGCGG CTGTGGCCGC ACCGCAAGAT CCGGCTGCGG CTGAACGAGA TCGACTGGGC GTCCTTGCGG CACGCCAGGG GCAGCGCCCG CGACATCCCA GATGTGATCA TGAAGGTGGC CACACCCCGC GGCGGCGAGG TGGGGGAGGG CCTGAACGAG CTGGAAGCCA GGCTGCTTCG TGAGGACGCG GTGTACCCGG CGACCTCGGC CGCCGTGCCG TTCCTGGTGG AGCTGGCCTG CAACCGGCCA CTGGGGGATC CTGAGCGGCT GTTCCGCCTG CTGTCGCGGA TCAGCCGGGC GCACGAGGAG ATCGACGACG AGCGGATGCG CTCCCTGGCC 
CTGGAAGGGC GGGCGGCGGT CGAGGCCCAG ATCCCCGTCC TGGTGGCGGC GCTCGGCAGT TCCGACCCGG TGGTTCGTCA CGGGGCGGCC TTCGTGTTGG CCGACTTCCC CGAGCGCGCC TTGGAGCTGG TACCGGTGCT GCAGGCGCGG TATGCCGCGG AGACCGACCC GGTGACGGCC GCTGCGCTGG TGCTGGCCAT CGGTGACCTG ACCGAGGAAC GCCGGGACGC GCCGCTGGAG TGGCTGCGCG GCCTGCTCGC CGACGCCCCC TGCCGGGAGG TGCGGGCCGC CGCGGCGGTG GCGCTGCTGT GGTGCGGCGT CCCGGAGTTC TCCGGTGAGC TCGCCCGAGC CGTCGAAGAG GAACTGACCG CCGCTGAACC GGCGCTGCGA GAGACGTTCT GGGTCTTGGA GTACGGCTTC ACCGACCTGC TGCTGGGGGC CATGGGCGAC CACCCCGGCG AGCTGATCGA GTTGATCGGC TCCCTGCTGG AGAGCGCCGA GCACGACGTC CCCGCCTGGC AGGTGCGGCG GGCCGGGAAG GTGATGCGGA CCTGGCGGGC GGCGCCGCAA CGGCTGCTGC CCCTGCTGGC CGGCCTGCTG AACGGGGAAA GGGAGTCCAC CGGCCGGGCC GCCCTGGAGG AGATCGAGTG GTGCGGTCCC GCCGCCGCCC AGGTGGCCGA CGCCCTGGTG GCGCTGCTGG ACAGCCCTCA GCTTTGGCGG TCGCGCGGTG CGCTGGAGGT GCTGGCCCGG CTCGGCGACG CCCGCTGCCT GCCGCAGCTC ACCCGTGAGC TGGCCGCAGA CCGGATGCTG TTCTATCTGG AGGACATGGT GGGAGGCATG GCCGAGCACG CCGATGAACT GGTGCCGGTG TTGCGTCCCA TCCTGTCCGC CGCCCACGGC GACGTCCCGG CCGGGCTGGT CACCGGGGTC GGCCGCTGGG GAGTGCGGGC CGCGCCGCTG GTGCCGGAAC TGACCGGCCT GCTGGAGGAC GATGGGGCCA GAAAGGAGGC GGCGTCCGCG CTGGGCGCCA TCGGCGAGGC CGCCGCCGGG GCCGTCCCGG TGCTGCGCCG CTGTCTGCAC CAAAGCCGCG CTGAGGAGGA GCGTCAGGAG GCGGCGTGGG CGCTGTGGCG GATCACCGGG GACGGCGCGG AGGCGCTGGA GGTGCTGGCC GATGCGCTCC GGCCGAGCCT GTCGGTGGAG ATTGCCGAAC GGCTGCTCGA CCTGGGGCGG GCGGCCGCCC CGGCCGTCCC GCTGCTGCGT CCGCTGCTGG AGGAGGCACC GGATGAGGAG TCGGCCACCG CGGCGGCCTG CGTGATCCAC CACGCCACCG GTGACACCGC AGTGCTGCCG CGGGTGATCG AAGCGGTGGA GACGACGGCC GCCACCCCGC TGGGCATGCT CGCCGTCCGC GCTCTCAACG GTCCCGGGGC CGCCGCCGCG GTGCCGCGAC TGCGGGAGAT CATCGACTCT GCGCGGGTCC TGGCCCATCC TGACGACCGC GAGGCCATTC CCCGTGACCT GGCCTACCGG GCCCTCGCCG CCGAGGCCCT GGCCCGCATC ACCGACTCCC CGTCCTGA
|
Protein sequence | MGEVIDYGRF AERMRRAAER GRSAFWELLR DFQREWGYQD PEGPPRRSRR AEDEFDEDGF EEDDFDEDDA ARVDVSLPVP EALKEWWDLP FNSFVHSPEL YYTHPEWPPT IRPDPTGYGS GGALPVRNPF VGPDEDRRVC VFMAEYEYCN EWGYLAAEAG RPDPRVLVSI EDDWVLQARS ISEFFLQLAV ERIPYRHGWR LCLGLHEVVE HPGLLDRLRA TYPEMGLLPW RELGVDSLAY GAPDAIIYHA RGEGADFPLM ICARAREALL RVARTLRIDV VGYAIESPEH GPDSGQEADI GPVALVEGDA DGLGRWTVTA VHPAEPASSF AEPSGQVVSA TDRDGTITVT ADAGGRVHVR LDGAEPQVRK LHEAEVTAVA CVRPDGGPLV VTGDAEGTVR LWEPPGQPPK RPFAHGEGAV CALAAAVLPT GPAVAAAWTD GQVRVQDVRS GVSADFWLGT GITGLELRPE DELRVKAGNG TTVLRLDPER LWPHRKIRLR LNEIDWASLR HARGSARDIP DVIMKVATPR GGEVGEGLNE LEARLLREDA VYPATSAAVP FLVELACNRP LGDPERLFRL LSRISRAHEE IDDERMRSLA LEGRAAVEAQ IPVLVAALGS SDPVVRHGAA FVLADFPERA LELVPVLQAR YAAETDPVTA AALVLAIGDL TEERRDAPLE WLRGLLADAP CREVRAAAAV ALLWCGVPEF SGELARAVEE ELTAAEPALR ETFWVLEYGF TDLLLGAMGD HPGELIELIG SLLESAEHDV PAWQVRRAGK VMRTWRAAPQ RLLPLLAGLL NGERESTGRA ALEEIEWCGP AAAQVADALV ALLDSPQLWR SRGALEVLAR LGDARCLPQL TRELAADRML FYLEDMVGGM AEHADELVPV LRPILSAAHG DVPAGLVTGV GRWGVRAAPL VPELTGLLED DGARKEAASA LGAIGEAAAG AVPVLRRCLH QSRAEEERQE AAWALWRITG DGAEALEVLA DALRPSLSVE IAERLLDLGR AAAPAVPLLR PLLEEAPDEE SATAAACVIH HATGDTAVLP RVIEAVETTA ATPLGMLAVR ALNGPGAAAA VPRLREIIDS ARVLAHPDDR EAIPRDLAYR ALAAEALARI TDSPS
|
| |
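
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino-acid assignments of the standard genetic code (which translation table 11 shares) and treats an initiator GTG, TTG, or ATG as methionine; the `START_CODONS` set is a simplified assumption and not the full list of table 11 initiators. The helper names `clean_sequence`, `gc_fraction`, and `translate_cds` are hypothetical. The gene is on the minus strand, but the listed sequence appears to be given in the coding orientation, since its first codons match the start of the protein.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a simplified subset of bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE.get(codon, "X")  # X marks an unrecognized codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases and last 18 bases of the gene sequence listed above.
print(translate_cds("GTGGGCGAGG TGATCGATTA C"))  # MGEVIDY
print(translate_cds("ACCGACTCCC CGTCCTGA"))      # TDSPS
```

The two outputs correspond to the first seven and last five residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would give the value to compare against the GC content field.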