Gene Information |
Locus tag | Tcur_4135 |
Symbol | |
ID | 8605491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 4719168 |
End bp | 4722176 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003301701 |
Protein GI | 269128331 |
COG category | |
COG ID | |
TIGRFAM ID | |
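The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4719168
end_bp = 4722176
reported_gene_length = 3009     # "Gene Length" field, in bp
reported_protein_length = 1002  # "Protein Length" field, in aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print("gene length (bp):", gene_length)
print("protein length (aa):", protein_length)
print("consistent with record:",
      gene_length == reported_gene_length
      and gene_length % 3 == 0
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported 3009 bp and 1002 aa.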
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACCAC GCGATGCAGT CGGGATCCGC AGGCGCAGCG CCGGGGAGAT ACTGGCCGGG CTGGGGGCGC TGGTGGCGCT GTCGGCGCTG GTCGTCGGGG TGCCGTGCGC GCTGCTGCTG GCCTTCGGCT CGCCGCTGCC GGAGCGGATG CCCACCCTGG AGGACCTGAC CGGCCGGATC GGCCCCTCGG CGATCATCAC CGTGCTGGTC ACCCTGGTGT GGCTGGCCTG GCTGCAACTG GTGGTCTGCG TCCTGGTGGA GGTGCACGCC GGGATCCGCG GCGTCGGGGT GCCGGTCCGG GTGCCGCTGG CCGGCGGGCT GCAGCCGCTG GTGCACCGGC TGGTGCTGGC CGCGCTGCTG CTGTTCACCA CCGCCAGTGC GGTCATGCCC GCCTTCTCGG GCCGGGACAT GCCCTCCACC GCGCCGGTGG CCGCGGTGCA GACCGACCAC TTTCAGCTGG TGGCGGCCAC CACGCCGCTT GCGCGGGCCG AGCCGTCCGG CACGGCCGAA GAAGGGGCGC GCGCCCTCGC CGAGAGCGTC GCGGCCGAAC CGACCGAAAA GACGCAGACC ACCAAGATCT ACCGGGTGCA GCCGCCGCAG GGCCGCCACC ACGAGAGCCT GTGGGAGATC GCCGAGCGGT GCCTGGGCGA GGGCCGCCGC TACAAGGAGA TCTACGAGCT GAACAAGGGC CGGGTGCAGC CCGACGGCAG CAAGCTGACC TACGCCAGCC TCATCCGGCC CGGCTGGATC CTGGAGATGC CCGCCGACGC CGTCGGCGTC CAGGTCGTGC CGGTCGAAGA CCTGGAGGAG TACTTCCGCT ACGGGCACCC CAAGCCCGAC CCCGAACCCC GCGGCCAGGA CGGGCGGGAA ACACCCCGGC AGGCCCCCGA AGAGGCCCCG CCCGCACCGT CTGCTCCGCC TTCGCAACCA CCCGCCGCAC AGACCCCCGA GCCGCCCGCT GAGCAGACCC CTGCGCCGCC CGCCGAACAG GCCCCGCAGC CGCCGGCCGC GCAGGCCCCG CAGCGTCCCG CCGAGCAGGC CCCCGCCCGG CCGGACGAGC CGGCCCAGCA GCCGCACGCC CACCGGCTCG AACCCGCCGA CGAGGGGCCG CCGCCGGTCA CCGGGCTGTC GCTGCCGTCG ATCGACCTGG GCTGGCCGCA CGGGCTGGCG GCGGCCTCGC TGCTGGCCGC CGGCGTGCTG ACGGCGCTGG GCCGGCGGCG CCGCGAGCAG ATGTGGCACC GCGGATTCGG CACCATGATC GCCCGTCCCG AGGGGGAGGC CGCCCGCGCC GAGGAGGCGC TGCGCGTCGG CCAGGACCCC GAGGGAGCCC AGCTGCTGGA CCTGAGCCTG CGCCAGCTGT CACGGGCGCT GGCCGACCGG GGCCGCACCC TGCCCACCGT CTACGGCGTC CACCTGGGCG CCGAGAGCCT GGACCTGTGG GTCGCCCCCG CCGACCCCAA CCCGCCCGAG CCCTGGCGCG CCTTCGACGA CGGGCAGGTG TGGCGGCTGT CGGCCGACAC GCTGCCCGCC CTGCGGGAGG CCGCGCTGGG CGACGTGCTG GCCCCCTACC CGGGCCTGGT GTCCATCGGC ACCAACGCCA ACGGCCGGAT CCTGGTGGAC CTGGAGGCCG CCCAGGGGCT GATCGCGGTG CGCGGCCCGG AGGAGACCCG GCGGGCGGCG CTGGCCGCCA TCGCCCTGGA GCTGGCCACC AACCGCTGGT CGGACCACAT GCGCATCACC CTGGTCGGCT TCGACCCCGA CCTGGCGCGC AACCTCGCCG AGATCGCCCC CGACCGCATC 
CGCACCGTGG CCTCCCTGCA GGAGGCGCTG CCGGAGCTGG AGGGCCGCAG CGAGGAGGTG CGCCAGGCGC TGGCCGCCTC CGGCGCCGAC TCGGTGCTGA CCGGCCGCTG CCGCGGCGTG TTCGGCGAGG CATGGATGCC GCACTACCTG ATCATGGCCG ACCAGCCCAC CGACGCCGAG ACGGCCCGCC TGGTGGCGCT GGCCCGCACC GGCCGGCGCA TGGCCTCCGG CTACCTGGTC GCCGGCGAGG TGCCGGGCGC CACCTGGACC TGGGACGTCA CCGCCGACGG CCGGCTGCAC GCCGGGGTGC TGGGCTTCGA CGTGCAGGCC CAGCTGGTGC GCCCCGAGCA CTACCAGGCG GTGGCCGACC TGTTCCGCAC CGCCTCCCGC ACCCAGGGCG CTCCGCTGCC GGGCCCCGCC GACGGGGAGC AGCCGCCGTC CTTCGACCAC CGTCCCGAAG TGGACATCCG GCTGCTCGGC CCGATCGAGG TGGACGCGCC GGGCCCGATG GACGAAAGCC GCCGGGCGCT GTGCACCGAG GTGCTGGTGT ACCTGGCCAC CCATCCCGGC GGGGTGCACC CCACGGTGCT GAGCGGGGCG ATCTGGCCGC GCGGCGTCAG CGCCGGGGTG CGCGACGCCT GCATCGCCCG CGTCTCCGAC TGGCTGGGCC GCGACTCGCG CGGCCGTCCC AACCTCTACT ACGACGAGCG CGGACGCATC CGGCTCGGCT CGGAGGTGCG GGTGGACTGG TCGGTGTTCC GCTGGCTGGT GTGGCGTTCG GCGGCCGAAC CGGCCTCCGA GACCGCCTAC CTGTCCTACG CCCTGGACCT GGTGCGCGGC CCGCTGCTGG CCGACCGGCC GCGCGGCCGC TACGGCTGGC TGGCCGCCGA CCAGCTGGAG TACGAGGCCA CCGCCCGCGT CATCGACGTC GCCCACCGCC TGGCCGTGCT GCGGCTGGAG GAGGGCGACG CCCACGGCGC GGTGAACGCC GCCCGGGCCG GGCTGCGCAT GGTCCCCGAC GACGAGGGCC TGTGGCGCGA CCTGCTGCGC GCCACCCACG CCACCGGCGA CGCCACCCAG GTGCAGGTGG TGGTGGACGA GCTGCGCCGC AGGCTCGGCC GCGACCCGCT GATGGACCAC CTGCAGCCGG AGACCGAGGC CCTCATCGAG GAGCTGGTGC CGCACTGGCG TCAGGTCGCC CACCGGTGA
|
Protein sequence | MSPRDAVGIR RRSAGEILAG LGALVALSAL VVGVPCALLL AFGSPLPERM PTLEDLTGRI GPSAIITVLV TLVWLAWLQL VVCVLVEVHA GIRGVGVPVR VPLAGGLQPL VHRLVLAALL LFTTASAVMP AFSGRDMPST APVAAVQTDH FQLVAATTPL ARAEPSGTAE EGARALAESV AAEPTEKTQT TKIYRVQPPQ GRHHESLWEI AERCLGEGRR YKEIYELNKG RVQPDGSKLT YASLIRPGWI LEMPADAVGV QVVPVEDLEE YFRYGHPKPD PEPRGQDGRE TPRQAPEEAP PAPSAPPSQP PAAQTPEPPA EQTPAPPAEQ APQPPAAQAP QRPAEQAPAR PDEPAQQPHA HRLEPADEGP PPVTGLSLPS IDLGWPHGLA AASLLAAGVL TALGRRRREQ MWHRGFGTMI ARPEGEAARA EEALRVGQDP EGAQLLDLSL RQLSRALADR GRTLPTVYGV HLGAESLDLW VAPADPNPPE PWRAFDDGQV WRLSADTLPA LREAALGDVL APYPGLVSIG TNANGRILVD LEAAQGLIAV RGPEETRRAA LAAIALELAT NRWSDHMRIT LVGFDPDLAR NLAEIAPDRI RTVASLQEAL PELEGRSEEV RQALAASGAD SVLTGRCRGV FGEAWMPHYL IMADQPTDAE TARLVALART GRRMASGYLV AGEVPGATWT WDVTADGRLH AGVLGFDVQA QLVRPEHYQA VADLFRTASR TQGAPLPGPA DGEQPPSFDH RPEVDIRLLG PIEVDAPGPM DESRRALCTE VLVYLATHPG GVHPTVLSGA IWPRGVSAGV RDACIARVSD WLGRDSRGRP NLYYDERGRI RLGSEVRVDW SVFRWLVWRS AAEPASETAY LSYALDLVRG PLLADRPRGR YGWLAADQLE YEATARVIDV AHRLAVLRLE EGDAHGAVNA ARAGLRMVPD DEGLWRDLLR ATHATGDATQ VQVVVDELRR RLGRDPLMDH LQPETEALIE ELVPHWRQVA HR
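The sequences above are printed in blocks of ten characters separated by spaces. A minimal sketch of how such a block-formatted gene sequence might be cleaned, translated, and summarized is shown below, using only the first 30 bases of the gene sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG) and that translation table 11 assigns the same amino acids to codons as the standard genetic code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not taken from any library.

```python
import itertools

# Standard codon-to-amino-acid assignments in the conventional TCAG order.
# Assumption: translation table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGTCACCAC GCGATGCAGT CGGGATCCGC")

print(translate(prefix))    # MSPRDAVGIR, the first block of the protein sequence
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

Applying the same helpers to the full gene sequence would be the way to compare against the reported protein sequence and GC content; the prefix alone is not representative of the whole gene.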