Gene Information |
Locus tag | Tcur_2598 |
Symbol | |
ID | 8603935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3024427 |
End bp | 3027381 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003300191 |
Protein GI | 269126821 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
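The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both match its values. The function names are illustrative only and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Tcur_2598
length = gene_length(3024427, 3027381)
print(length, protein_length(length))  # 2955 984
```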
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00127194 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGGC CGGGCCCGAT CCTGCACCGG CGGCCGGAGG ACGACCCGTC CGGCATCGCG GAACTGGTGG AAGAGGCGGC GAGACTGCTG CCCGAGCAGG CGCCTTTGCA GGCGTTCGTC CACCACAACC CCCTGCACGC CTTCGAGCAC CTGCCGTTTG AGGAGGCCGC GGTCCGCGCG GCGCGGCTGT ATGGCACCGA GCCCTTCCCG AGCGAGGACT TCTTCGCCCA GTGCCTGGAA GCCGGGCGCA TCCGGCCCGA GGACCTCGAC GCCGTGGCGG CCGCCGAGGG CCTCGACGAT GAGACCCCGA TCGTGCCCGG CGGCCCGACC CGCCGGGCGT TCTGCGCGCT GCGGCTGCGG CACCCGTTCG AGATCCCCCA GGGCGCCGCC CTGGAGTGGA CGCTGCAGGA GACGGACGCG CTGGAGCGCA TCACCGGCCT GGCCGGGCCG AAGCGGCGCG CCGAACTGCT GCGGCAGGCC CAAGGGGACG AGCGGCGCCT GCTGAGACGG CTGTGGGACG ACCTGGAGAA GGCCGCCCCG CCTCAGCGGC CCGGCGCCCC CTCGGCCCGG CGGCGCGACC AGATCCTGGC CCTCACCGGC GTGGACACCG ACCGCCTCGT CCATCCGGTG CTGATCCGGC TGGTGGGGGC GTTCTTGGAC CAAGGGGTCG CCCGCCGGCC GATGCCCGGG CGCCGACTGG GCTTCCTGCG CGCCTTCCGG CGGCTGTACG GCCTGCGCGG CGGGCCGCCC GACCGGTTCC TGCGCGGGCT GGCCGGTGAG CTGAGCCGGC AGGAGGCCGA GGGCTGGACG GCGGAACGGA CCGTCGCCTG GGCGCTCGGG CGGCTCGCCG TGCCCGCAGG CGAGCGGGCC GCGGTGATCC GGGACACGCT GCTGTCGCTG CCCGGCTGGG CGGGCATGAT GCGCCGCCTG GAACGGCATC CCCAGAGCGC GCCGGTCCGG GCGCCCGCGG CACGGCTGAT CGACTACCTG GCCGTGCAGC TCATCTTGGA CGTGCAGGCG GCCCGGCATG CGGTCACCGG GCACCTGGGC CCGCACGCTG CGATCGCCGA CCTGGATGTG CTCTCGGCGG CGGACGCCTC GGCCGGGCGG CGGCCCGACC GGGAACTGGT GTACGAGGCG TTCGTCCTGG CCCAGGTGAT GCCGGTCGAT ACGGCGGTGC TCGCCGACCG CGCCGCCGCG GCGGCCTGGC TGGCGGCGGT CAGGGCCTGC GACGCGGTCG AACGGCGCCG GCTGCTGCAA CTGGCCCATG AGCGAAGGCA CCGGATCGAG GTGCTGGACG CGCTCACCGC CCACCAGACG ATCGCGGCCA GACCGGTGCC CCCTCCCCGC TTCCAGGCCG TCTTCTGCAT CGACGAGCGG GAGGAGGCCC TGCGGCGGCA CCTGGAGGAG CACTTCCTGC ACGTGGAGAC CTTCGGGTAC GCCGGTTCCT TCGGCGTCGC CATGCTCTAC CGGGGGATGG AGGACGTCCG CGCCCGTCCG CTGTGCCCGG CCGCCGTCAC GCCCCGCCAC CTGGTCGAAG AGGTCGCCGT CACCGGGACG GCCCGGCGCC CGCAGCGGCT GCGGGACCTG TGGGAGCGGC AGATCGCGGC CGGGAGCAGG ACGCCGGTGC GGGGCTGGCT GCTGACCCTG CTGCTCGGAC TCGCCCATCT CCTCGTCCTG GCCGCCCGCT GCCTGCCGCC GCACCGGCCA CGGCGGACCG GACGAGGCGG CACCGGACCC CGTACCCGGC TGGCACTGGA ACGGCCTCCC GGACAGGGCG CGGACGGCGA AGACGGGCCG 
CTGCGGGGAT ACACGGTGCC GGAGATGACC CGGATCGTGT CGACCGTGCT GCGCACCATC GGCCTGACCG GCGGCCTCGC CCCCCTGGTG CTGATCGTCG GGCACGGCTC ATCCAGCCTG AACAACCCGC ACGAGGCCGC GCACGACTGC GGAGCCGCCG GCGGCGGCCG CGGCGGGCCG AACGCCCGCG CGTTCGCCGC CATGGCCGAC CATCCCGGCG TCCGGCGCGC CCTGCGGCGC GAGGGCCTGC ACATCCCCGA CGACACCTGG TTCGTCGGCG CCTGCCACGA CACCTGCACC GACACGATCA CCTACTACGA CGAGGACCTG GTGCCCGAGC GGTGCCGCGG CGCGCTGCGG GCGGCCAAAC GGGCCATGGC CAGGGCGTGC ACGCTCACCG CGCACGAGCG GTGCCGCCGG TTCGAGTCCG CCCCCGCCGA CCTGCCCATC GGCCGGGCCC CGGCCCACGT GGCGGGCCGC GCGGTGGATC TCGGCCAGCC CCGCCCCGAG TACGGGCACG CCACCAACGC CGTCTGCATC GTGGGACGCC GCTCGCGCAC CCGGGGCCTG TTCCTGGACC GCCGGGCCTT CCTGGTCTCC TACGACCCCG GCGGCGACCC CACCGGGGAG CTGCTGGCGG ACCTGCTGGC GGCGGTCGGG CCGGTGTGCG CCGGGATCAA CCTGGAGTAC TACTTCAGCC GCATCGACCC GGCCGGCTAC GGCTGCGGGA CCAAACTCCC GCACAACATC GCCGGCCTGC TGGGCGTGAT GGACGGGCAC GCCTCCGACC TGCGCACCGG CCTGCCCTGG CAGATGGTGG AGATCCACGA GCCGCTGCGG CTGCTGCTGG TGGTGGAGGC CGCGCCGCAG CGGCTGGAGG CGATCCTGCG CACCCGCCCG GACCTGGGAC GCCTGGTGAC CAACGGGTGG ATCCGCCTGG TGGCCTGGGA CCCGGACTCC GCCGCGATGC ACGTCCATGA CCGCGGCGTC TTCCGGCCCC ATGTGCCGCA GGGCGCGGCG CTCCCGGTGG TGCACCGCTC GGTGCGGTTC TACGCGGGCG CCCGCGGCCA CCTGGGCTTC GCCCACGTGA CCGCCGCGTT CGCGACGGGG GACGGCTTCC TGCCCCTGCC GGGGGCCGCC GGGAGGCGGT CATGA
|
Protein sequence | MNRPGPILHR RPEDDPSGIA ELVEEAARLL PEQAPLQAFV HHNPLHAFEH LPFEEAAVRA ARLYGTEPFP SEDFFAQCLE AGRIRPEDLD AVAAAEGLDD ETPIVPGGPT RRAFCALRLR HPFEIPQGAA LEWTLQETDA LERITGLAGP KRRAELLRQA QGDERRLLRR LWDDLEKAAP PQRPGAPSAR RRDQILALTG VDTDRLVHPV LIRLVGAFLD QGVARRPMPG RRLGFLRAFR RLYGLRGGPP DRFLRGLAGE LSRQEAEGWT AERTVAWALG RLAVPAGERA AVIRDTLLSL PGWAGMMRRL ERHPQSAPVR APAARLIDYL AVQLILDVQA ARHAVTGHLG PHAAIADLDV LSAADASAGR RPDRELVYEA FVLAQVMPVD TAVLADRAAA AAWLAAVRAC DAVERRRLLQ LAHERRHRIE VLDALTAHQT IAARPVPPPR FQAVFCIDER EEALRRHLEE HFLHVETFGY AGSFGVAMLY RGMEDVRARP LCPAAVTPRH LVEEVAVTGT ARRPQRLRDL WERQIAAGSR TPVRGWLLTL LLGLAHLLVL AARCLPPHRP RRTGRGGTGP RTRLALERPP GQGADGEDGP LRGYTVPEMT RIVSTVLRTI GLTGGLAPLV LIVGHGSSSL NNPHEAAHDC GAAGGGRGGP NARAFAAMAD HPGVRRALRR EGLHIPDDTW FVGACHDTCT DTITYYDEDL VPERCRGALR AAKRAMARAC TLTAHERCRR FESAPADLPI GRAPAHVAGR AVDLGQPRPE YGHATNAVCI VGRRSRTRGL FLDRRAFLVS YDPGGDPTGE LLADLLAAVG PVCAGINLEY YFSRIDPAGY GCGTKLPHNI AGLLGVMDGH ASDLRTGLPW QMVEIHEPLR LLLVVEAAPQ RLEAILRTRP DLGRLVTNGW IRLVAWDPDS AAMHVHDRGV FRPHVPQGAA LPVVHRSVRF YAGARGHLGF AHVTAAFATG DGFLPLPGAA GRRS
|
| |
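The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch in plain Python, not the database's own tooling. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# for translation table 11 are the same as for the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record
print(translate("ATGAACCGGC CGGGCCCGAT CCTGCACCGG"))  # MNRPGPILHR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence once the spaces in the protein field are removed, and `gc_fraction` can be compared with the GC content field in the same way.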