Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1501 |
Symbol | |
ID | 8602815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 1720003 |
End bp | 1722900 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003299118 |
Protein GI | 269125748 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.257169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCCT ACACGCGCAT CCGGACGGCC TCCTGGCTGC CGGACTCACC GGACGCCTCG GTCACCGCCC GCTATCAGGC CCTGCCCTTC CCCGAACAGT GGCGGGAGGT GCTGCTGGAG CTGTGCAACG CGGGCCGTCC CACCGACGCC GAGCCCTACC GGACCGTGCC CACCCGCCGG ATGGAACAGG TCCTGCAGAC GTTCGCCCCC GACTACCTGG TGCTGCCGCG CCCGCGTGAC GGGCGCGGAC ACTGGCTGCT GGTGCCCGAA GGCGTCGAAC GCCTGCCGGA TCAGGTTTTC CGGGCGCTGT ACAACGCCTG GCTGAGCGAC CTGCGACCCG ACATGGCCAA AGACCCCCAC TACCGGGAGC TGCTGGCCAA GGCCCGCGCG ATGCTGGACG ACGCCCCGCC CCGCTGGGAA CCGGTGGAGC TGGAGCTGCT GCGCTGCCCG GTCACCGAAG GCGGCACCGC CGCGCCGCTC GCCCACCAGT ACCCGCTCAC CACCGACTGG TTCGCCCGGA AGATCCTCGC CCTGGAGCCC TATGACTATG GAAGCGGCAC CCTGCGTTTT CACGCGGTGC CGCGCGGCCC CCGCGACCAG GGCGCCGAAC TGGTGTCAGA GCCGCTGCGG TTCGACAAGG ACGGGCAGAG GTGGTGGTAC TCGATCACCC TCAACGTCAC CTTGCACACC GTCCCCTTCG AGCCGCTGCC ACGGATCCAC CTGCACACCG GGATCCGCCG CTGGGCGACC CGGGTCGGCG CCGCTGGCCG TCTGTACCTG CCCCGCCGGC GGCGCACCAC GGTGCTGCTG CGGCCGCGGG TGCCCTGGCT GCCAGGCGCG TCCCGCTCGG ACCGGTTCGC CGTCGCCCGG TTGGAACGCC GCTGGGACCG GGAGGCGCGG GACTGGGTCA CCGGCTGGGT CGAAGGCGGG CCGGCGGGGA TGCTGCACCG GTTGTCGCTG TCGTCGTTCC CCGACGCCGA GGCCATCGTG ACGGCGCCGG AGGAATGGCT CGCCGACGAC ATGAGCGCCG CCGTCGTGTA CAGCACCGCC ATGGGCTCCC ACGGGGTTCT CCCGGGGTTG ATGCCCCATC AGCGTTCAGA GCTCGTGGCC TGGGCCGAGC AGGCGTTCGC TCCGGAGCTG CGTCCCGAGC CGGAGCGGGT GCGCACCCGG CTGGGCCGTT CCACACCGCT CAACACCCCG CCCAAGACGA AGACGTCCGA GTACGACAGG CGCCGCGCCG CGAAGGTGCG GGCCGCCGCG GCGCACGCGA TGCGGGCGCT AGGTGCCGTC GAGGACGACC GGGCCGTGCT GGAGGCGTGG CTGCTGTGGC AGACGCCGCA GATGCGAGAT GAGGCCGTCG AGGCGTTCAT CGGCCTTCTC GGTCTGGGCG GCGGGTCGAG AGCGGCCATC GATGCGCTGC TGAGCGCCGC TTCAGGTCCG CGCGGCCTGC TGGAGTGGCG CACGCCGGAG CTGCTCGTGC GGCTGCACTG CCGGCGTCTG ACCGAAGGGC TCGGCGACGA CCTGGCGCTG CCGGAGGGAC GCCGGACCAA GGCGATGGTC ACCGCCGCGG TCGCCGAGCG GCGCGACCGG GCCGCGCGCT GGATGGCGGC CGAGCGTCCG GGCACGGCGC CGTCGCTGGC GCTGGTGGAA CTCGACCGCG CCGCCGACTT CACCACTTCT GATCACGATC CGAAGTTCGC GCTGCGGCTC GGCTTCGCCA AGGCCGGGCT CCTCACCCAG TTCGTGGCGG TGCCCAAGAA GACCGCGGGC TACGACTCGA CCCGCAACAT CGGGCACCGG GCCGAGAAGG CCTGGGACGA CGGGCTGCGG CAGCTGGGGG TGCGGGTGCT CCCCGAGCAC GGGCTGAGCG ACGGGCTGCC CGAAGGACTG CGCTATGCGG CGATCTGGCT GGTGCGCAAG AACCGGACGA GCCGGACCCG CTGGGCCGGG CACGTCCCGG TCGCCGTGCT GGTGGCCCCC GGTCCTGAGG AGGGAATCGC CGAGGTGCGG GGCTGGGACG CCGAGGCGGA CGGCGGAGCG GGCGCCTGGA TCCCCTATCC GTCGCTGCTG CTGCGGCTGA CCGAGCGGGT CGACGTCTCC TCCGTCCTGG CCGCCGACGA CGAAGACGAG GAGGCCTTCA GACGACCGGA CTACCACCGG GAGATGGAGC GGCAACGCCG GCAGGTCGAG GAATGGCTGC AGACGGTGCT GCGCACCCTG CGAGGCGTCC CGACCCTGCT GCTGGCCTGC GCGCAGAACG TCCGTTCGCA CTGGACGTGG CTGCAGGACG GCCAGGTGCA GCGCGACCGG GTGCGCACCG GGGTGGCCCC GCACCGCAGG CTCGACCCCG ACCTGCGGCT GCTGCGGGTC CGCAAGACCA CGGGCCGGGA GACCCCCCAG TGGTGGGGCG TCCACCCCAA GGACGGGCTG AACGGCCTGG CCGCCCACCT GTGGGTGGCG CCCCGTCCTG ACGGAGCCGG CGGCGGCCGG GTGTTCTGGA GCACCACGCC CAAACCCTCG CAGTTCAAGG ACTCGGCCGT CAGCGGCGAC AAACTCGGCG GCCGGCCGCT CACTCGCGGC TCCGGCAGGC CGACCATCGA CGCCGACAAG GTCGGCTGGA ACCCCGGTCT GGTGGAGCTC GCCGTGCTCG GCTGCCACGA GGACGACGGC GACGATCCGG AGGCGCTGGC GATGGCCGCC CACCACCTGC GCCAGCCCCC GGACTACCCC CAGGCCCTCG CCCTGCCGCT GCCCCTGCAC CTGGCCGAAC TGGCCCAGGA GTACGTCCTC CCGCTGCCGC CGGAGGACTC CGAAGAAACC GCACACGACG AGGATCCGCC GGGCGAGGGA CCTTCAGAAC CGGAAACCGC CCAGCCCTTC GCTCCCAACG GGGAGTGA
|
Protein sequence | MPAYTRIRTA SWLPDSPDAS VTARYQALPF PEQWREVLLE LCNAGRPTDA EPYRTVPTRR MEQVLQTFAP DYLVLPRPRD GRGHWLLVPE GVERLPDQVF RALYNAWLSD LRPDMAKDPH YRELLAKARA MLDDAPPRWE PVELELLRCP VTEGGTAAPL AHQYPLTTDW FARKILALEP YDYGSGTLRF HAVPRGPRDQ GAELVSEPLR FDKDGQRWWY SITLNVTLHT VPFEPLPRIH LHTGIRRWAT RVGAAGRLYL PRRRRTTVLL RPRVPWLPGA SRSDRFAVAR LERRWDREAR DWVTGWVEGG PAGMLHRLSL SSFPDAEAIV TAPEEWLADD MSAAVVYSTA MGSHGVLPGL MPHQRSELVA WAEQAFAPEL RPEPERVRTR LGRSTPLNTP PKTKTSEYDR RRAAKVRAAA AHAMRALGAV EDDRAVLEAW LLWQTPQMRD EAVEAFIGLL GLGGGSRAAI DALLSAASGP RGLLEWRTPE LLVRLHCRRL TEGLGDDLAL PEGRRTKAMV TAAVAERRDR AARWMAAERP GTAPSLALVE LDRAADFTTS DHDPKFALRL GFAKAGLLTQ FVAVPKKTAG YDSTRNIGHR AEKAWDDGLR QLGVRVLPEH GLSDGLPEGL RYAAIWLVRK NRTSRTRWAG HVPVAVLVAP GPEEGIAEVR GWDAEADGGA GAWIPYPSLL LRLTERVDVS SVLAADDEDE EAFRRPDYHR EMERQRRQVE EWLQTVLRTL RGVPTLLLAC AQNVRSHWTW LQDGQVQRDR VRTGVAPHRR LDPDLRLLRV RKTTGRETPQ WWGVHPKDGL NGLAAHLWVA PRPDGAGGGR VFWSTTPKPS QFKDSAVSGD KLGGRPLTRG SGRPTIDADK VGWNPGLVEL AVLGCHEDDG DDPEALAMAA HHLRQPPDYP QALALPLPLH LAELAQEYVL PLPPEDSEET AHDEDPPGEG PSEPETAQPF APNGE
|
| |