Gene Information |
Locus tag | Tcur_3730 |
Symbol | |
ID | 8605083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 4279277 |
End bp | 4282183 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein of unknown function UPF0182 |
Protein accession | YP_003301301 |
Protein GI | 269127931 |
COG category | |
COG ID | |
TIGRFAM ID | |
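The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4279277
END_BP = 4282183


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 2907 968
```

Under those assumptions the result matches the Gene Length (2907 bp) and Protein Length (968 aa) fields.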
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.155339 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGACCTTCC GGACTCCCGG ATTCAGCCGG CGGCTGGGTA CCGGCCGGAC TCGATTGCTG TTGCCCATAT TGGCGACATT GGCCGCGCTG GTCGTGGCCT ATCTGGTGTT CACCACCGTC TGGACCGACC TGCTGTGGTA CCGGTCGGTC GGCTTCTCCT CCGTCTTCAC CACACAGCTG TGGGCCAGGG TCGGCCTGTT CGTGGGGTCG GGCCTGCTGC TGGCACTGAT CGTCGGTGCC AACATGGTGA TCGCCTACCG GCTGCGGCCG TCTTACCGGC CGCTGTCGGT GGAGCAGCAG GGGCTGGAGC GCTACCGGGC CGCGGTGGAC CCGCACCGCC GCCTGATCGG TTTCGGCATC GTGACCATGC TGGCGCTGCT GACCGGCTCG TCGATGGCCG GCCAGTGGCC GGTGTGGCTG GCGTTCGTGA ACCGCACGCA GTTCGGCGTC AAGGACCCGC AGTTCGGCAA GGACGTCTCG TTCTACGTCT TCACCTACCC GTTCGCGCGG CTGGTGCTGG GCTTTTTGTT CGCCGCGGTG ATCCTGTCGC TGCTGATGGC GCTGATGGTC CACTACCTGT ACGGGGGGCT GCGGCTGCAG GGGCCCGGCG ACAAGGCCAG CCCGCCGGCC AAGGCCCACC TGTCGGTGCT GGTGGGCCTG TTCGTGCTGC TCAAGGCGGC GGCGTACTGG TTCGACCGGT ACGGGCTGGC CAACTCCGAA CGCGGCGTGG TCAGCGGCCC CGGCTACACC GACCTGAACG CGGTGCTGCC GGCCAAGACG ATCCTGGCGG TCATCGCGGT GATCTGCGCC GCGCTGTTCT TCGTCAACAT CTGGCGGCGC GGGATGATGC TGCCGGGGGT CGGGCTGTCG CTGATGGTGG TGGCGGCGAT CCTGCTCGGC GGCGTCTACC CGCTGCTGAT CCAGCAGTTC CAGGTCAAGC CGGACGAGCT GGCCAAGGAA CGGCAGTACA TCCAGCGCAA CATCGACGCC ACCCGCCGCG CCTACGGGGT GGACAAGGCG GAGGTGATCC CCTACGGCGG GCAGCCCGAA AGCGACCCGG CCAAGCTGTC CTCCGAGGCG CGGGCGCTGA CCGGGGTGCG GCTGCTGGAC CCGAACGTGG TCGGCGAGAC CTTCCAGCAG CTGCAGCAGG GCCGTAACTT CTACCGCTTC CCCGACACGC TGGATGTGGA CCGCTACCAG ATCGACGGCA AGACCAGGGA CGTGGTGGTG GCCGTCCGCG AGCTGTCGGG CGCGCCGGCC GGGCAGCAGA GCTGGGTGAA GGACCGGCTG GTCTACACCC ACGGCTACGG GTTCGTCTCC GGCTACGGCG AGCAGCTGGC CGGCAACGGC ACCCCGCAGT GGGTGACCAA GGACATGCCG CCCACCGGCG AGCTGAAGAT CGACAAGCCG CAGATCTACT TCGGGGAGCT GTCCAACAGC TACTCGATCG TCGGCGGCAA GGGCCAGCAG GAGCTGGACT ACCCCGACGA CAGCCCCGCC GGGCAGAAGA ACACCACCTA CACCGGCAAG GGCGGCGTGC CGGTGGACTC GCTGTTCAAC CGCCTGCTGT TCGCCACCAA GTTCTCCGAC CGCAACATCC TGCTGTCGGG GGCGATCAAC GAGGGCGCCA AGATCCTCTA CCACCGGACG CCGCGGGAGA TGGTGCAGCG GGTGGCGCCC TGGCTGACGC TGGACGGCAA CCCCTACCCG GCGGTGGTGA ACGGCCGGAT CGTGTGGATC CTGGACGGCT ACACCACCTC CAACGGCTAC CCGTACGCCG AGCGGATGAG CCTGGGCGAC 
GCCACCCGCG ACACCGTCAC CGACACCCGT TCGGCGGTGG CCCGCCAGGC CAACGACCAC ATCAACTACA TCCGCAACTC GATCAAGGCG ACGGTGGACG CCTACGACGG GACGGTCCGG CTGTACCTGT GGGACGAGAA CGACCCGGTG GCCAAGACCT GGATGAAGGT CTTCGACGGC ACCGTGCTGC CCAAGAGCGC CATCTCCCCG GAGCTGATGC AGCACTTCCG CTACCCGCAG GACCTGTTCA AGGTGCAGCG GCAGGTGCTG GCGCGCTACC ACGTGACCCA GGCCGACGCC TTCTACGGCG CGCAGGGCTT CTGGCAGGTG CCGCAGGACC CGACCTCGCC GGGCCGGGCC CAGCCGCCGT ACTACCTGAG CCTGAAGCTG CCCGGTGACC AGAGCGCGCA GTTCTCGCTG ACCACGGTGT TCAACCCGCG CGGCCGTCCC AACCTGGCGG CGTTCATGGC GGTGGACTCC ACGCCCGGCC CCAACTACGG GCGGATCCGG ATCCTGGAGC TGCCGCGCAA CTCGCTGATC CAGGGCCCCG GCCAGATCCA GAACTCCTTC GAGGCCGACA CCGCGGTCAA GGACGTGCTG TTCAAGCTGC GCCAGGGCGG CACCCGGACG GTGCCCGGCA ACCTGCTGAC GCTGCCGTTC GGCGGCGGGC TGCTGTATGT GGAGCCGATG TACGCCCAGG CGGCCGGCGG CTCGGAGCAG GAGCCCTACC CGGTGCTGCG GCAGGTGCTG GTGGCCTTCG GTGACAAGGT CGCCGCCGGC GACACCCTGG ACGCGGCGCT GGAGCAGCTG TTCAAGGGCG GCGGCGCGGC CCCGCCGGCG CAGCAGGATG ACGCCGAGCC GCCTTCCGGC GACGACCTGA ACGCCGACGC CCGCCAGGCC CTGGCGGACG CCCAGCGGTA CTTCCAGGAG GGCCAGGATG CGCTGAGCAA GTCGCCGCCG GACTGGGCGG CCTACGGCGA GGCGCAGCGG AAGCTGCAGG ACGCGCTGAA CCGCCTGGCC GAGGCCCAGC GGGCATCGGC CCAGCAGTCC CAGTCCTCCC CCAGCCCGTC CCCGTCCGGC TCGGCGAGCC CGTCCCCCGG GGAGTGA
|
Protein sequence | MTFRTPGFSR RLGTGRTRLL LPILATLAAL VVAYLVFTTV WTDLLWYRSV GFSSVFTTQL WARVGLFVGS GLLLALIVGA NMVIAYRLRP SYRPLSVEQQ GLERYRAAVD PHRRLIGFGI VTMLALLTGS SMAGQWPVWL AFVNRTQFGV KDPQFGKDVS FYVFTYPFAR LVLGFLFAAV ILSLLMALMV HYLYGGLRLQ GPGDKASPPA KAHLSVLVGL FVLLKAAAYW FDRYGLANSE RGVVSGPGYT DLNAVLPAKT ILAVIAVICA ALFFVNIWRR GMMLPGVGLS LMVVAAILLG GVYPLLIQQF QVKPDELAKE RQYIQRNIDA TRRAYGVDKA EVIPYGGQPE SDPAKLSSEA RALTGVRLLD PNVVGETFQQ LQQGRNFYRF PDTLDVDRYQ IDGKTRDVVV AVRELSGAPA GQQSWVKDRL VYTHGYGFVS GYGEQLAGNG TPQWVTKDMP PTGELKIDKP QIYFGELSNS YSIVGGKGQQ ELDYPDDSPA GQKNTTYTGK GGVPVDSLFN RLLFATKFSD RNILLSGAIN EGAKILYHRT PREMVQRVAP WLTLDGNPYP AVVNGRIVWI LDGYTTSNGY PYAERMSLGD ATRDTVTDTR SAVARQANDH INYIRNSIKA TVDAYDGTVR LYLWDENDPV AKTWMKVFDG TVLPKSAISP ELMQHFRYPQ DLFKVQRQVL ARYHVTQADA FYGAQGFWQV PQDPTSPGRA QPPYYLSLKL PGDQSAQFSL TTVFNPRGRP NLAAFMAVDS TPGPNYGRIR ILELPRNSLI QGPGQIQNSF EADTAVKDVL FKLRQGGTRT VPGNLLTLPF GGGLLYVEPM YAQAAGGSEQ EPYPVLRQVL VAFGDKVAAG DTLDAALEQL FKGGGAAPPA QQDDAEPPSG DDLNADARQA LADAQRYFQE GQDALSKSPP DWAAYGEAQR KLQDALNRLA EAQRASAQQS QSSPSPSPSG SASPSPGE
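The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do it, assuming the sequence is pasted as shown here, in space-separated blocks that may wrap across lines, so whitespace is stripped before counting. The helper name `gc_content` is hypothetical and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above.
fragment = "GTGACCTTCC GGACTCCCGG"
print(f"{gc_content(fragment):.0%}")
```

Passing the full gene sequence instead of the fragment gives the value to compare against the GC content field.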
|