Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2752 |
Symbol | |
ID | 8604095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 3205137 |
End bp | 3206834 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003300336 |
Protein GI | 269126966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000166333 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTGACA CGACCACCGA TCTCTCCCGG CGCGCATCCG AAGAGGAGAC CGCCGAAAAG CCCGGCCGCC CCAGAAGGGC ACGGCGGATC GCGGGGCGGG TGGCCACCGC CGCGGCGTGC CTGGTCGTGG TGGCGGGCTT CACCATGCCC AACGACCTGG ACGGCCTGAC GCCGAGCGCC TTCGTGCGGC TGCCGCTGGA GATCGTCCTC GGCCTCGCCG TGGTGGCCGC CGTGCCCCGG GCGCGGCGTG CGGTGGCGGC GCTCCTGGGC GCGGCGCTCG GCCTGCTGGT CATCGTGAAG GTCATCGACA TGGGCTTTCA CGCGACGCTG GACCGGCCGT TCCACCCGGT GTTCGACTGG TCCCTGTTCG GGCCCGCGCT GGAGTACCTC GACCAGTCGG CCGGGCGGGC CACCGCGATC GGCGCCGCCG CCGGGGCGGT GGCGCTGGCG GTGGCCGTGC TCGCCGCCAC GACCCTGTCG CTGCTGCGGC TGAGCCGGCC GGTGGTGCGG CACCGCGCCG CGGCCGCCCG CGCCGCCGTG GCGCTGGGAG CCGTCTGGGT CGCCTGCGCC GTGCTCGGCG CCCGGCTCCC ACCGGGCGTC CCGGTCGCCG CCGTCGCCTT CGACCGTGCC CTGCAGATCC CCGCGGACCT GCGGGAACAG CGCAGGTTCG CCGCGCAGGC CGGCCGGGAC GCCTTCGCGC ACACCCCCGG CGAGCAGATG CTGACCGCGC TGCGCGGCAA GGACGTCGTC TTCGCGTTCA TCGAAAGCTA CGGCCGCGAC GCGCTGGAGA ACCCCACGTT CGCCGCGCGG GTCGGCGCGG TGCTGGAGGA CGGGAACCGC CGGCTGCGCA AGGCCGGGTT CGAGGCCCGC AGCGCCTTCC TCACCTCGCC GACGGTGGGC GGCAGCAGCT GGCTGGCCCA CGCCACGCTG CTGTCGGGGC TGTGGATCGA CAACCAAAAA CGCCACCGCG ACCTGGTCAC CAGCGACCGG CTGACCCTCA CCGGGGCCTT CCGGCGCGCG AAATGGCGGA CGGTGGCCGT CATGCCCGGC AACACCAAAC CCTGGCCGGA GGGGAACTTC TACGGATATG ACAAGGTTTA TCCCCGCGCG GCCCTGGGTT ATAAGGGTCC GCCTTTCAAC TGGGACACCC CGCCCGACCA GTACACATTG TCGTTCTTTG AACGCACGGA ACGGGCCAGG CGCGACCGCC CGCCGCTGAT GGCGGAGATC CCGCTGGTGT CCAGCCATTC GCCGTGGGCG CCCACTCCCC GCCTGGTGGG CTGGGACGAG GTGGGCGACG GTGAGGTCTT CGGCCCGATC GCGGCGGCCG GCCAGCGCTG GCAGGACGCC TGGCGCACTC CCGAGCGGAT GCGCGCCGCC TACCGGGGCG CCATCGAGTA CACGCTGGCC GCGCTCCTGT CCTACGTGGA GACCTACGGC GATGAGAACC TGGTGGTGAT CTTCGTCGGC GATCACCAGC CCGCCCCCGT CATCACCGGC CCCGACGCTA GCCGGGACGT GCCGATCGCG ATCGTCGCCC GCGACCGGGC CGTGCTGGAG CGGATCTCCG GGTGGGGCTG GCAGGAGGGC GTGAAACCGG GGCCGGACGC CCCGGTCTGG CGCATGGACG CCTTCCGTGA CCGTTTCCTG ACCGCCTTCG GGACACGTCC GCGGCCGGAT TCTCCGTCAT CGCCGTGA
|
Protein sequence | MTDTTTDLSR RASEEETAEK PGRPRRARRI AGRVATAAAC LVVVAGFTMP NDLDGLTPSA FVRLPLEIVL GLAVVAAVPR ARRAVAALLG AALGLLVIVK VIDMGFHATL DRPFHPVFDW SLFGPALEYL DQSAGRATAI GAAAGAVALA VAVLAATTLS LLRLSRPVVR HRAAAARAAV ALGAVWVACA VLGARLPPGV PVAAVAFDRA LQIPADLREQ RRFAAQAGRD AFAHTPGEQM LTALRGKDVV FAFIESYGRD ALENPTFAAR VGAVLEDGNR RLRKAGFEAR SAFLTSPTVG GSSWLAHATL LSGLWIDNQK RHRDLVTSDR LTLTGAFRRA KWRTVAVMPG NTKPWPEGNF YGYDKVYPRA ALGYKGPPFN WDTPPDQYTL SFFERTERAR RDRPPLMAEI PLVSSHSPWA PTPRLVGWDE VGDGEVFGPI AAAGQRWQDA WRTPERMRAA YRGAIEYTLA ALLSYVETYG DENLVVIFVG DHQPAPVITG PDASRDVPIA IVARDRAVLE RISGWGWQEG VKPGPDAPVW RMDAFRDRFL TAFGTRPRPD SPSSP
|
| |