Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_0054 |
Symbol | |
ID | 8601346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 55918 |
End bp | 57303 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003297700 |
Protein GI | 269124330 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 71 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCGGGA CGTTCCCGTT GACGCGGCGC AGGGTGCTGC TGTCCGGGGC GGTGGCCGCC ACGGTGGCGG GGATCTGCGG CGGGACGTCC CGTCCGGTGT CGGGCCGGGG GCGGCCCAAC GTGCTGCTGC TGGTCACCGA CGATCAGCCG TTGCACACCG AGTGGGCCAT GCCGGTTCTG CGCGACATGA TCAAACGGAG CGGTGTGCGT TTCACCCGCG CCTATGCCAC GACGCCGCTG TGCGGGCCGT CCCGTGCCTC GATCCTGTCG GGGCGGTACG CCCATAACCA CGGGGTGCTG CAGAACGGCC GTCCCGAGCG GCTGGATCAG AGCACCGTCC TGCCCCGCTA CCTGCGGGAG GCCGGGTACC GCACCGCGAT GTTCGGCAAG TACCTCAACG GCTGGGACGT CCACCAGGCT CCGCCGCACT TTGACGAGTA CGCCCTCATG CACCCGCCCA AGTACGGCGA GACCTGGTGG AACGTCAACG GGAAGGTGAG CAGGAAGCGC GCCTACAGCA CCTCCCTCAT CAGGGACCAC GCCGTGCGGT TCCTGCGGCG GCACCGCGCG AGCGGACGCC CGTGGTTCCT GTACCTGACT CCCTACGCCC CCCACGCGCC GTTCACCCCT GAGGCGCGGT ATGCGAATCT GAGCGTCCCT TCATGGCGCG GCAACCCCGC CGTCGCCGAG TCGGATAAGC GGGACAAGCC GTTCTATATT CGGCGGTCCG ACCCCGATCT TCACCGTGCC CGCCGCATCC GCGCCGGTCA GCTGCGCACC TTACGCTCCG TGGACGACCT GCTCGGCGCG GTGCGGGACG AGCTGCGCGC CCAGCGCCGG CTGGACGACA CGCTCATCAT CGTCATCAGC GACAACGGCT ACTGCTGGGG GGATCACGGC TGGCACGCCA AGAGCGTCCC CTACTCCCCC GCGGTCCGCA TCCCGCTGTA CCTGTCGTGG CCGGCCGGCG GGCTCGGCAG GGGCGCCACC GACGACCGGC TGGTGGCCAA CATCGACATC ATGCCCACGA TCTTGGACGC GGCGGGCATC GATCCCGGCG CCGCGAGACT GGACGGCCGG TCGCTGCTGC GTCCCGGGGA ACGCGACCGG CTGCTGTTGG AATGGTGGAA GAGGGGCCCG GGGCAGGCCG GGCATAGCTG GGCGGCCACG GTCACGCGGG ACTACCAGTA CATCGAGCAC TACGACACCA TCTTGCGCCG GGGCAGGCCG GTGGGGTCGG GAGCGGTGGT GCATCGCGAG TACTACGACT TGCGCAAGGA CCCGCACCAG CTCACCAACC TGCTGCACCG CACGGGGTCC GGCGTGGCGC GGCGGCTGGA CGTGGCGGGT CTGTCCGCCC GCCTGGCGGC CGACCGGAGG GCCTGA
|
Protein sequence | MSGTFPLTRR RVLLSGAVAA TVAGICGGTS RPVSGRGRPN VLLLVTDDQP LHTEWAMPVL RDMIKRSGVR FTRAYATTPL CGPSRASILS GRYAHNHGVL QNGRPERLDQ STVLPRYLRE AGYRTAMFGK YLNGWDVHQA PPHFDEYALM HPPKYGETWW NVNGKVSRKR AYSTSLIRDH AVRFLRRHRA SGRPWFLYLT PYAPHAPFTP EARYANLSVP SWRGNPAVAE SDKRDKPFYI RRSDPDLHRA RRIRAGQLRT LRSVDDLLGA VRDELRAQRR LDDTLIIVIS DNGYCWGDHG WHAKSVPYSP AVRIPLYLSW PAGGLGRGAT DDRLVANIDI MPTILDAAGI DPGAARLDGR SLLRPGERDR LLLEWWKRGP GQAGHSWAAT VTRDYQYIEH YDTILRRGRP VGSGAVVHRE YYDLRKDPHQ LTNLLHRTGS GVARRLDVAG LSARLAADRR A
|
| |