Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2981 |
Symbol | |
ID | 9157149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3092908 |
End bp | 3095208 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003647916 |
Protein GI | 296140673 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATCC CCGACTACGC TCGCGGCTAC GAAGGATTCG CGGGACGGAT CGGGCGGACG GAGGCGGAGT CCGAACCGGC GTGGCCGGCC GAGCGCCGCG CTCGCCCCGG ATCTCCCAAC ATCATCGTCG TCCTCGTCGA CGATATGGGC TTCTCCGATA TCGGGCCGTA CGGCTCGGAG ATCCCCACAC CGCATCTGGA CGCCCTGGCC GCACGCGGAA TCCGATCCGT CAACCACCAC ACCACCCCGG TGTGTTCACC CGCCCGCGCA GCTCTGCTCA CCGGTATCAA TCCGCACCGG GCGGGCTACG CTTCGGTGGC CAATTCCGAT CCGGGCTACC CGAATCTGCG CCTGAGCCTG GCCGATGACG TGCTGACCCT GCCCGAGATC CTCCGTGAGG CCGGCTACGC CACCTACGCC GTGGGCAAAT GGCATCTCGC GAAGGACTCC CGGCTCGGCC CGGACGCGGA CCGCGGATCA TGGCCTTTGC AAAGGGGATT CGATCACTAC TACGGCTCCC TGGAGGGACT CAACTCGTTC TTCCACCCGA ACCAGCTGGT ACGCGACAAC ACCGCCGATC CGGTCACGGA GTACCCGGAT GACTTCTACG TGACCGACGC ACTCACCGAC ACCGCGACGA GCTGGCTCAA GGACCTGCGC GCGCACGATG CCGACAAACC CTTCTTCCTG TACTTCGCCC ACATCGCGAT GCACGGACCA CTGCAGGTCA AGGAATCCGA CCTCCTCCGA GGCGACGTGG ACTACGCCCG CGGCTGGGAC ATCGTGCGGG AGAGACGGTT CGCTCGGCAA CGCGAACTCG GGCTGTGGGG CGACGGGGTC GAACCCGCCG GTCGCAACCG TGAGCCCGGA TACGACGTGC CCGCCTGGGA GGAGCTCACC CCCGACCGGC AGCGCCGGTT CGCGAAGTAC ATGCAGGTGT ACGCCGCGAT GGTGCGCACC GTCGACGACA GCCTGGGCCG GCTGCTCGCC ACAGTCGACG AGCTGGGCGA ACTCGACGAC ACCATCGTGG TGTTCAGCTC CGATAACGGC GGCACCGCAG AGGGCGGTCC GGAGGGAACG CGCAGCTACT TCGCCGAATT CGCGAAGTTC GCCTCCGGCA CCGCACCCGA GGGCTGGGAG GGCGACGTCG ACCACCCCGA GGAACTCATC GGATCCGCCC GACTCGGCGT GCACTACCCA CGGGGCTGGG GACAAGTCTC GAACACCCCG TTCCGGTTCT ACAAAGGACA GACATTCGCC GGCGGCGTCC GCGTTCCGCT CGTGCTGTCC TGGCCGGGCG GACTGGACGC CCGTGGCGTG CGCGACCAGT ACTCCTTCGT CACCGACGTG GCACCGACGC TGCTGGACCT CGCCGGGGTC GGCACTCCGG GCACCCGCAA CGGCATCGCC GCGAAGCAAC GCGATGGACT CTCACAGCGA ACCGCCTGGT CGGACCCGAC AGCACCGTCG GCGCGATCAC ATCAGTACTC GGAGTTCCGG GGCCACCGCG GGTATTACCG CGACGGCTGG AAACTGCTGA GCCGCTTCGA CCCCGGTGAC GATCCGGTGA GCCCGCGATG GGAGCTGTAC GACAATCGCA CCGACCCGGC GGAGACTCGC GATCTGGCTG CCGAGCGTCC CGCCCTGGTG GCCGAGCTCG CCGCGGAGTG GGAGGAGCAG GCCTGGCGGA ACACCGTGTT CCCCATCGTG ATCGACGGCT CCGCGCGCAA CCCCGCCGAA CGCCGCCTCG CCGAGCCGGT GCGCCTGCTC CCGGGCACGC CGGTCCTGGA GCGGTACCGA TCGGCGAAAC TGGTGCAGTA CCGGGACTTC CGGGTGGAGA TCGATGCCGC TGTCGGGATC GCCGACGAAG GCGTGCTGGT GGCCCACGGC GACGCTTTGG GCGGATACCT GGTGTATGTC GAAGCAGGCG AGATCGTGGT CGGCTACAAC GCCTACGGCC GGTATCACGA GGCGAGGGCA CCGATCGAGC CAGGAGAGCA CCGGATCGAC CTGGCCGCAA CGGTTCGCCC GGGGCTGCGC TGGGACCTGC TACTCAGCAT CGACGGAGCC CCCGCCGCGC ACCTTCCGAA CCAGGTGCAA CTGATCGGCA TGGCGCCGTG GACGGGTATC TCGGTGGGAC TCGACGCCCG CGGCCCGGTG GCGTGGGACC TGCGCGAGCG GCGCGGCACC TTCCCCTATT CGGGCGTAGT GCGATCCGTG ACATACAGAC CGGGCCCGAT CGGGGTTCCC GACCGCGATA TCGAGCGGCT CGAACAGGAA GCCGAGGAGG AAGCGGACTG A
|
Protein sequence | MTIPDYARGY EGFAGRIGRT EAESEPAWPA ERRARPGSPN IIVVLVDDMG FSDIGPYGSE IPTPHLDALA ARGIRSVNHH TTPVCSPARA ALLTGINPHR AGYASVANSD PGYPNLRLSL ADDVLTLPEI LREAGYATYA VGKWHLAKDS RLGPDADRGS WPLQRGFDHY YGSLEGLNSF FHPNQLVRDN TADPVTEYPD DFYVTDALTD TATSWLKDLR AHDADKPFFL YFAHIAMHGP LQVKESDLLR GDVDYARGWD IVRERRFARQ RELGLWGDGV EPAGRNREPG YDVPAWEELT PDRQRRFAKY MQVYAAMVRT VDDSLGRLLA TVDELGELDD TIVVFSSDNG GTAEGGPEGT RSYFAEFAKF ASGTAPEGWE GDVDHPEELI GSARLGVHYP RGWGQVSNTP FRFYKGQTFA GGVRVPLVLS WPGGLDARGV RDQYSFVTDV APTLLDLAGV GTPGTRNGIA AKQRDGLSQR TAWSDPTAPS ARSHQYSEFR GHRGYYRDGW KLLSRFDPGD DPVSPRWELY DNRTDPAETR DLAAERPALV AELAAEWEEQ AWRNTVFPIV IDGSARNPAE RRLAEPVRLL PGTPVLERYR SAKLVQYRDF RVEIDAAVGI ADEGVLVAHG DALGGYLVYV EAGEIVVGYN AYGRYHEARA PIEPGEHRID LAATVRPGLR WDLLLSIDGA PAAHLPNQVQ LIGMAPWTGI SVGLDARGPV AWDLRERRGT FPYSGVVRSV TYRPGPIGVP DRDIERLEQE AEEEAD
|
| |