Gene Information |
Locus tag | Tpau_3336 |
Symbol | |
ID | 9157510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3434995 |
End bp | 3436443 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003648259 |
Protein GI | 296141016 |
COG category | |
COG ID | |
TIGRFAM ID | |
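The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the listed gene length includes the stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(3434995, 3436443)
print(length)                           # 1449
print(expected_protein_length(length))  # 482
```

Under these assumptions, both results agree with the Gene Length (1449 bp) and Protein Length (482 aa) rows.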
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.990519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCACACC TACAGGAGAG CACCGGCACG CGGGAGAACG TGATCCTCGT GCACTGGCAC GACCTGGGAC GCCACCTCAC CTGTTACGGC GCGGAGGGCG TGGTCAGCCC GCACCTCGAC CGCCTCGCCG CCGAGGGCAT CCGATTCACC GATGCGCACG CCACCGCGCC CCTGTGCTCG CCCGCGCGCG GCTCGCTGTT CACCGGTCTG CACCCTCACC GCAACGGTCT CGTCGGGCTC GCCCACCACG GTTTCGAATA CCGGGCGGGC GTGCAGACGT TGCCGGCGCT GCTCGGCGCC GCGGGCTATC GCACCGCGCT GTTCGGCATG CAGCACGAGA GCGCCGACCC CTCCCGTTTG GGTTTCGACA CCGCCGACGT CTCTGATTCG CTGTGCGGCT ACGTGGTCGC CCAGTCTCAG CAGTGGCTGA CCGATGCCGC CGCGCGCGAC GAGCCGTTCT TCCTGACCGC CGGATTCTTC GAGACCCACC GCCCCTACCC GGCCGATCAG TACGAACCCG CGGACCCCGA GGCGATCGGT GTGCCCGGCT TCCTCCCGGA CACCCCACAG GTCCGCGAAG ACCTGGCCGG CCTGCACGGC AGCATCACCG AGGCGGATGC GGCGGTGGGA CGGTTGCTCG ACACCGTCGA CGAGCTGGGG CTGGCCGAGA ACACGTGGAT CGTCTTCATC ACCGACCACG GGCTGGCCTT CCCCCGTGCC AAGTCCACGC TCTACGCCGA GGGCACCGGT GTGGCCCTCA TCGTGCGCCC GCCCCGCGGT CGTGACCTGC GCCCCCGCGT CTACGACGAC CTGTTCTCCG GCGTCGATCT CACCCCGACG GTGCTGGACC TGCTCGGCGT GCCCCTCCCG GACGATCTCG ACGGCGAGAC GCACGCCGCG GAACTGACCG AACCGGCGGG AGACACGGTG CGCGCCGGGC TGTTCACGCA GAAGACCTAT CACGACGCCT ACGATCCGAT CCGCGCCGTG CGCACCAAGC AGTTCAGCTA CATCGAGAAC TACGCCGACC GGCCCGCGCT GCTGTTGCCG CTCGATATCG CCGATAGTTC CTCTGCCGGC TCACTCGACC CCTACGAGGT CGCCGCTCCG CGTCCCCGGC GTGAGCTCTA CGACCTCGTC GTCGATCCGT ACGAGCGGCA CAACGTGATC GATGAGCCGG CGTATCAGTG GGTGGCGCGC AGGCTCGGGG CAGTGCTCGC CCGCTGGCGC GCGGAGACCG GTGACGTGCT CCCGACCGAA GCCGAGGGCA CGGCGATCGC CGAACGCTTC ATGGCCGAGT TCTTCGCAGG CCGCGCGAAG CCGGACGCCG ACGCGGTGCC GCTGCCCTCG CGCCGTCCGC AGGGCGCCCG TCGTGAACTC ACCGCCCGCG AACGGGCCGA GCAGGTCCGC GATGAAGCGG GTGACCGAGT CCGGGCGGTC AACCACTGA
|
Protein sequence | MAHLQESTGT RENVILVHWH DLGRHLTCYG AEGVVSPHLD RLAAEGIRFT DAHATAPLCS PARGSLFTGL HPHRNGLVGL AHHGFEYRAG VQTLPALLGA AGYRTALFGM QHESADPSRL GFDTADVSDS LCGYVVAQSQ QWLTDAAARD EPFFLTAGFF ETHRPYPADQ YEPADPEAIG VPGFLPDTPQ VREDLAGLHG SITEADAAVG RLLDTVDELG LAENTWIVFI TDHGLAFPRA KSTLYAEGTG VALIVRPPRG RDLRPRVYDD LFSGVDLTPT VLDLLGVPLP DDLDGETHAA ELTEPAGDTV RAGLFTQKTY HDAYDPIRAV RTKQFSYIEN YADRPALLLP LDIADSSSAG SLDPYEVAAP RPRRELYDLV VDPYERHNVI DEPAYQWVAR RLGAVLARWR AETGDVLPTE AEGTAIAERF MAEFFAGRAK PDADAVPLPS RRPQGARREL TARERAEQVR DEAGDRVRAV NH
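The gene sequence is listed in space-separated blocks of ten bases, so it needs light cleaning before any check against the GC content or translation table rows. The sketch below is one way to do that, under these assumptions: the helper names are hypothetical, the start codon set is only the commonly used subset of the initiators allowed by translation table 11, and the short input string is a toy example rather than the full sequence from this record.

```python
# Commonly used subset of table 11 initiation codons (an assumption; the table allows more).
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons of translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts and ends on expected codons."""
    s = clean(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in START_CODONS
        and s[-3:] in STOP_CODONS
    )


# Toy input only: a GTG start codon, two sense codons, and a TGA stop codon.
toy = "GTGGCA CACTGA"
print(clean(toy))                    # GTGGCACACTGA
print(looks_like_complete_cds(toy))  # True
```

Pasting the full Gene sequence value into `gc_fraction` and `looks_like_complete_cds` would allow a comparison with the GC content and Gene Length rows. The sequence as listed begins with GTG and ends with TGA, which suggests it is shown in coding orientation even though the gene lies on the minus strand.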