Gene Tpau_2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_2981 
Symbol 
ID9157149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3092908 
End bp3095208 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content68% 
IMG OID 
Productsulfatase 
Protein accessionYP_003647916 
Protein GI296140673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATCC CCGACTACGC TCGCGGCTAC GAAGGATTCG CGGGACGGAT CGGGCGGACG 
GAGGCGGAGT CCGAACCGGC GTGGCCGGCC GAGCGCCGCG CTCGCCCCGG ATCTCCCAAC
ATCATCGTCG TCCTCGTCGA CGATATGGGC TTCTCCGATA TCGGGCCGTA CGGCTCGGAG
ATCCCCACAC CGCATCTGGA CGCCCTGGCC GCACGCGGAA TCCGATCCGT CAACCACCAC
ACCACCCCGG TGTGTTCACC CGCCCGCGCA GCTCTGCTCA CCGGTATCAA TCCGCACCGG
GCGGGCTACG CTTCGGTGGC CAATTCCGAT CCGGGCTACC CGAATCTGCG CCTGAGCCTG
GCCGATGACG TGCTGACCCT GCCCGAGATC CTCCGTGAGG CCGGCTACGC CACCTACGCC
GTGGGCAAAT GGCATCTCGC GAAGGACTCC CGGCTCGGCC CGGACGCGGA CCGCGGATCA
TGGCCTTTGC AAAGGGGATT CGATCACTAC TACGGCTCCC TGGAGGGACT CAACTCGTTC
TTCCACCCGA ACCAGCTGGT ACGCGACAAC ACCGCCGATC CGGTCACGGA GTACCCGGAT
GACTTCTACG TGACCGACGC ACTCACCGAC ACCGCGACGA GCTGGCTCAA GGACCTGCGC
GCGCACGATG CCGACAAACC CTTCTTCCTG TACTTCGCCC ACATCGCGAT GCACGGACCA
CTGCAGGTCA AGGAATCCGA CCTCCTCCGA GGCGACGTGG ACTACGCCCG CGGCTGGGAC
ATCGTGCGGG AGAGACGGTT CGCTCGGCAA CGCGAACTCG GGCTGTGGGG CGACGGGGTC
GAACCCGCCG GTCGCAACCG TGAGCCCGGA TACGACGTGC CCGCCTGGGA GGAGCTCACC
CCCGACCGGC AGCGCCGGTT CGCGAAGTAC ATGCAGGTGT ACGCCGCGAT GGTGCGCACC
GTCGACGACA GCCTGGGCCG GCTGCTCGCC ACAGTCGACG AGCTGGGCGA ACTCGACGAC
ACCATCGTGG TGTTCAGCTC CGATAACGGC GGCACCGCAG AGGGCGGTCC GGAGGGAACG
CGCAGCTACT TCGCCGAATT CGCGAAGTTC GCCTCCGGCA CCGCACCCGA GGGCTGGGAG
GGCGACGTCG ACCACCCCGA GGAACTCATC GGATCCGCCC GACTCGGCGT GCACTACCCA
CGGGGCTGGG GACAAGTCTC GAACACCCCG TTCCGGTTCT ACAAAGGACA GACATTCGCC
GGCGGCGTCC GCGTTCCGCT CGTGCTGTCC TGGCCGGGCG GACTGGACGC CCGTGGCGTG
CGCGACCAGT ACTCCTTCGT CACCGACGTG GCACCGACGC TGCTGGACCT CGCCGGGGTC
GGCACTCCGG GCACCCGCAA CGGCATCGCC GCGAAGCAAC GCGATGGACT CTCACAGCGA
ACCGCCTGGT CGGACCCGAC AGCACCGTCG GCGCGATCAC ATCAGTACTC GGAGTTCCGG
GGCCACCGCG GGTATTACCG CGACGGCTGG AAACTGCTGA GCCGCTTCGA CCCCGGTGAC
GATCCGGTGA GCCCGCGATG GGAGCTGTAC GACAATCGCA CCGACCCGGC GGAGACTCGC
GATCTGGCTG CCGAGCGTCC CGCCCTGGTG GCCGAGCTCG CCGCGGAGTG GGAGGAGCAG
GCCTGGCGGA ACACCGTGTT CCCCATCGTG ATCGACGGCT CCGCGCGCAA CCCCGCCGAA
CGCCGCCTCG CCGAGCCGGT GCGCCTGCTC CCGGGCACGC CGGTCCTGGA GCGGTACCGA
TCGGCGAAAC TGGTGCAGTA CCGGGACTTC CGGGTGGAGA TCGATGCCGC TGTCGGGATC
GCCGACGAAG GCGTGCTGGT GGCCCACGGC GACGCTTTGG GCGGATACCT GGTGTATGTC
GAAGCAGGCG AGATCGTGGT CGGCTACAAC GCCTACGGCC GGTATCACGA GGCGAGGGCA
CCGATCGAGC CAGGAGAGCA CCGGATCGAC CTGGCCGCAA CGGTTCGCCC GGGGCTGCGC
TGGGACCTGC TACTCAGCAT CGACGGAGCC CCCGCCGCGC ACCTTCCGAA CCAGGTGCAA
CTGATCGGCA TGGCGCCGTG GACGGGTATC TCGGTGGGAC TCGACGCCCG CGGCCCGGTG
GCGTGGGACC TGCGCGAGCG GCGCGGCACC TTCCCCTATT CGGGCGTAGT GCGATCCGTG
ACATACAGAC CGGGCCCGAT CGGGGTTCCC GACCGCGATA TCGAGCGGCT CGAACAGGAA
GCCGAGGAGG AAGCGGACTG A
 
Protein sequence
MTIPDYARGY EGFAGRIGRT EAESEPAWPA ERRARPGSPN IIVVLVDDMG FSDIGPYGSE 
IPTPHLDALA ARGIRSVNHH TTPVCSPARA ALLTGINPHR AGYASVANSD PGYPNLRLSL
ADDVLTLPEI LREAGYATYA VGKWHLAKDS RLGPDADRGS WPLQRGFDHY YGSLEGLNSF
FHPNQLVRDN TADPVTEYPD DFYVTDALTD TATSWLKDLR AHDADKPFFL YFAHIAMHGP
LQVKESDLLR GDVDYARGWD IVRERRFARQ RELGLWGDGV EPAGRNREPG YDVPAWEELT
PDRQRRFAKY MQVYAAMVRT VDDSLGRLLA TVDELGELDD TIVVFSSDNG GTAEGGPEGT
RSYFAEFAKF ASGTAPEGWE GDVDHPEELI GSARLGVHYP RGWGQVSNTP FRFYKGQTFA
GGVRVPLVLS WPGGLDARGV RDQYSFVTDV APTLLDLAGV GTPGTRNGIA AKQRDGLSQR
TAWSDPTAPS ARSHQYSEFR GHRGYYRDGW KLLSRFDPGD DPVSPRWELY DNRTDPAETR
DLAAERPALV AELAAEWEEQ AWRNTVFPIV IDGSARNPAE RRLAEPVRLL PGTPVLERYR
SAKLVQYRDF RVEIDAAVGI ADEGVLVAHG DALGGYLVYV EAGEIVVGYN AYGRYHEARA
PIEPGEHRID LAATVRPGLR WDLLLSIDGA PAAHLPNQVQ LIGMAPWTGI SVGLDARGPV
AWDLRERRGT FPYSGVVRSV TYRPGPIGVP DRDIERLEQE AEEEAD