Gene Tpau_3336 details


Gene Information

Locus tag: Tpau_3336
Symbol:
ID: 9157510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 3434995
End bp: 3436443
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: sulfatase
Protein accession: YP_003648259
Protein GI: 296141016
COG category:
COG ID:
TIGRFAM ID:
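
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 3434995
END_BP = 3436443
GENE_LENGTH_BP = 1449
PROTEIN_LENGTH_AA = 482


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1449
print(expected_protein_length(GENE_LENGTH_BP))  # 482
```

Both results agree with the Gene Length and Protein Length fields of the record.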


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.990519
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCACACC TACAGGAGAG CACCGGCACG CGGGAGAACG TGATCCTCGT GCACTGGCAC 
GACCTGGGAC GCCACCTCAC CTGTTACGGC GCGGAGGGCG TGGTCAGCCC GCACCTCGAC
CGCCTCGCCG CCGAGGGCAT CCGATTCACC GATGCGCACG CCACCGCGCC CCTGTGCTCG
CCCGCGCGCG GCTCGCTGTT CACCGGTCTG CACCCTCACC GCAACGGTCT CGTCGGGCTC
GCCCACCACG GTTTCGAATA CCGGGCGGGC GTGCAGACGT TGCCGGCGCT GCTCGGCGCC
GCGGGCTATC GCACCGCGCT GTTCGGCATG CAGCACGAGA GCGCCGACCC CTCCCGTTTG
GGTTTCGACA CCGCCGACGT CTCTGATTCG CTGTGCGGCT ACGTGGTCGC CCAGTCTCAG
CAGTGGCTGA CCGATGCCGC CGCGCGCGAC GAGCCGTTCT TCCTGACCGC CGGATTCTTC
GAGACCCACC GCCCCTACCC GGCCGATCAG TACGAACCCG CGGACCCCGA GGCGATCGGT
GTGCCCGGCT TCCTCCCGGA CACCCCACAG GTCCGCGAAG ACCTGGCCGG CCTGCACGGC
AGCATCACCG AGGCGGATGC GGCGGTGGGA CGGTTGCTCG ACACCGTCGA CGAGCTGGGG
CTGGCCGAGA ACACGTGGAT CGTCTTCATC ACCGACCACG GGCTGGCCTT CCCCCGTGCC
AAGTCCACGC TCTACGCCGA GGGCACCGGT GTGGCCCTCA TCGTGCGCCC GCCCCGCGGT
CGTGACCTGC GCCCCCGCGT CTACGACGAC CTGTTCTCCG GCGTCGATCT CACCCCGACG
GTGCTGGACC TGCTCGGCGT GCCCCTCCCG GACGATCTCG ACGGCGAGAC GCACGCCGCG
GAACTGACCG AACCGGCGGG AGACACGGTG CGCGCCGGGC TGTTCACGCA GAAGACCTAT
CACGACGCCT ACGATCCGAT CCGCGCCGTG CGCACCAAGC AGTTCAGCTA CATCGAGAAC
TACGCCGACC GGCCCGCGCT GCTGTTGCCG CTCGATATCG CCGATAGTTC CTCTGCCGGC
TCACTCGACC CCTACGAGGT CGCCGCTCCG CGTCCCCGGC GTGAGCTCTA CGACCTCGTC
GTCGATCCGT ACGAGCGGCA CAACGTGATC GATGAGCCGG CGTATCAGTG GGTGGCGCGC
AGGCTCGGGG CAGTGCTCGC CCGCTGGCGC GCGGAGACCG GTGACGTGCT CCCGACCGAA
GCCGAGGGCA CGGCGATCGC CGAACGCTTC ATGGCCGAGT TCTTCGCAGG CCGCGCGAAG
CCGGACGCCG ACGCGGTGCC GCTGCCCTCG CGCCGTCCGC AGGGCGCCCG TCGTGAACTC
ACCGCCCGCG AACGGGCCGA GCAGGTCCGC GATGAAGCGG GTGACCGAGT CCGGGCGGTC
AACCACTGA
 
Protein sequence
MAHLQESTGT RENVILVHWH DLGRHLTCYG AEGVVSPHLD RLAAEGIRFT DAHATAPLCS 
PARGSLFTGL HPHRNGLVGL AHHGFEYRAG VQTLPALLGA AGYRTALFGM QHESADPSRL
GFDTADVSDS LCGYVVAQSQ QWLTDAAARD EPFFLTAGFF ETHRPYPADQ YEPADPEAIG
VPGFLPDTPQ VREDLAGLHG SITEADAAVG RLLDTVDELG LAENTWIVFI TDHGLAFPRA
KSTLYAEGTG VALIVRPPRG RDLRPRVYDD LFSGVDLTPT VLDLLGVPLP DDLDGETHAA
ELTEPAGDTV RAGLFTQKTY HDAYDPIRAV RTKQFSYIEN YADRPALLLP LDIADSSSAG
SLDPYEVAAP RPRRELYDLV VDPYERHNVI DEPAYQWVAR RLGAVLARWR AETGDVLPTE
AEGTAIAERF MAEFFAGRAK PDADAVPLPS RRPQGARREL TARERAEQVR DEAGDRVRAV
NH
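
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a sketch under stated assumptions, not the database's own method: translation table 11 assigns amino acids to codons the same way as the standard code but allows alternative start codons such as GTG, so the first codon is reported as M here. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and the input is assumed to be a complete CDS written 5' to 3'.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; assumes the first codon is the initiator (read as M)."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()      # drop the terminal stop codon
    if residues:
        residues[0] = "M"   # assumption: alternative start codons encode Met
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
print(translate_cds("GTGGCACACC TACAGGAGAG CACCGGCACG"))  # MAHLQESTGT
```

Passing the full gene sequence to `translate_cds` should yield the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field of the record.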