Gene Tpau_1037 details

Gene Information

Locus tag: Tpau_1037
Symbol:
ID: 9155177
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 1063475
End bp: 1064863
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Sterol 3-beta-glucosyltransferase
Protein accession: YP_003646009
Protein GI: 296138766
COG category:
COG ID:
TIGRFAM ID:
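
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
start_bp = 1063475
end_bp = 1064863
gene_length_bp = 1389
protein_length_aa = 462


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions the listed gene length (1389 bp) and protein length (462 aa) agree with the coordinates.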


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGTGT GCATCCTCGC TATGGGCAGC CGAGGCGATG TCCAACCCAC GATCGCGATC 
GGGCTGGCGC TGCAGAGCCG CGGGCACGAT GTGACGATCG CCGCGATGGG TGATCCGCTG
GTGAAGCTGA TCCGCTCCGC GGGTATCGAC GCGCACCGGC TCAACGAGAT CGTCCCCGAC
TACGACGACG ACTACCGCGA GGTGATCCAC CGACCGGTGG AGCGGATGCG GGCCTACGGC
CGGTTCCTGG TGCGCAACAT CGCCACCATC TCGTACGAGA TCGAGACCGT GTGCCGCGAC
GCCGATGTGG TGCTGACCCA TTCGGACGCG GTCGATTTCG CGCTGCCCAT CACTCGCCGC
ACCGGCGCCT CGATCATCAG CTACCGATTC TTCCCTGGAA CCACCAACAG CGTGTACCCG
ATGACGCAGT ACACCCCGGC CGGGTTGACC AGCGATGTGC TGTCGCGGTC GCCGCGCATG
GTCAAGCGGG CCACCTGGGC GCTCGGCGAC TCGTTCACCT GGACGCACGT GCGCGCCGCC
GTCAACTTCC ACCGCATGTC GGTCGGCGAG GCGCCGTACC GCTCGCGGCG CGCGAAGAAC
CGCGACGCCC ACGAGATCGT CGATCTGCAG CTCTACGACC CCGCCCTCAC CCCCGATCTG
GTGCCCGAGT TCAGCCGCAC CCGGCCGATG CTCGGTTTCC TGGAGGTGCC CTCGGACGCG
TGGCTGCGCG AGGGCAAGCA GTCCCGCACC GACGCCGACC TGATGGACTG GATCAAGGCC
GGCGATGCGC CGATCTACTG GGGCTTCGGC AGCATGCGGA TCGCCGACCC CGACGGCAAG
GCCCGCATCT TCGCGCAGGT CTGCAAGGAG CGGGGCCGGC GCGGACTCAT CGTCTCCGGC
TGGAGCGATC TCACCAGTGA GGACCTCGGC GACCACATGC GGGTCGTCAA CGAGGTCGTG
CACTCGGAGG TACTCCCGCA CTGCGCCGCC GCCGTGCACC ACGGCGGCGC CGGCACCACC
GCCGCATCGC TGCGCGCCGG CCTGCCCACC CTGATCTGCC CGGTGCTCGC CGATCAGCCC
TTCTGGGGCG CGCGGGTCAC CGACCTCGGC GTCGGCGCCT GCCTCCCGAT GCGCAACGTC
ACCCCGGAAC GGTTGCACGC CGCCTTCGAC AAGCTGCTCG ATCCGGCCAC CCGCCGCCGC
GCGCAGCGCA CCTCGTCGTT GATCGACCTC GGTGACATCC CCGCCCGCCG GGCCGCGCTG
ATCATCGAGT CCATCGCCGA GGACGACGGG GTCGATGTCG CGGGCAGACT GGTCGAGACC
ATCGCGGCAC CCGCCACCGT CGACTTCACG GCCCCGGCAC CGGTGCCGCT CGCCCCGGCG
GAGGTGTGA
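
The gene sequence is printed in space-separated blocks of 10 bases, so whitespace has to be removed before any computation. The sketch below shows one way to compute a GC fraction from such text; the helper names are hypothetical, and only the first row of the sequence is used to keep the example short.

```python
def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "ATGAAGGTGT GCATCCTCGC TATGGGCAGC CGAGGCGATG TCCAACCCAC GATCGCGATC"
print(f"{gc_fraction(first_row):.2f}")  # prints 0.60
```

The 70% GC content listed in the record refers to the full 1389 bp gene, not to this single row.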
 
Protein sequence
MKVCILAMGS RGDVQPTIAI GLALQSRGHD VTIAAMGDPL VKLIRSAGID AHRLNEIVPD 
YDDDYREVIH RPVERMRAYG RFLVRNIATI SYEIETVCRD ADVVLTHSDA VDFALPITRR
TGASIISYRF FPGTTNSVYP MTQYTPAGLT SDVLSRSPRM VKRATWALGD SFTWTHVRAA
VNFHRMSVGE APYRSRRAKN RDAHEIVDLQ LYDPALTPDL VPEFSRTRPM LGFLEVPSDA
WLREGKQSRT DADLMDWIKA GDAPIYWGFG SMRIADPDGK ARIFAQVCKE RGRRGLIVSG
WSDLTSEDLG DHMRVVNEVV HSEVLPHCAA AVHHGGAGTT AASLRAGLPT LICPVLADQP
FWGARVTDLG VGACLPMRNV TPERLHAAFD KLLDPATRRR AQRTSSLIDL GDIPARRAAL
IIESIAEDDG VDVAGRLVET IAAPATVDFT APAPVPLAPA EV
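
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function and the table constants are hypothetical names for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop block spacing and newlines
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAAGGTGT GCATCCTCGC TATGGGCAGC"))  # prints MKVCILAMGS
print(translate("GAGGTGTGA"))                          # prints EV
```

The two fragments reproduce the first ten residues (MKVCILAMGS) and the last two residues (EV) of the protein sequence listed above.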