Gene Tpau_0998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_0998 
Symbol 
ID9155138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp1020511 
End bp1023549 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content66% 
IMG OID 
ProductBeta-galactosidase 
Protein accessionYP_003645970 
Protein GI296138727 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00145387 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTTC ACCTACGCGA TCTCGCTGAT CCCGAGCACT TCGCGGAGGG GACCGTTGCC 
CCCCATTCGG ACCATCGCTG GTTCCGGAAC CGAGACGAGG CGCTGGCGGG GATCAGTTCC
TTCGAACAGA GCCTCAACGG CATGTGGAAG TTCGACTATG CGCCGAATCC GCAGTCCGCG
CCCGAGGGCT TCGAGCGGCT CGACTGCGAT GTCGACGACT GGGCTGAGAT CGAGGTCCCC
GCGCACATCC AGTTGCAGGG GTACGACCGG CCGCAGTACG TCAACGTGCA ATACCCGTGG
GATGGGCGCG AGCAGCTCGA ACCGGGGCAG GTCCCCATGA GATTCAACCC GGTCGGGTCG
TACGTGCGGA CCTTCGAGCT CGACGCACCC CTCGGGCCGG GGGAGCGGCT CACGCTGCAC
TTCGCCGGCG TGGAGAGCGC GCTGGCGGTG TGGGTGAACG GGATCTACGT GGGCTACGCC
GAGGACTCGT TCACACCCTC GGAGTTCGAC ATCACCGACT ACCTGACGTC GGCCGAGAAC
CGGATCGCGT GTCGCGTCTT CAAGTGGTGC TCCGCCTCGT GGCTCGAGGA CCAGGACCTC
TTCCGCTTCT CGGGCATCTT CCGTGATGTG ACTCTGCACC GCCACCCGGC GACGCACATC
ACCGACCTGG TCGTGTCCAC CGATATCGCA GATGATTTCA GCACGGCCGA GGTCTCGGTC
GCCGTCACGC TTCGCGGAGC GGGAATGGTC CGCGGCGTCC TGACCGGAGT CGGCGACCTG
GTCTCGGCGG GTGCGGGCCG GCTCGCCGTC GCGGTGGACT CCCCGCAGCT GTGGAGCAGC
GAATCCCCTC ACCTCTACGA CCTCGTACTC GAGGTGTCCG ACGACCGCGG CGACGTGACC
GAGATCGTGC CGGTGAAAGT CGGCATCAGG CGCGTCGGCA TCGAAGACGG TGTGTTCAAG
GTCAACGGGC GACGCGTGGT GTTCAACGGC GTCAACCGGC ACGAGTTCGG CCTGAAGGGG
CGCGTGGTCA CCCGCGAGGA GACGGAGTCG GATCTCCGCT TCATGAAGGC GCACAACATC
AATGCGGTTC GCACGAGCCA CTACCCCAAC AACACGTTCT TCTACGAGCT GTGCGACATC
TACGGCGTGT ACGTGATCGA TGAGGTCAAC CTGGAGACGC ACGGGACGTG GGCGGATACC
CCGGTGCTGG CGACGCCGGA CACCGCACTG CCCGGCGACC GCCCCGAGTG GCTGGACAAC
GTCCGTGCCC GCGCCCGCAA TATGGTGGCC CGAGACCGCA ACCACTGCAG CATCGTGATG
TGGTCGTGCG GCAACGAATC CTCCGGGGGC CGGAATCTCC TCGAGGTCTC TCGGCTCCTC
AAAGCGGAGG ACACCCGTCC GGTCCACTAC GAGGGGATCT CCATGGACCC GCGGTATCCG
GAGACGAGCG ACGTCGTCAG CAGGATGTAC CTACCCGTGG ACGATGTCGA GGCCTATCTG
CTGGAGCATC GCGATAAGCC GTACATCCTC TGCGAGTACG CCCATGCCAT GGGCAACTCG
TTCGGAGCGG TCGACCGATA CGTCGACCTC GCGTATCGCG ACGAGCTCTT CCAGGGCGGA
TTCATCTGGG ACTTCGTCGA CCAAGCACTC CCCGCACGGA ACGCGGACGG CAGTGAGTAC
CTCGGTTACG GAGGAGACTT CGGTGACCGA CCGAATGACG CCGACTTCTC CGGGAACGGA
ATACTGTTCG CCGACAGGTC GCCCAAACCG TGCGCGGAAG AAGTCAAACG CCTGTATCAA
GGCTTCGTCT TCACGATCGG TCGGTCGTCG GTGGAGATCG AGAACCGCAT GATGTTCACG
AGTTCGGCGG ACTTCCGCTG CGTCGCGCAG CTCTCCTACG GGGGGACCAT CGTCGAAGAG
GCGGAGATCG ACACGCGTGT GGACGCCGGC TCGGTCGGCG CGTACTCGCT GCCGTTCGTC
GTCGACACCG CGCAGCTCGA TGCCGCGGTC GACGTCTCGC TCCGGTTGCG AACGGCCACC
GACTGGGCCG GTGCGGATCA TGTGGTGGCC GCTGACCAGC GAGTGTTCCC GAATCGCCGT
CGTGTGCCCG ACGGTCGGCC GCCGCAGGGA AGCCTCGAAC TCATCGAGGG ACGTCACAAC
ATCGGTGTCC GGGGCGAGGG CTTCGACGTC CTGTTCTCGG TGCTGCACGG AGGCCTGGTT
TCGTACCGCG TGGGCGAGGG TGATACCTAT CGCGAGCTGC TCGATTCCAT GCCTCTGCCG
AACTTCTGGC ACGCGCCCAC GTCCAACGAG CGAGGCTGGA AAATGCCTGC GCGGGACGGT
ATGTGGCTCG TGGCGAGTCG GTACCCGCGC CCCGACGCCG GAGCGGGGCG AACGTCAGTG
GAGAGGGCCG ACGACGGCGC GGTCATGGTC CGCTGCCGCT ACATCCTGCC GACCTCTCCG
GAGAGCACGT GCTCGGTGGA GTACACAGTG ACTCCCGACG GGAGGGTAGC GGTGCAAGTG
GACGTCGACC CCGCTCCGGG CCTTCCCGAT ATGCCGGAGT TCGGTATGTC GTTGGCGCTT
CCGGCGCCGT ATCACCGCTT GACCTGGTTC GGAGACGGAC CCCACGAGTG CTACGTCGAT
CGTCGCGCCG CCGCGCGTCT GGGAATCCAT TCGATCGACA CCCGCGAAGC ACTGACCCCC
TACATCCGCC CGCAGGAAGC GGGGAACCGG ACAGGAGTCA GATGGGCCGA GGTGACCGAC
GAGCACGGGT ACGGAATGCG TCTCGAGGGG CGCGAGAGCA TGGAGCTCGC GGTCACGCCG
TGGACGCCGT ACGAGGTGGA GAATGCCCGT CACCCCGAAG ACCTCCCGCC GATCCGCCGC
ACGATCCTTC GTCCGGCACT GATGCGCCGG GGAGTGGGCG GTGACGATTC GTGGGGATCG
CTGCCCCATC CGGAGTACCG CCTGCCCGCG GGGCAGCGGA TGCGATTCGC GTTCGACTTC
CTCGGTATCG CCCCAGAGGG CCGCGGGACC TCCGGTTAG
 
Protein sequence
MTFHLRDLAD PEHFAEGTVA PHSDHRWFRN RDEALAGISS FEQSLNGMWK FDYAPNPQSA 
PEGFERLDCD VDDWAEIEVP AHIQLQGYDR PQYVNVQYPW DGREQLEPGQ VPMRFNPVGS
YVRTFELDAP LGPGERLTLH FAGVESALAV WVNGIYVGYA EDSFTPSEFD ITDYLTSAEN
RIACRVFKWC SASWLEDQDL FRFSGIFRDV TLHRHPATHI TDLVVSTDIA DDFSTAEVSV
AVTLRGAGMV RGVLTGVGDL VSAGAGRLAV AVDSPQLWSS ESPHLYDLVL EVSDDRGDVT
EIVPVKVGIR RVGIEDGVFK VNGRRVVFNG VNRHEFGLKG RVVTREETES DLRFMKAHNI
NAVRTSHYPN NTFFYELCDI YGVYVIDEVN LETHGTWADT PVLATPDTAL PGDRPEWLDN
VRARARNMVA RDRNHCSIVM WSCGNESSGG RNLLEVSRLL KAEDTRPVHY EGISMDPRYP
ETSDVVSRMY LPVDDVEAYL LEHRDKPYIL CEYAHAMGNS FGAVDRYVDL AYRDELFQGG
FIWDFVDQAL PARNADGSEY LGYGGDFGDR PNDADFSGNG ILFADRSPKP CAEEVKRLYQ
GFVFTIGRSS VEIENRMMFT SSADFRCVAQ LSYGGTIVEE AEIDTRVDAG SVGAYSLPFV
VDTAQLDAAV DVSLRLRTAT DWAGADHVVA ADQRVFPNRR RVPDGRPPQG SLELIEGRHN
IGVRGEGFDV LFSVLHGGLV SYRVGEGDTY RELLDSMPLP NFWHAPTSNE RGWKMPARDG
MWLVASRYPR PDAGAGRTSV ERADDGAVMV RCRYILPTSP ESTCSVEYTV TPDGRVAVQV
DVDPAPGLPD MPEFGMSLAL PAPYHRLTWF GDGPHECYVD RRAAARLGIH SIDTREALTP
YIRPQEAGNR TGVRWAEVTD EHGYGMRLEG RESMELAVTP WTPYEVENAR HPEDLPPIRR
TILRPALMRR GVGGDDSWGS LPHPEYRLPA GQRMRFAFDF LGIAPEGRGT SG