Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0998 |
Symbol | |
ID | 9155138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1020511 |
End bp | 1023549 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Beta-galactosidase |
Protein accession | YP_003645970 |
Protein GI | 296138727 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00145387 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTTC ACCTACGCGA TCTCGCTGAT CCCGAGCACT TCGCGGAGGG GACCGTTGCC CCCCATTCGG ACCATCGCTG GTTCCGGAAC CGAGACGAGG CGCTGGCGGG GATCAGTTCC TTCGAACAGA GCCTCAACGG CATGTGGAAG TTCGACTATG CGCCGAATCC GCAGTCCGCG CCCGAGGGCT TCGAGCGGCT CGACTGCGAT GTCGACGACT GGGCTGAGAT CGAGGTCCCC GCGCACATCC AGTTGCAGGG GTACGACCGG CCGCAGTACG TCAACGTGCA ATACCCGTGG GATGGGCGCG AGCAGCTCGA ACCGGGGCAG GTCCCCATGA GATTCAACCC GGTCGGGTCG TACGTGCGGA CCTTCGAGCT CGACGCACCC CTCGGGCCGG GGGAGCGGCT CACGCTGCAC TTCGCCGGCG TGGAGAGCGC GCTGGCGGTG TGGGTGAACG GGATCTACGT GGGCTACGCC GAGGACTCGT TCACACCCTC GGAGTTCGAC ATCACCGACT ACCTGACGTC GGCCGAGAAC CGGATCGCGT GTCGCGTCTT CAAGTGGTGC TCCGCCTCGT GGCTCGAGGA CCAGGACCTC TTCCGCTTCT CGGGCATCTT CCGTGATGTG ACTCTGCACC GCCACCCGGC GACGCACATC ACCGACCTGG TCGTGTCCAC CGATATCGCA GATGATTTCA GCACGGCCGA GGTCTCGGTC GCCGTCACGC TTCGCGGAGC GGGAATGGTC CGCGGCGTCC TGACCGGAGT CGGCGACCTG GTCTCGGCGG GTGCGGGCCG GCTCGCCGTC GCGGTGGACT CCCCGCAGCT GTGGAGCAGC GAATCCCCTC ACCTCTACGA CCTCGTACTC GAGGTGTCCG ACGACCGCGG CGACGTGACC GAGATCGTGC CGGTGAAAGT CGGCATCAGG CGCGTCGGCA TCGAAGACGG TGTGTTCAAG GTCAACGGGC GACGCGTGGT GTTCAACGGC GTCAACCGGC ACGAGTTCGG CCTGAAGGGG CGCGTGGTCA CCCGCGAGGA GACGGAGTCG GATCTCCGCT TCATGAAGGC GCACAACATC AATGCGGTTC GCACGAGCCA CTACCCCAAC AACACGTTCT TCTACGAGCT GTGCGACATC TACGGCGTGT ACGTGATCGA TGAGGTCAAC CTGGAGACGC ACGGGACGTG GGCGGATACC CCGGTGCTGG CGACGCCGGA CACCGCACTG CCCGGCGACC GCCCCGAGTG GCTGGACAAC GTCCGTGCCC GCGCCCGCAA TATGGTGGCC CGAGACCGCA ACCACTGCAG CATCGTGATG TGGTCGTGCG GCAACGAATC CTCCGGGGGC CGGAATCTCC TCGAGGTCTC TCGGCTCCTC AAAGCGGAGG ACACCCGTCC GGTCCACTAC GAGGGGATCT CCATGGACCC GCGGTATCCG GAGACGAGCG ACGTCGTCAG CAGGATGTAC CTACCCGTGG ACGATGTCGA GGCCTATCTG CTGGAGCATC GCGATAAGCC GTACATCCTC TGCGAGTACG CCCATGCCAT GGGCAACTCG TTCGGAGCGG TCGACCGATA CGTCGACCTC GCGTATCGCG ACGAGCTCTT CCAGGGCGGA TTCATCTGGG ACTTCGTCGA CCAAGCACTC CCCGCACGGA ACGCGGACGG CAGTGAGTAC CTCGGTTACG GAGGAGACTT CGGTGACCGA CCGAATGACG CCGACTTCTC CGGGAACGGA ATACTGTTCG CCGACAGGTC GCCCAAACCG TGCGCGGAAG AAGTCAAACG CCTGTATCAA GGCTTCGTCT TCACGATCGG TCGGTCGTCG GTGGAGATCG AGAACCGCAT GATGTTCACG AGTTCGGCGG ACTTCCGCTG CGTCGCGCAG CTCTCCTACG GGGGGACCAT CGTCGAAGAG GCGGAGATCG ACACGCGTGT GGACGCCGGC TCGGTCGGCG CGTACTCGCT GCCGTTCGTC GTCGACACCG CGCAGCTCGA TGCCGCGGTC GACGTCTCGC TCCGGTTGCG AACGGCCACC GACTGGGCCG GTGCGGATCA TGTGGTGGCC GCTGACCAGC GAGTGTTCCC GAATCGCCGT CGTGTGCCCG ACGGTCGGCC GCCGCAGGGA AGCCTCGAAC TCATCGAGGG ACGTCACAAC ATCGGTGTCC GGGGCGAGGG CTTCGACGTC CTGTTCTCGG TGCTGCACGG AGGCCTGGTT TCGTACCGCG TGGGCGAGGG TGATACCTAT CGCGAGCTGC TCGATTCCAT GCCTCTGCCG AACTTCTGGC ACGCGCCCAC GTCCAACGAG CGAGGCTGGA AAATGCCTGC GCGGGACGGT ATGTGGCTCG TGGCGAGTCG GTACCCGCGC CCCGACGCCG GAGCGGGGCG AACGTCAGTG GAGAGGGCCG ACGACGGCGC GGTCATGGTC CGCTGCCGCT ACATCCTGCC GACCTCTCCG GAGAGCACGT GCTCGGTGGA GTACACAGTG ACTCCCGACG GGAGGGTAGC GGTGCAAGTG GACGTCGACC CCGCTCCGGG CCTTCCCGAT ATGCCGGAGT TCGGTATGTC GTTGGCGCTT CCGGCGCCGT ATCACCGCTT GACCTGGTTC GGAGACGGAC CCCACGAGTG CTACGTCGAT CGTCGCGCCG CCGCGCGTCT GGGAATCCAT TCGATCGACA CCCGCGAAGC ACTGACCCCC TACATCCGCC CGCAGGAAGC GGGGAACCGG ACAGGAGTCA GATGGGCCGA GGTGACCGAC GAGCACGGGT ACGGAATGCG TCTCGAGGGG CGCGAGAGCA TGGAGCTCGC GGTCACGCCG TGGACGCCGT ACGAGGTGGA GAATGCCCGT CACCCCGAAG ACCTCCCGCC GATCCGCCGC ACGATCCTTC GTCCGGCACT GATGCGCCGG GGAGTGGGCG GTGACGATTC GTGGGGATCG CTGCCCCATC CGGAGTACCG CCTGCCCGCG GGGCAGCGGA TGCGATTCGC GTTCGACTTC CTCGGTATCG CCCCAGAGGG CCGCGGGACC TCCGGTTAG
|
Protein sequence | MTFHLRDLAD PEHFAEGTVA PHSDHRWFRN RDEALAGISS FEQSLNGMWK FDYAPNPQSA PEGFERLDCD VDDWAEIEVP AHIQLQGYDR PQYVNVQYPW DGREQLEPGQ VPMRFNPVGS YVRTFELDAP LGPGERLTLH FAGVESALAV WVNGIYVGYA EDSFTPSEFD ITDYLTSAEN RIACRVFKWC SASWLEDQDL FRFSGIFRDV TLHRHPATHI TDLVVSTDIA DDFSTAEVSV AVTLRGAGMV RGVLTGVGDL VSAGAGRLAV AVDSPQLWSS ESPHLYDLVL EVSDDRGDVT EIVPVKVGIR RVGIEDGVFK VNGRRVVFNG VNRHEFGLKG RVVTREETES DLRFMKAHNI NAVRTSHYPN NTFFYELCDI YGVYVIDEVN LETHGTWADT PVLATPDTAL PGDRPEWLDN VRARARNMVA RDRNHCSIVM WSCGNESSGG RNLLEVSRLL KAEDTRPVHY EGISMDPRYP ETSDVVSRMY LPVDDVEAYL LEHRDKPYIL CEYAHAMGNS FGAVDRYVDL AYRDELFQGG FIWDFVDQAL PARNADGSEY LGYGGDFGDR PNDADFSGNG ILFADRSPKP CAEEVKRLYQ GFVFTIGRSS VEIENRMMFT SSADFRCVAQ LSYGGTIVEE AEIDTRVDAG SVGAYSLPFV VDTAQLDAAV DVSLRLRTAT DWAGADHVVA ADQRVFPNRR RVPDGRPPQG SLELIEGRHN IGVRGEGFDV LFSVLHGGLV SYRVGEGDTY RELLDSMPLP NFWHAPTSNE RGWKMPARDG MWLVASRYPR PDAGAGRTSV ERADDGAVMV RCRYILPTSP ESTCSVEYTV TPDGRVAVQV DVDPAPGLPD MPEFGMSLAL PAPYHRLTWF GDGPHECYVD RRAAARLGIH SIDTREALTP YIRPQEAGNR TGVRWAEVTD EHGYGMRLEG RESMELAVTP WTPYEVENAR HPEDLPPIRR TILRPALMRR GVGGDDSWGS LPHPEYRLPA GQRMRFAFDF LGIAPEGRGT SG
|
| |