Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0997 |
Symbol | |
ID | 9155137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1019286 |
End bp | 1020374 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | galactokinase |
Protein accession | YP_003645969 |
Protein GI | 296138726 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00864415 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTGC GCGGCTATGC CCCCGGCCGG ATCAATCTGA TCGGTGAGCA CACGGACTAC AACGACGGCT ATGCGCTGCC GATCGCGCTC GGGGTGGGTG CCACGGCGCG GTTCGATCCG TCGATCTCGG ACAGCATCGC CGTTTCCTCG CGCGAGGAAG GGGCCGCGGC GGCCATCCCG CTGGACACCA GTCCCGGGAC GGGCGCGGTG CGAGGCTGGC CCGGGTACGT CGCAGGATGT GTCTGGGCAC TGCGCGAGCA CGGTGTTCGC GTGCCGGGAG GCGCAATGAC GATTGCTTCC GATGTTCCTG TGGGAGCCGG GCTCTCGTCT TCCGCTGCGA TCGAGTGCGC CGTGCTGGAG GCCTTGGTGG CGGCATCGGG GTCCGAGGCC CCTGACCGGA CCACTTTGGC CCGCATCGCG CAGCGCGCGG AGAACGAGTA CGTCGGTGCT CCCACGGGGC TGCTGGACCA GATGAGCAGC CTGTACGGCG AGCAGGACAC CGCCCTGCTC CTGGACTTCC GTTCGCTGGC CGTCGATCGT GTCCCCATGA ACCTGGGGAC CGCGGTCCTC CTCGCGATCG ATTCGCGGAC ACCGCACCAG CACGCCGGTG GTGAGTACGG TGCGCGCCGG CGATCGTGTG AGGCGGCAGC GGCAGAACTG GGGCTGTCCT CGCTTCGCGA TGCGTCCGAC GGTGCGTGGA CCCGGACCGA CGATGCCGTC ACGGCTCGCC GGGCACGGCA CGTCATCACC GAGAACGCGA GAGTCCTCGC AGCTGCGGAT GCCCTCGCAT CGGGTGACTT CACCCGGTTC GGCGAGTTGA TGGTCGAATC GCATCATTCG ATGCGAGATG ACTTTGAAAT CACCGTTCCG GCTATCGATT TCATCGCGGA TGAGGCCTGC CGGTTCGGCG CCTACGGCGC TCGTATGACG GGCGGCGGCT TCGGCGGCAC CGTGGTGGTG CTGGCACCTG CATCAGCAGC GGAGCGGATC GTGTCGGAGC TCCCCGAGGC GGTCCACTCG GCGGGACACC CGCGTCCCAG CATTGCGGCG GTCCGCCCCG GCGGCGGTGC GTTCGCTGAG AAGACCTAA
|
Protein sequence | MTVRGYAPGR INLIGEHTDY NDGYALPIAL GVGATARFDP SISDSIAVSS REEGAAAAIP LDTSPGTGAV RGWPGYVAGC VWALREHGVR VPGGAMTIAS DVPVGAGLSS SAAIECAVLE ALVAASGSEA PDRTTLARIA QRAENEYVGA PTGLLDQMSS LYGEQDTALL LDFRSLAVDR VPMNLGTAVL LAIDSRTPHQ HAGGEYGARR RSCEAAAAEL GLSSLRDASD GAWTRTDDAV TARRARHVIT ENARVLAAAD ALASGDFTRF GELMVESHHS MRDDFEITVP AIDFIADEAC RFGAYGARMT GGGFGGTVVV LAPASAAERI VSELPEAVHS AGHPRPSIAA VRPGGGAFAE KT
|
| |