Gene Information |
Locus tag | Tpau_4187 |
Symbol | |
ID | 9158375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4312008 |
End bp | 4313498 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003649095 |
Protein GI | 296141852 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCAG CTCCCGAGAC CTCGACCGGA GAGCTCATCT CCACCAACCC CCGTACCGGC GCCGAGGTGG CCCGCTTCGC CATCGCCGAT GCCGCAGCGG TGGACGCCGC CGTCGCCACC GCGCACACCG CTGCGCAGTG GTGGGGCGGC CTGGAGCCGA AAGCCCGCCG CAGTTGGCTG CTCCGCTTCC GCGCCGAGCT CTCGCGTCGC GCTGAGGACC TCGCCGCGGT GGTCGCCGCC GAGACCGGCA AGCCCGTCGA CGACGCGCTC CTCGAGGTGA TGCTCGCGGT GGTGCACCTC GATTGGGCCG CCAAGAACGC CGAGAAGGTG CTGGGCCGGC GCAGCGTCGG CACCGGCATG CTGGGCGCGA ACCTCGCCGC CACCGTCGAG TACCGCCCGT TCGGTGTGGT CGGGGTGATC GGCCCGTGGA ATTATCCCGT CTACACCCCG ATGGGGTCGA TCTCGTATGC CCTCGCCGCC GGTAACGCCA TCGTCTTCAA GCCCAGCGAA CTGACCCCCG CCGTGGGCCA GTTCCTGGCC GATACCTGGG CCGCCGCCTG CCCCGGCCAG CCCGTGCTGC AGGCCATCCA CGGCGCCGGC GAGACCGGCG CCGCGCTGTG CCGCTCCGCC GTGGACAAGC TGGCGTTCAC CGGTTCTGCC GCGACCGCCC GCCGGGTCAT GGCCACCTGC GCGGAGAACC TCACCCCCGT CGCCATCGAG GGCGGCGGCA AGGACGCCTT CATCGTGGAC TCCGATGCGA ACATCGATAG CGCGGTCGAT GCCGCGGTCT TCGGCGCCTT CGGCAACGCC GGCCAGACCT GCGCCGGCGT CGAGCGCGTC TACGTGGTCG GCGACAAGTA CGACGAATTC GTCGACAAAC TCGCCGCGAA ATCTCGTGAG ATCCACGGAG GGTCCGAGGA TTCCGCCGAC TACGGCCCGG CCACCATGCA CAAGCAACTC ACGGTGATCG CCAGCCACAT CGATGACGCC CTCAACCGCG GCGGCCGCGC CATCGTCGGC GGCCGGGAAT CCGTGGGCGA GAACACCGTC CAGCCGGTCG TCCTGGTCGA CGTGCCGGAG GACTCCACGG CCGTCACCGA GGAGACCTTC GGGCCCACCG TGGTGGTGAA CCGCGTCAAG GACATCGACG AGGCGATCGA CCGCGCCAAC AACAGCACCT ACGGACTGTC CGCCGCGATC ATGACCAAGG ACCTGAACAG GGGCCGAGAG CTGGCGCGCA AGCTGCGCAC CGGTGCGGTG GCCGTCAATT CCTTCCTCTC CTTCGCCTCG GTACCCGCAC TGCCTTTCGG CGGCATCGGC GACTCCGGCT TCGGCCGCAT CCACGGCGCC GACGGCCTGC GCGAGTTCAG TCGCCCGCAG TCCGTTGCGG CGCAGAAGTT CGCGCTGCCG ATGAACCTGC TCACCTTCAA TCGCAAGGCG CGCGATATGA AGACCGTGCG GATGATGCTC AGCAAGGTCT ACTCGCGGTG A
|
Protein sequence | MTAAPETSTG ELISTNPRTG AEVARFAIAD AAAVDAAVAT AHTAAQWWGG LEPKARRSWL LRFRAELSRR AEDLAAVVAA ETGKPVDDAL LEVMLAVVHL DWAAKNAEKV LGRRSVGTGM LGANLAATVE YRPFGVVGVI GPWNYPVYTP MGSISYALAA GNAIVFKPSE LTPAVGQFLA DTWAAACPGQ PVLQAIHGAG ETGAALCRSA VDKLAFTGSA ATARRVMATC AENLTPVAIE GGGKDAFIVD SDANIDSAVD AAVFGAFGNA GQTCAGVERV YVVGDKYDEF VDKLAAKSRE IHGGSEDSAD YGPATMHKQL TVIASHIDDA LNRGGRAIVG GRESVGENTV QPVVLVDVPE DSTAVTEETF GPTVVVNRVK DIDEAIDRAN NSTYGLSAAI MTKDLNRGRE LARKLRTGAV AVNSFLSFAS VPALPFGGIG DSGFGRIHGA DGLREFSRPQ SVAAQKFALP MNLLTFNRKA RDMKTVRMML SKVYSR
|
| |
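The coordinate and length fields in the Gene Information section can be cross-checked with simple arithmetic. The sketch below is only an illustration and is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
start_bp = 4312008
end_bp = 4313498
listed_gene_length_bp = 1491
listed_protein_length_aa = 496


def span_length(start: int, end: int) -> int:
    """Length of an interval given in 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == listed_gene_length_bp)
print("protein length matches record:", computed_protein_length == listed_protein_length_aa)
```

Under these assumptions both comparisons agree with the listed Gene Length and Protein Length values.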