Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2023 |
Symbol | |
ID | 9156178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2107734 |
End bp | 2109080 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | histidinol dehydrogenase |
Protein accession | YP_003646974 |
Protein GI | 296139731 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.908809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATTC TGCAGCTGAA CCGTCTCGAC CTGCGCGGGC GCCGTCCGTC GCTGACCGAG TTGCGCGCGG CGCTGCCGCG CGGCGGCGTG GATGTGGACG CCGTGGTCCC GCAGGTTCGG CCCATCGTGG ACGCGGTCGC CGACCGTGGC GCCGAGGCCG CGCTCGAGTG GGGGGCCAGG TTCGACGGGG TCCGGCCGGA CACGGTCCGG GTGCCCGCGG CCGCCATCGC CGGTGCCCTG GCCGCACTCG ACCCCGCGAT CCGCGAGGCG CTCGAAACCG CCATCGCACG GGTACGCACC GTGCACGCCG ATCAACGCCG CACCGACACC GTCACCGAGG TCGTGCCCGG CGGTACCGTC ACCGAGCGGT GGATACCCGT GGACCGGGTA GGTCTGTATG TGCCGGGCGG TAACGCCGTC TACCCGTCGT CGGTCGTGAT GAACGTGGTC CCCGCGCAGA TCGCCGGCGT CGGGTCTCTG GTTGTTCTCT CGCCCGCGCA GAAGGACTTC GACGGCTTAC CGCATCCCAC GATCCTGGCG GCGTGCGCCC TGCTCGGTGT CGACGAGGTG TGGGCGACCG GTGGCGCGCA GGGCGTGGCC CTCGCCGCGC ACGGTGGCAC TGACATCGAT GGTGCCGAAT TGGCACCCGT CGACCTGATC ACCGGTCCCG GAAACATCTA CGTCACCGCT GCCAAGCGAC TGTGCCGCGG CCTGGTCGGC ATCGACGCCG AGGCCGGACC CACCGAGATC GCGATCCTCG CCGACACCAC CGCCGATCCG GTGCACGTGG CCGCGGACCT CATCAGTCAG GCCGAACACG ATGTGATGGC TGCGTCCGTG CTGGTCACCG ACTCCGTGGA GCTCGCCGAC CGGGTCGACG AGGCGCTCTC CGCACAGCTC GAGCGGACCC GTCATCGCGA GCGGGTCGAT ACCGCCCTGC GCGGTGCGCA GTCGGGCACC GTTCTGGTCG ACGATGTCAC CGAGGGACTT GCCGTGGTCA ATGCGTATGC AGCCGAGCAC CTCGAGATTC AGACCGCGAA CGCCCGTGAG GTCGCTGCGC GGGTCCGTGC CGCCGGTGCG ATCTTCGTCG GCCCGCACTC TCCAGTGAGC CTCGGCGACT ATGCCGCGGG CAGTAATCAC GTGCTACCGA CCGCCGGCTG CGCGCGACAC AGCTCGGGGC TGTCGGTACA GACCTTCCTG CGCGGTGTGC ACGTGGTCGA GTACGACGAG GCGGCGCTCA AGGACATCGC GGGAATCGTG ATCACTTTGG CCAATTCGGA GGATCTGCCG GCCCACGGTG AGGCCGTGCG GCTGCGGTTC GAATCGCTCA ACGGTGGCGC GTCGTGA
|
Protein sequence | MPILQLNRLD LRGRRPSLTE LRAALPRGGV DVDAVVPQVR PIVDAVADRG AEAALEWGAR FDGVRPDTVR VPAAAIAGAL AALDPAIREA LETAIARVRT VHADQRRTDT VTEVVPGGTV TERWIPVDRV GLYVPGGNAV YPSSVVMNVV PAQIAGVGSL VVLSPAQKDF DGLPHPTILA ACALLGVDEV WATGGAQGVA LAAHGGTDID GAELAPVDLI TGPGNIYVTA AKRLCRGLVG IDAEAGPTEI AILADTTADP VHVAADLISQ AEHDVMAASV LVTDSVELAD RVDEALSAQL ERTRHRERVD TALRGAQSGT VLVDDVTEGL AVVNAYAAEH LEIQTANARE VAARVRAAGA IFVGPHSPVS LGDYAAGSNH VLPTAGCARH SSGLSVQTFL RGVHVVEYDE AALKDIAGIV ITLANSEDLP AHGEAVRLRF ESLNGGAS
|
| |