Gene Information |
Locus tag | Tpau_2834 |
Symbol | |
ID | 9156999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2935750 |
End bp | 2937387 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Dak phosphatase |
Protein accession | YP_003647771 |
Protein GI | 296140528 |
COG category | |
COG ID | |
TIGRFAM ID | |
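
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2935750, 2937387)
print(length, protein_length_aa(length))  # 1638 545
```

Because the gene lies on the minus strand, the listed sequence would correspond to the reverse complement of this interval on the replicon; the length arithmetic is the same on either strand.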
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.114413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGGTGG CCTGGCTCCG GCGCGCGGTG CTCGGTCTCG AGCGTTCCGT CGACGAGATC AACGCCCTCA ACGTCTTCCC CGTTGCCGAC GCCGATACGG GGACGAACAT GCTGGTCACT CTGCGTGCTG CTGCCCGCGC GGCCGAGGAG GCCGATCGGA CGGAAGGGCC GCATGCGGTG GCCCGGGCGA CGGTGGCCGG CGCTGTTTCC GGGGCCCGCG GTAATTCGGG TGTGATCGTC TCGCAGATTA TGCGCGGCGT GGTCGGGCAG CTCGGTCCGG ACGCCGACCT CGAGGCCGCG GGGCTCGCCG AGGGGCTGCG CACCGCCACC GGCTTGGTCA CCTCCGCGGT CGCCGACCCG ATCGAGGGCA CCATCCTGTC CGTCCTCCGG GCCGCGGCCG ACGGTGCCGC AGCCGCACTG GCGGGTGGGG CCGACCTCCC CGCCTGCGCT CGCGGCGCCG CCGATACCGC GTTCGATGCG CTCCTGCGCA CCCGGGACCA GCTCGCCGAC AACGCCCGCG CCGGCGTGGT CGACGCCGGC GGCCGGGGTT TGTTGGTCTT GCTCGACGCC CTCGTCGAGG TGACCGCCGG AGTGACACCG GAGCGCCCCC GGTTCACTCG CCAGGCGGCG CCGCACGTCC AGGTGAACGT CGACCCGGAG GATGCGGCCG ACCATCCGCA CGATCACGGC CACGACGATC CGCCCCGCGG ACCCCACGTG GACTACGAGG TCATGTACCT GCTTCCGGAT GCCACCGATG CCGCGGCCTC GCGCCTGCGC GACGAGTTGC AGACGTTCGG CGATTCCGTG GTCGTGGTGG CCGCCGCCGA TCCCGATCCG GTGGCTACGT GGTCGGTGCA TGCGCACACC ACCGAGCCGG GGCGTGCCAT TCAGGCCGGC CTCGCCCTCG GCAAGCTCCG GAGCATCGCC GTCACTGTGC TGCAGGAGAC CGGCGAAACG CCGCTACAGG CGCTCCTGGA CGCGCCGCGG GCGATCGTCG CCCTGGTCTC CGGCGACGGT GCCGCCGAGC TGTTCGCCGC CGAGGGCGCG GAGGTGATCC GCTGTGATGA GGGGATCACC CACGCCGGTT TGCTCGTAGC GCTGCACCGG TTCGAGGGCC GCGAGGTGTT GCTCATGCCC AACGGAGCGC TGCCGACTCC GGAACTTCTG GCCGTGGCCG CACGCTCACG GGAATCCGGT GTCCTGGTGA CGCTTCTGCC CACCTCATCG ATGGTGCAGG CGATCGCAGC ACTCGCGGTG CACGATCCGC GCGGCCACGC CGCCGACGAC ACCTTCTCCA TGGCCGAAGC CGCCGCCGGC GCGCGGTGCG GCTCGGTCGT GGCCGTCACC GAGGACGCCC TCACCATCCT CGGACCGTGC GGTCCCGGCG ACTATCTCGG CATGGTCGGC GGTGAGGTCG TGGTGCTCGA AGAGGATCAG TACTCCGCGG GGGAGGCCCT CGCCGAGCTG TTGCTCGCCA CCGGCGGCGA TATGGTGACC GTGCTCCTCG GCGATGCCGG CGACGGCGAC TTCCCCGACC GGGTCGCAGC CGCTCTGCGG CCCGACCGGC CCGAGGTGGA AGTGGTCGGC TACCGCGGCG GACAAACCGG GAGTGTGATG GAGATCGGCG TCGAATGA
|
Protein sequence | MVVAWLRRAV LGLERSVDEI NALNVFPVAD ADTGTNMLVT LRAAARAAEE ADRTEGPHAV ARATVAGAVS GARGNSGVIV SQIMRGVVGQ LGPDADLEAA GLAEGLRTAT GLVTSAVADP IEGTILSVLR AAADGAAAAL AGGADLPACA RGAADTAFDA LLRTRDQLAD NARAGVVDAG GRGLLVLLDA LVEVTAGVTP ERPRFTRQAA PHVQVNVDPE DAADHPHDHG HDDPPRGPHV DYEVMYLLPD ATDAAASRLR DELQTFGDSV VVVAAADPDP VATWSVHAHT TEPGRAIQAG LALGKLRSIA VTVLQETGET PLQALLDAPR AIVALVSGDG AAELFAAEGA EVIRCDEGIT HAGLLVALHR FEGREVLLMP NGALPTPELL AVAARSRESG VLVTLLPTSS MVQAIAALAV HDPRGHAADD TFSMAEAAAG ARCGSVVAVT EDALTILGPC GPGDYLGMVG GEVVVLEEDQ YSAGEALAEL LLATGGDMVT VLLGDAGDGD FPDRVAAALR PDRPEVEVVG YRGGQTGSVM EIGVE
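
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for stripping that layout and computing a GC fraction follows; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used purely as an illustration.
prefix = clean_sequence("GTGGTGGTGG CCTGGCTCCG")
print(len(prefix), gc_fraction(prefix))  # 20 0.75
```

Applying the same two functions to the complete gene sequence is one way to compare against the GC content row of the record.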