Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2044 |
Symbol | |
ID | 9156199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2128832 |
End bp | 2130397 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | anthranilate synthase component I |
Protein accession | YP_003646995 |
Protein GI | 296139752 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.874446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACGC AATCGGCTTC GAACCTCCAG CCCGACGGCG CCACCACCAG CCTCGATGAG TTCTTGGAGC TCGCCGAACG GCATCGCGTG GTACCCGTGA CCCGCACCGT GCTCGCCGAT GCCGAGACCC CGCTCTCGGC GTACCGCAAA CTCGCGGGCG GACGCCCCGG CACCTTCCTG CTGGAATCCG CGGCGGCCGG CCAATCCTGG AGTCGATGGT CGTTCGTCGG TGCGCCCGCT CCGGCGGCGC TGACGGTGGT CGACGGTGAA GCGGCGTGGA TCGGCACCAA GCCGGCGGGC GCGCCATCGG GCGGCGATCC GATCGACGCG CTCGGCGAGG TGCTGCGCCT GCTCAGCTCA GAGCCGATCG AGGGGCTTCC GCCGCTCAAC GGCGGGATGG TGGGCTACCT CGCGTACGAC GCCGTGCGCC GGTTGGAGCG TCTGCCCGAG ACCACGGTCG ACGATCTCGG CCTGCCGGAG ATGGTGTGGC TGCTCGCGAC CGATCTCGCC GCGATCGACC ACCACGAGGG CACCATCACC CTCATCGCCA ACGCGGTGAA CTGGGACGGC GCCCGCGAAC GTGCGGCCGA GGCCTATGCC GACGCGGTCA CCCGGCTCGA TGCGATGGAG ACCGCACTGG CGGCGCCTCT GGCCTCGTCG GTGGCCTCGT TCACCCGGCC CGAGCCCGCC TACAGCGCGC AACGGACGGT CGAGGAGTAC AGCGCGATCG TTGAGAAGCT GGTCGGCGAC ATCGAGGCCG GCGAGGCCTT CCAGGTGGTG CCGTCGCAGC GCTTCTCGGT GCCGTCCCAC GCCGATCCGA TCGACGTCTA CCGCGTGCTG CGGGCATCGA ACCCGAGTCC CTACATGTAT CTGGTCCAGG TACCGGCGCC GGACGGATCG CTCGCTTTCT CGATCGTCGG CTCCAGCCCG GAGGCGCTGG TCACGGTCAG CGATGGCACC GCGACCACCC ACCCGATCGC AGGTACGCGA TGGCGCGGGG CGTCCGCGGA GGAGGACCTG CTGTTGGAGA AGGACCTGCG CGCCGACGAG AAGGAAAACA GCGAGCACCT CATGCTCGTC GACCTCGGCC GCAACGACCT CGGACGCGTG TGCACGCCGG GCACCGTCCG CGTCACCGAC TACCGGCGGA TCGAGCGCTA CAGCCATGTG ATGCACCTGG TCTCGACGGT CTCCGGTGAC CTCGCCCCCG ACAAGCAAGC GCTCGACGCG GTCACCGCCT GCTTTCCCGC CGGCACGCTC ACCGGGGCGC CGAAGGTCCG GGCGATGGAG CTGATCGACG AGGCCGAGCT GACGCGGCGC GGTCTCTACG GCGGCATCGT CGGCTACCTG GATTTCGCCG GTGACGCCGA CACCGCGATC GCGATACGCA CCGCCGTGCT CAAGGACGGC ACCGCCTTCG TCCAGGCCGG AGGCGGCGTG GTCGCGGATT CGGTCGGTGA ATACGAGTAC AACGAATCCC GGAACAAGGC GCTCGCCGCG CTCAAGGCGG TGGCGGCGGC CAATACGCTG CGCGCGGTCA CCGAGACGGA GGGGGAGGGC CGATGA
|
Protein sequence | MSTQSASNLQ PDGATTSLDE FLELAERHRV VPVTRTVLAD AETPLSAYRK LAGGRPGTFL LESAAAGQSW SRWSFVGAPA PAALTVVDGE AAWIGTKPAG APSGGDPIDA LGEVLRLLSS EPIEGLPPLN GGMVGYLAYD AVRRLERLPE TTVDDLGLPE MVWLLATDLA AIDHHEGTIT LIANAVNWDG ARERAAEAYA DAVTRLDAME TALAAPLASS VASFTRPEPA YSAQRTVEEY SAIVEKLVGD IEAGEAFQVV PSQRFSVPSH ADPIDVYRVL RASNPSPYMY LVQVPAPDGS LAFSIVGSSP EALVTVSDGT ATTHPIAGTR WRGASAEEDL LLEKDLRADE KENSEHLMLV DLGRNDLGRV CTPGTVRVTD YRRIERYSHV MHLVSTVSGD LAPDKQALDA VTACFPAGTL TGAPKVRAME LIDEAELTRR GLYGGIVGYL DFAGDADTAI AIRTAVLKDG TAFVQAGGGV VADSVGEYEY NESRNKALAA LKAVAAANTL RAVTETEGEG R
|
| |