Gene Information |
Locus tag | Tpau_2659 |
Symbol | |
ID | 9156820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2763601 |
End bp | 2765181 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | beta-carotene ketolase |
Protein accession | YP_003647598 |
Protein GI | 296140355 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACGC CCGAGCTCGA CGTCGCGGTC GTGGGGTCGG GCCACAACGC GTTGATCGGG GCCGCCTACC TCGCCGCCCG CGGTCGCCGG ATCTCGGTGT TCGAGGCCGA TACCGTGCCC GGCGGAGCGG TGAGCACCGT CGAACGCTTT CCCGGCTACC GCGTCGACCG GGGCTCATCG GCGCACCTGA TGTTCCGGCA CACCGGTATC GCCGAAGAAC TCGCTCTCGC CGACCACGGC CTGCGCTACC TCGACTGCGA TCCCTGGGCC TTCGTTCCCA CCCCACCGGG AACCTCGCTG CCACCCATCG TGTTCCGCGT CGATCTCGAT GCGACCGCGG AGTCGATCGC CGCCGCGTGC GGTGCGCGCG AGGCGGCCGC GTACCGTCGC TTCGTCGCAG ACTGGGCGCC CCGCGCCCGG GCCACCATGA CCGCCTTCGG TGCCGGGCCG ACGGCGGCCC GCCTGGGGCG AGCCTTCTGG CCGACCAAGG GAAGCGATTC GGTGGCCGGT CTATCGCGCA GCTATCTGGC GTCGGGCGAT GCGCTGCTCG ACGAGTATTT CGACGACGAG CGACTCAAGG CCGCCCTGGC GTGGTTCGGT GCGCAGTCGG GTCCAGCGAT GAGCGAACCG GGCACCGCGC CGATGGTGGG GTTCGCCGCT CTCATGCACA CCGTGCCGCC CGGCCGTGCC GTCGGCGGTA GCGGCGCCCT GACCTCCGCG CTGATCAGCC GGATCGAATC CGGCGGTGGC ACCGTGGAAC TCGGCACTCC GGTCACCGCA CTGCGCCGAG CCGGACGGGG CTGGCGGGTC ACCGTCACCG GGCCCGACGG TACGCGCACC GTCTCCGCCC GCACCGTCCT CGCCGGCTGC CACGTGCTCA CCACTCTCGA CCTCCTGGGC CGCGGCGGCT ACCCCGGCGA GCGGATCGAC CGATGGCGCA GGCGGATCCA GGTCGGCCCC GGTGTGGGCA TGGTGGTCCG GCTCGGCACC ACCGCTCCAC CGGAGTTCGA TGGACTCACG CTGGCAGACC AGTCAGGTCT GGGCCTGCTC ACCGCGGACC GCGGTCACCT CGCGCGGGCC CGCGGCGACA TGTTGGCGGC CGATCCGCCG CGCCGGCCCG CGGTGCTGGC GATGACCTTC TCCGGGCTCG ACCCGTCGAT CGCACCCGCC GGCCGGCACA ACACCACCCT GTGGGCGCAG TGGTATCCGT ACACACTCAC GCGCGACGAC TGGCCGGCGA TCGCCGAGGC GGAGGCGGAC CGGATCGTCG CCGAGACGCA GCGCTGGGCA CCGGGCTTCG CCGACACGAT CGAACACCGG TACGTGCAGA CCCCGGCGAA CCTGGAATCC GAGCTCGGCC TGATCGGCGG CAATGTGATG CACGTCGAGA TGGCCCTGCA CAACATGCTG CTGTTCCGGC CGCTCCCCGA ACTGGCCGGT GGGCGCGTTC CCGGCGCGCC GGGACTGGCA CTGGCGGGCG CATCGATGCA TCCCGGCGGG GGCGTGAACG GCTCCAGTGG ACGGATCGCG GCGCGGCTAC TCGCCCGCGA CCTCAGGGGC CTGGCACGAT GGCGGTCGTG A
|
Protein sequence | MSTPELDVAV VGSGHNALIG AAYLAARGRR ISVFEADTVP GGAVSTVERF PGYRVDRGSS AHLMFRHTGI AEELALADHG LRYLDCDPWA FVPTPPGTSL PPIVFRVDLD ATAESIAAAC GAREAAAYRR FVADWAPRAR ATMTAFGAGP TAARLGRAFW PTKGSDSVAG LSRSYLASGD ALLDEYFDDE RLKAALAWFG AQSGPAMSEP GTAPMVGFAA LMHTVPPGRA VGGSGALTSA LISRIESGGG TVELGTPVTA LRRAGRGWRV TVTGPDGTRT VSARTVLAGC HVLTTLDLLG RGGYPGERID RWRRRIQVGP GVGMVVRLGT TAPPEFDGLT LADQSGLGLL TADRGHLARA RGDMLAADPP RRPAVLAMTF SGLDPSIAPA GRHNTTLWAQ WYPYTLTRDD WPAIAEAEAD RIVAETQRWA PGFADTIEHR YVQTPANLES ELGLIGGNVM HVEMALHNML LFRPLPELAG GRVPGAPGLA LAGASMHPGG GVNGSSGRIA ARLLARDLRG LARWRS
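The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence ends in a stop codon that is not counted in the protein length. The function names are hypothetical helpers.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bp."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The terminal stop codon (assumed present) encodes no residue.
    return codons - 1 if has_stop_codon else codons


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces and any other characters (such as the block separators
    used in the sequence display) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(2763601, 2765181)
    print(length)                  # compare with the Gene Length field
    print(protein_length(length))  # compare with the Protein Length field
```

Passing the full Gene sequence string to gc_fraction gives a value that can be compared with the GC content field; the spaces between the 10-base blocks do not need to be removed first.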