Gene Information |
Locus tag | Tpau_3944 |
Symbol | |
ID | 9158125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4064180 |
End bp | 4065625 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Carotenoid oxygenase |
Protein accession | YP_003648855 |
Protein GI | 296141612 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
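The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, but both are consistent with its values. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4064180
END_BP = 4065625


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1446 481
```

Under these assumptions the computed values match the listed gene length (1446 bp) and protein length (481 aa).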
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGC ACAAGCAGAT CGGCGTCGAT CGGTCCCGCG TCTTCGAGCC GGCGGCGGGT GAGTTCGACG ACGTGCTGAT CGAGGACATC ACCGGCGCGC TCCCCGAGGA TCTGCTCGGG ACGCTGTACC GCAGCGCACC GGTGCGCTGG GAAGCCGGCG GCTTCTACGC CAAGCACCTG TTCGACGGTG ACGGGATGGT CGCGAAGTTC TCCCTGGAGG GCGGCCGGCT GCGGTACCGG AACCGGTACG TCCGCACCCC CAAGTACCAG CGGGAGGAGG CCGGGCGAGG GGACCGAGTA CGTGGGCTCG GCACCCTCGC CGCGGGGGGT CCACTCGGTA ACGCGGGTCG CGGCCCCGCT GATCGCGCCA ACACCACGTC GATCTACCAC GGTGACCGGC TGGTCGCGCT CTCCGACGAC GGCCGGCCGT GGGAGCTCGA TCCCGACTCG CTGGCGACGC GCGGTCGGTG CGATTTCGAC GGCGCGCTGA GCCTGTTCAG CACGTTCTCG CCACACCCGA AGCTCGATCC GGTCTCGGGC GAGATGTTCA ACTTCGGCCT CACGCTGCCG CCGGTGGGAC GCAGCACCGG CATGGTGGGC CTGCGCTGCT ACCGGGTCGA TCCGCGCGGT CGGCTGAGCA CGATCCGCAC AGTGCCGCTC TCGCACGTGC TGATCAATCA CGACTTCGCG ATCACCGAGC GGTACCTGGT GTTCGTCCTG GACCCGCTGA CCGTGGGGAC GGTCGCGACG GCGCGAGCCG GCCTGGGCAT GATCGACTTC AACGCGGCCA CCGAGTTCCG CGCGGAGCTC GGGTCCGAAG TGATCCTGGT GCCACGGGAC GGCGGTGAGC CGCGCCGGTT CACCATCCCC GCGTTCCCGA AGGTGCACGT GAACAACGCC TACGAGGTGG GCGGCGACGT GCTGATCGAT GTGGTCAAGT ACGACGACTG GAACGAGACC GCGCGGATGC TCTGCGACTT CCGCGAATAC GGCCCTGCTC GGGGCGGCAC GCTCACGCGG TTGCGGATCA CGCCGAGCGA TCGGGTGGAA CTCACCGAGC TATCGGCCTC GCTGGGCGAA TTCCCGATGC ACGACTGGCG CCGCACCGGA CGACGATTCC GCTACTCGTA TCTGATGGAG TCCGACGGGG TGCGTCCGCC GTCGCTGGAG AAGATCGACA ACGAGACCGG CGAGCAGACC TCGATGGGCG GGTTCGCCTT GTTCGACGGT GTCGGTGAGC CGATCTTCGT GCCGCGCAGT CCCGATGCCG CCGAGGACGA CGGCTGGCTT CTGGTGGTGG TGTATCTGGC GTCGGACCAC CGCTCGGCGC TGATCGTCGT GGACGCTCGG GACATCGAGG CGGGGCCGGT GGCGGTAGCA CGGCTGCCGC ACCACTTCTT CCCCGGATTC CACGGCATGT TCACCGACCG GGTCTCCCTC ACCTAG
|
Protein sequence | MTTHKQIGVD RSRVFEPAAG EFDDVLIEDI TGALPEDLLG TLYRSAPVRW EAGGFYAKHL FDGDGMVAKF SLEGGRLRYR NRYVRTPKYQ REEAGRGDRV RGLGTLAAGG PLGNAGRGPA DRANTTSIYH GDRLVALSDD GRPWELDPDS LATRGRCDFD GALSLFSTFS PHPKLDPVSG EMFNFGLTLP PVGRSTGMVG LRCYRVDPRG RLSTIRTVPL SHVLINHDFA ITERYLVFVL DPLTVGTVAT ARAGLGMIDF NAATEFRAEL GSEVILVPRD GGEPRRFTIP AFPKVHVNNA YEVGGDVLID VVKYDDWNET ARMLCDFREY GPARGGTLTR LRITPSDRVE LTELSASLGE FPMHDWRRTG RRFRYSYLME SDGVRPPSLE KIDNETGEQT SMGGFALFDG VGEPIFVPRS PDAAEDDGWL LVVVYLASDH RSALIVVDAR DIEAGPVAVA RLPHHFFPGF HGMFTDRVSL T
|
| |
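The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the database's own pipeline: it uses the amino acid assignments of NCBI translation table 11, ignores the table's alternative start codons (not needed here, since the gene starts with ATG), and strips the spaces that separate the 10-base groups in the record. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G.
# The amino acid assignments are those of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence in the record.
first_30 = "ATGACCACGC ACAAGCAGAT CGGCGTCGAT"
print(translate(first_30))  # MTTHKQIGVD
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.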