Gene Information |
Locus tag | Tcur_3700 |
Symbol | |
ID | 8605051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 4251014 |
End bp | 4252696 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Carotenoid oxygenase |
Protein accession | YP_003301271 |
Protein GI | 269127901 |
COG category | |
COG ID | |
TIGRFAM ID | |
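The coordinate, length, and GC fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and not part of any database API, and the 72% GC value from the record is not recomputed here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(4251014, 4252696)  # Start bp and End bp from the record
    print(length, protein_length(length))
```

Under these assumptions the Start bp and End bp values reproduce the listed Gene Length and Protein Length. The gene sequence in the Sequence section can be passed to gc_percent as is, since the spaces between the 10-base blocks are stripped first.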
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.101903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCATCC CACGCTCGAT CCTCAGCCGG ACCGACTTTT CCGACTTCGA ACTGTCCCTG ATCGCCGGAG CCTGGCCGCA GGACATCACC GGCCACTACG TGATCTGCAC CTCCGACCAG CGCACCCGCC CGCTGCACGC CTTCTTCGGC GACGGCGTGG TGATCCGCCT GGAGCTGCGG CCCGACGAGC GGGGGCGCTT CCCCTGGCGG GCGCGGGTCA TCGACACCCC CTCGGTCCGG CTGCGGCGCA AGCGTCCCGA CCTGTTCACC GCCGGGCCGG TCGGCACCAG CTCGCCGTGG GGATTCGTGA ACGCCGCCAA CACCGCGCCG CTGCCCTGGG GCGACCGGCT GCTGGCCACC TGGGACGCCG GCCGCCCGGT CGAGATCGAC CCGCTCTCCC TGGAATTCGT GGCCGAAGTG GGGCACCGCG ACGACTGGAA GCCGGCCATC GACCAGGCGG TGCTGCCGCT GATCTCCACC ACCGCCCACC CGGTGATCGA CCCCGAACGG GGCTGCCTGT GGAGCGTCAG CCGGGACGTC CTCACCGGCG AGGTCTCAGT GATCCGCTAC GACGGGCACG GCAAACACGT GCAGCGCTGG GCGGTCGAGG ACGCCGTGCT GCCGCAGGCC ACCCACACCA TCACCCAGAC CCGCCACTGG CTGGTGCTGG CCGACACCGC CTACAAGGTG GACCCCGAGG AGGTCTTCGG CGCCGAGCGC ACCGTCGCCA ACAACCCCGA CGGCCCGGTG CTGCTGATCC GCAAGGACGA CCTGGTGCCC GGGCGCGCCG GCGTGCCCTG CCGCACCTTC CGCATCGCCC CCGAGGTCAA CCACTTCTAC GCCCGCTACG ACGACACCGA CGGCGTCCAG GTGATCATGG AGCACTCCCC GGGCCTGGAC ATCGGCATGT ACCTGCGCGA AGACGACCTG GACGCCTTCG GCCGCCCCAT CGACCCGGCG CTGCGCGGCA TGTACTGCCA CGGCATGTCC CCGGCGCTGA CCACGCTGCT GCGCTTCGAC CCCGAGACCG GGCGGGTGCA CGAGCGGGCC CGCCTGTTCG ACCCCGAGCG CTACTGGCAG GCCGAGCTGT CGGCGATCGA CTGGAGCTTT GAGGGGCAGA CCAACCCCAC CCGGCACCAC CTCATCCACC TGGGCTTCCA CCCGGAGGCC ATCAACCAGC GGGCGCTGAA GAACTACGAG GGGCGGGTGA ACCCCGAGCT GTTCCCCGCC GAGGAGACCC CGGCGGTGCT CAGCAGCCTC GACTGGCACA CCCTCCGGCC GGTGGCCGAG TGGACCTTCG CGCTGGAGGA CTACCCGACC TCCCCGGTGT TCGTGCCGCG CGGCCCGGGC GCGCCGGGCC GGACCCGCTA TGCCGGGGCC GACCCGGGCG GCCACGACGG CTACCTGCTG GTCGCGGTGC ACAACGACGA CCGCTTCCGG ATCGAGCTGT TCGACGCCGC CGACGTCTCG CGCGGCCCGA TCGCCGTGCT GGCCGCACCG CAGGGCACCA CCGTCCCCTT CCTGATCCAC TCGGCGTGGA TGCCGCAGGC GCGGCCCGCC GACCCGGGCA TAGAGCGGCT GCGCTTCGCC GACGACCTGG ACTCCCGGCT GGACCAGCTG GACCCGGAGC TGGCCGCACT GACCCGCCAG GTGGCCGAGG AGCTGGACGC CCGCCACCGC TGA
|
Protein sequence | MPIPRSILSR TDFSDFELSL IAGAWPQDIT GHYVICTSDQ RTRPLHAFFG DGVVIRLELR PDERGRFPWR ARVIDTPSVR LRRKRPDLFT AGPVGTSSPW GFVNAANTAP LPWGDRLLAT WDAGRPVEID PLSLEFVAEV GHRDDWKPAI DQAVLPLIST TAHPVIDPER GCLWSVSRDV LTGEVSVIRY DGHGKHVQRW AVEDAVLPQA THTITQTRHW LVLADTAYKV DPEEVFGAER TVANNPDGPV LLIRKDDLVP GRAGVPCRTF RIAPEVNHFY ARYDDTDGVQ VIMEHSPGLD IGMYLREDDL DAFGRPIDPA LRGMYCHGMS PALTTLLRFD PETGRVHERA RLFDPERYWQ AELSAIDWSF EGQTNPTRHH LIHLGFHPEA INQRALKNYE GRVNPELFPA EETPAVLSSL DWHTLRPVAE WTFALEDYPT SPVFVPRGPG APGRTRYAGA DPGGHDGYLL VAVHNDDRFR IELFDAADVS RGPIAVLAAP QGTTVPFLIH SAWMPQARPA DPGIERLRFA DDLDSRLDQL DPELAALTRQ VAEELDARHR