Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1272 |
Symbol | |
ID | 5898727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1331495 |
End bp | 1332958 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561757 |
Product | carotenoid oxygenase |
Protein accession | YP_001682900 |
Protein GI | 167645237 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.261404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00803245 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTCG ACCGCCTGCC GCCGGTGCGC ACCTCGCTGC ACCCGACCAA TCACCCCTAC ATGACCGGGG CCTGGACGCC GCTGCACGAG GAGGTCGACG CGGTCGATCT GGAGGTCCTG GAGGGCGCGA TCCCCGCCGA CCTCGACGGC GTCTATCTGC GCAACACCGA GAACCCCGTG CATCAGCCGC TGGGCCGCTA CCACCCGTTC GACGGCGACG GCATGGTCCA CCAGATCGAG TTCAAGAACG GCGCGGCCAG CTATCGCAAC CGTTTCATCC GCACGCGCGG CTTCGAGGCC GAGCAGGAGG CGGGCGAGAG TCTGTGGGGC GGGCTGGCCG ACGGGCCGGG CACGTCCAGG CGGCCGGGCT TCGGGGCCCA TGGCGGCCTG AAGGACACCG CCAGCACCGA CATCGTCGTC CATGCCGGCG AGGCGATCGC CACCTTCTAC CAGTGTGGCG AGGCCTATCG GCTGGATCCG CTGACCCTGG AGAACCTGGG CGTCGCCGCC TGGGCGCCGC TGGACGGCGT CTCGGCCCAC GCCAAGGTCG ACGAGGCGAC CGGCGAGCTC TTGTTCTTCA ACTATTCCAA GCACGCGCCC TACATGCACT ACGGCGTGGT CGACGCGAAC GGCAAGCGCA CCGTCTATCA GCCGATCGAC CTGCCCGGCC CGCGCCTGCC GCACGACATG GCGTTCACGG CGAACTATTC GATCCTCAAC GACCTGCCGG TGTTCTGGGA CCAGACCCTG CTGGAGCGCG ACATCCACGC CGTGCGCCTG CACAAGGGCG TGCCGTCGCG GTTCGGGATC GTGCCGCGCC ATGGCGGCGA AGTGCGCTGG TTCGAGGCCG CCCCGACCTA TGTGCTGCAC TGGCTCAACG CCTACGAGGA CGGCGACGAG ATCGTGCTCG ACGGCTATTT CCAGGAGAAC CCGACCCCGC GCCCGCTGGA GGACGCGCCC GAGGGCCACG GCCACCTGAT GGCCTATCTG GACGAGCACA GCTTCCGGCC CAAGCTGCAC CGCTGGCGCT TCAACCTGGC GACCGGCGAG ACCACCGAAC AGCATCTGGA CGAGCGAATC CTGGAGTTTG GGATGTTCAA CCAGGCCTAT GCGGGGCGGC CCTATCGCTA CGCCTATTCG ACCACCGCCA AGCCGGGCTG GTTCCTGTTC AACGGCTTCG TCAAGCATGA CCTGGAGACG GGTGAGAGCT GGTCGCTCGC GCTGGAACCT GGCCGCTACG CCAGCGAGGC CCCGTTCGCG CCGCGCGTCG GGGCGGTGGA CGAGGACGAC GGCTATCTGG TCAGCTTCAT CATCGACGAG AACCGGGGCA CGTCGGAGTG CCTGGTGGTC GACGCCAAGA CCATGACGCC GACCTGCCGG ATCGCCTTGC CGCATAAGAT CAGCAGCGGC ACGCATGCGG TGTGGGCGGG GCGGAAGATG TTGGCTCCCT CGACCATTCC CTAA
|
Protein sequence | MKLDRLPPVR TSLHPTNHPY MTGAWTPLHE EVDAVDLEVL EGAIPADLDG VYLRNTENPV HQPLGRYHPF DGDGMVHQIE FKNGAASYRN RFIRTRGFEA EQEAGESLWG GLADGPGTSR RPGFGAHGGL KDTASTDIVV HAGEAIATFY QCGEAYRLDP LTLENLGVAA WAPLDGVSAH AKVDEATGEL LFFNYSKHAP YMHYGVVDAN GKRTVYQPID LPGPRLPHDM AFTANYSILN DLPVFWDQTL LERDIHAVRL HKGVPSRFGI VPRHGGEVRW FEAAPTYVLH WLNAYEDGDE IVLDGYFQEN PTPRPLEDAP EGHGHLMAYL DEHSFRPKLH RWRFNLATGE TTEQHLDERI LEFGMFNQAY AGRPYRYAYS TTAKPGWFLF NGFVKHDLET GESWSLALEP GRYASEAPFA PRVGAVDEDD GYLVSFIIDE NRGTSECLVV DAKTMTPTCR IALPHKISSG THAVWAGRKM LAPSTIP
|
| |