Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1713 |
Symbol | |
ID | 5899168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1803281 |
End bp | 1804468 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641562203 |
Product | lycopene cyclase |
Protein accession | YP_001683340 |
Protein GI | 167645677 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01789] lycopene cyclase [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0564818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.226642 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCCG CGTCCTCGAC CCAGCGCGCC GACGTCGTCC TGGTGGGCGG CGGTCTGGCC AACGGCCTTA TCGCCCTGAG GCTCAAGAGC CTGCGGCCCG CCCTGCGCGT GGTGATGCTG GAACAGGGGC CGACCATCGG CGGCGAGCAC ACCTGGTGCC ACTTCGCGAC CGACGTGGAT GCGGTCGAGG CCGGCTGGCT GCGACCCCTG ATCGTCCATC GCTGGTCGGG CTACGAGGTG CGGTTTCCGG GCCATCGCCG GCAGCTGTCC ACCGACTACC TGGCCATCAC CTCGGCCCGC CTGCACAAGG TGCTGAGCGC CGTGCTCGGC GACGACGCCT GGCTGGGCGC GACGGTCAGC GACGTCAATC CGCACCAGGT GACCCTGGCC GACGGTCGGG TGATCGTCGC TGGAGCGGTG ATCGACGGGC GGGGACCGCG CAAGAGCCGC AGCCTGGCCC TGGGCTACCA GAAGTTCGTC GGCCAGGGCG TGCGCCTGAC CGCCCCCCAC GGCCTGACCC GGCCGATCAT CATGGACGCC ACCGTCTCGC AAGAGGACGG CTACCGCTTC GTCTATGTCC TGCCTCTCGA CCCCATGCGC CTGCTGATCG AGGACACCCG CTACAGCGAC GGCCCGGCCC TCGACCGCGC CGCCCTGCGC CGGGCGATCG GCGCCTACGC CAAGGCCCAG GGCTGGACGA TCGCCCAGAT CGAGCGCGAA GAGGACGGAA TCCTGCCGAT CGCCTTGGGC GGCGACATCG ACGCCTATTG GCGCGAGGCC CGCTCGCAGG TGGCGGAGGT CGGCCTGCGC GCCGCCCTGT TCCAGCCGAC CACCGGCTAT TCCCTGCCCG ACGCCGCCCG CCTGGCCGAG GCCATCGCCG CCCTGCCCCG CATCACCAGC GCCAGCGTCC GGGCCTGTGT CGAGACCCAG TCCAAGACCG TCTGGCGCCG CCGGCGGTTC CTGCGCCTGC TCAACCGCAT GCTATTTCGA GCCTGCGCGC CCGAGGATCG TTACAAGGTG CTGGAGCGCT TCTACCGTCT GCGGCCCGGC CTGATCCAAC GTTTCTACGC CGCGCGCCTG ACGAAGTGGG ACAAGGCTCG GATCCTGATC GGCAAGCCGC CCGTGCCGAT CTCGGCGGCC ATCAAGTGCA TCGGCGAAAG CTCGGTGTTC GGAGGACAAG ACGCATGA
|
Protein sequence | MASASSTQRA DVVLVGGGLA NGLIALRLKS LRPALRVVML EQGPTIGGEH TWCHFATDVD AVEAGWLRPL IVHRWSGYEV RFPGHRRQLS TDYLAITSAR LHKVLSAVLG DDAWLGATVS DVNPHQVTLA DGRVIVAGAV IDGRGPRKSR SLALGYQKFV GQGVRLTAPH GLTRPIIMDA TVSQEDGYRF VYVLPLDPMR LLIEDTRYSD GPALDRAALR RAIGAYAKAQ GWTIAQIERE EDGILPIALG GDIDAYWREA RSQVAEVGLR AALFQPTTGY SLPDAARLAE AIAALPRITS ASVRACVETQ SKTVWRRRRF LRLLNRMLFR ACAPEDRYKV LERFYRLRPG LIQRFYAARL TKWDKARILI GKPPVPISAA IKCIGESSVF GGQDA
|
| |