Gene Caul_1713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1713 
Symbol 
ID5899168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1803281 
End bp1804468 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content70% 
IMG OID641562203 
Productlycopene cyclase 
Protein accessionYP_001683340 
Protein GI167645677 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01789] lycopene cyclase
[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0564818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.226642 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCCG CGTCCTCGAC CCAGCGCGCC GACGTCGTCC TGGTGGGCGG CGGTCTGGCC 
AACGGCCTTA TCGCCCTGAG GCTCAAGAGC CTGCGGCCCG CCCTGCGCGT GGTGATGCTG
GAACAGGGGC CGACCATCGG CGGCGAGCAC ACCTGGTGCC ACTTCGCGAC CGACGTGGAT
GCGGTCGAGG CCGGCTGGCT GCGACCCCTG ATCGTCCATC GCTGGTCGGG CTACGAGGTG
CGGTTTCCGG GCCATCGCCG GCAGCTGTCC ACCGACTACC TGGCCATCAC CTCGGCCCGC
CTGCACAAGG TGCTGAGCGC CGTGCTCGGC GACGACGCCT GGCTGGGCGC GACGGTCAGC
GACGTCAATC CGCACCAGGT GACCCTGGCC GACGGTCGGG TGATCGTCGC TGGAGCGGTG
ATCGACGGGC GGGGACCGCG CAAGAGCCGC AGCCTGGCCC TGGGCTACCA GAAGTTCGTC
GGCCAGGGCG TGCGCCTGAC CGCCCCCCAC GGCCTGACCC GGCCGATCAT CATGGACGCC
ACCGTCTCGC AAGAGGACGG CTACCGCTTC GTCTATGTCC TGCCTCTCGA CCCCATGCGC
CTGCTGATCG AGGACACCCG CTACAGCGAC GGCCCGGCCC TCGACCGCGC CGCCCTGCGC
CGGGCGATCG GCGCCTACGC CAAGGCCCAG GGCTGGACGA TCGCCCAGAT CGAGCGCGAA
GAGGACGGAA TCCTGCCGAT CGCCTTGGGC GGCGACATCG ACGCCTATTG GCGCGAGGCC
CGCTCGCAGG TGGCGGAGGT CGGCCTGCGC GCCGCCCTGT TCCAGCCGAC CACCGGCTAT
TCCCTGCCCG ACGCCGCCCG CCTGGCCGAG GCCATCGCCG CCCTGCCCCG CATCACCAGC
GCCAGCGTCC GGGCCTGTGT CGAGACCCAG TCCAAGACCG TCTGGCGCCG CCGGCGGTTC
CTGCGCCTGC TCAACCGCAT GCTATTTCGA GCCTGCGCGC CCGAGGATCG TTACAAGGTG
CTGGAGCGCT TCTACCGTCT GCGGCCCGGC CTGATCCAAC GTTTCTACGC CGCGCGCCTG
ACGAAGTGGG ACAAGGCTCG GATCCTGATC GGCAAGCCGC CCGTGCCGAT CTCGGCGGCC
ATCAAGTGCA TCGGCGAAAG CTCGGTGTTC GGAGGACAAG ACGCATGA
 
Protein sequence
MASASSTQRA DVVLVGGGLA NGLIALRLKS LRPALRVVML EQGPTIGGEH TWCHFATDVD 
AVEAGWLRPL IVHRWSGYEV RFPGHRRQLS TDYLAITSAR LHKVLSAVLG DDAWLGATVS
DVNPHQVTLA DGRVIVAGAV IDGRGPRKSR SLALGYQKFV GQGVRLTAPH GLTRPIIMDA
TVSQEDGYRF VYVLPLDPMR LLIEDTRYSD GPALDRAALR RAIGAYAKAQ GWTIAQIERE
EDGILPIALG GDIDAYWREA RSQVAEVGLR AALFQPTTGY SLPDAARLAE AIAALPRITS
ASVRACVETQ SKTVWRRRRF LRLLNRMLFR ACAPEDRYKV LERFYRLRPG LIQRFYAARL
TKWDKARILI GKPPVPISAA IKCIGESSVF GGQDA