Gene Caul_0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0441 
Symbol 
ID5897898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp484381 
End bp485763 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content65% 
IMG OID641560927 
Productcarotenoid oxygenase 
Protein accessionYP_001682076 
Protein GI167644413 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.786971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACC AAAGCCACAC CGATTTTTAC CTTTCAGGCA ATTACGCGCC CGTCCGCAGC 
GAGGACGACT TCGAGCTGGA GATCACCGGC CAGTTTCCCA AAGAGCTGCG CGGGGCGCTC
TATCGCAACG GGCCCAATCC CCAATTCCAG CCGCGTGATC CCAACCACCA CTGGTTCGGC
GGCGACGGCA TGGTCCACGG CTTCTATGTC GAGGACGGCA AGGTCCATTA TCGCAATCGC
TATGTCCGCA CCCCCAAATG GAAAACCGAG AACGCCGCGG GGCGAGCGCT ATTTGGCAGC
ATGGGCAACC CACGCACGAC CGATCCCAGT GTTCTGGGTC AGGACAGCGG GGTGGCCAAC
ACCAATATCC TGGCCCACGG CGGCCGGCTC CTGGCGCTTG AGGAGGGCCA CATGCCGTTC
GAAATGGACG CGCGGTCCTT GGACAGCCTG GGCTATGTCG AGGCCTATAA GGGCCGCGTC
ACCGCCCATC CCAAGATCGA TCCGGTGACC GGCGAGCTGC TGTGGTTCGG CTATGGGGTC
GGGGCCACGC CGTTCTCGCC GGGCATGAGT TTTGGTGTGA CCGACCGCAA CGGCGTGGTG
ACGCGCCGTG ACGATTTCCA GGCGCCTTAC TGCTCGATGG TCCACGACTT CATGGCCACC
CAGAACCACG TCCTGTTTCC CGTCCTGCCC CTGACCGGCA GTCTGGAGCG GGCGATGAAG
GGCGCGCCGA TCTGGGCTTG GGAGCCGGAC CAAGCGGCCT ATGTCGGGGT TCTGCGCCGC
GACGCCGACG TGTCCACCAT CCGCTGGTAC AACACCGGCG CCTGCTACGT CTTCCACACC
ATGAACGCCT GGGAAGCCGA CGGGAAGATC ATGTGCGACG TCATGCGCTT TGACGAGGCG
CCGTTCCCGC GCGCCGACGG CACGATGGGG AAAACGGTCT TCCCCCACAT GGTGCGCTGG
ACGTTCGACC TCTCGCCTGG TTCCGACGCC ATTCGCGAGG AGACCCTGGA TGATCTGGAC
GGGGAGTTCC CGCGTTTCGA CGACCGCCGG GCGACCCAGA CCTACCGCCA TGGCTGGTTC
GCGGCCGATC TTCGCAAGAC CTTCGAACTG ACCGGCATCG CGCACCTGGA CCTGGCGACT
GGCAAGCGAC AGGTCTATGC CCTGCCGCTG GGGGACATGA CGTCCGAGCC GGTGTTCGTC
GAGCGTTCGG CCGACGCCGA GGAAGGCGAC GGCTGGCTGC TGTCGGTGGT GTGGCGCGCG
GCGGAAAACC GCTCCGATCT CGTGGTCTTT GACGCCCAGG ACGTGGCCAA GGGTCCGATC
GCCACGGCGC GGGCTCCGCG GCGCGTGCCC TTCGGCTTCC ATGGCAACTG GGTCAACGCC
TAG
 
Protein sequence
MDDQSHTDFY LSGNYAPVRS EDDFELEITG QFPKELRGAL YRNGPNPQFQ PRDPNHHWFG 
GDGMVHGFYV EDGKVHYRNR YVRTPKWKTE NAAGRALFGS MGNPRTTDPS VLGQDSGVAN
TNILAHGGRL LALEEGHMPF EMDARSLDSL GYVEAYKGRV TAHPKIDPVT GELLWFGYGV
GATPFSPGMS FGVTDRNGVV TRRDDFQAPY CSMVHDFMAT QNHVLFPVLP LTGSLERAMK
GAPIWAWEPD QAAYVGVLRR DADVSTIRWY NTGACYVFHT MNAWEADGKI MCDVMRFDEA
PFPRADGTMG KTVFPHMVRW TFDLSPGSDA IREETLDDLD GEFPRFDDRR ATQTYRHGWF
AADLRKTFEL TGIAHLDLAT GKRQVYALPL GDMTSEPVFV ERSADAEEGD GWLLSVVWRA
AENRSDLVVF DAQDVAKGPI ATARAPRRVP FGFHGNWVNA