Gene Caul_1272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1272 
Symbol 
ID5898727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1331495 
End bp1332958 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content68% 
IMG OID641561757 
Productcarotenoid oxygenase 
Protein accessionYP_001682900 
Protein GI167645237 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.261404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00803245 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACTCG ACCGCCTGCC GCCGGTGCGC ACCTCGCTGC ACCCGACCAA TCACCCCTAC 
ATGACCGGGG CCTGGACGCC GCTGCACGAG GAGGTCGACG CGGTCGATCT GGAGGTCCTG
GAGGGCGCGA TCCCCGCCGA CCTCGACGGC GTCTATCTGC GCAACACCGA GAACCCCGTG
CATCAGCCGC TGGGCCGCTA CCACCCGTTC GACGGCGACG GCATGGTCCA CCAGATCGAG
TTCAAGAACG GCGCGGCCAG CTATCGCAAC CGTTTCATCC GCACGCGCGG CTTCGAGGCC
GAGCAGGAGG CGGGCGAGAG TCTGTGGGGC GGGCTGGCCG ACGGGCCGGG CACGTCCAGG
CGGCCGGGCT TCGGGGCCCA TGGCGGCCTG AAGGACACCG CCAGCACCGA CATCGTCGTC
CATGCCGGCG AGGCGATCGC CACCTTCTAC CAGTGTGGCG AGGCCTATCG GCTGGATCCG
CTGACCCTGG AGAACCTGGG CGTCGCCGCC TGGGCGCCGC TGGACGGCGT CTCGGCCCAC
GCCAAGGTCG ACGAGGCGAC CGGCGAGCTC TTGTTCTTCA ACTATTCCAA GCACGCGCCC
TACATGCACT ACGGCGTGGT CGACGCGAAC GGCAAGCGCA CCGTCTATCA GCCGATCGAC
CTGCCCGGCC CGCGCCTGCC GCACGACATG GCGTTCACGG CGAACTATTC GATCCTCAAC
GACCTGCCGG TGTTCTGGGA CCAGACCCTG CTGGAGCGCG ACATCCACGC CGTGCGCCTG
CACAAGGGCG TGCCGTCGCG GTTCGGGATC GTGCCGCGCC ATGGCGGCGA AGTGCGCTGG
TTCGAGGCCG CCCCGACCTA TGTGCTGCAC TGGCTCAACG CCTACGAGGA CGGCGACGAG
ATCGTGCTCG ACGGCTATTT CCAGGAGAAC CCGACCCCGC GCCCGCTGGA GGACGCGCCC
GAGGGCCACG GCCACCTGAT GGCCTATCTG GACGAGCACA GCTTCCGGCC CAAGCTGCAC
CGCTGGCGCT TCAACCTGGC GACCGGCGAG ACCACCGAAC AGCATCTGGA CGAGCGAATC
CTGGAGTTTG GGATGTTCAA CCAGGCCTAT GCGGGGCGGC CCTATCGCTA CGCCTATTCG
ACCACCGCCA AGCCGGGCTG GTTCCTGTTC AACGGCTTCG TCAAGCATGA CCTGGAGACG
GGTGAGAGCT GGTCGCTCGC GCTGGAACCT GGCCGCTACG CCAGCGAGGC CCCGTTCGCG
CCGCGCGTCG GGGCGGTGGA CGAGGACGAC GGCTATCTGG TCAGCTTCAT CATCGACGAG
AACCGGGGCA CGTCGGAGTG CCTGGTGGTC GACGCCAAGA CCATGACGCC GACCTGCCGG
ATCGCCTTGC CGCATAAGAT CAGCAGCGGC ACGCATGCGG TGTGGGCGGG GCGGAAGATG
TTGGCTCCCT CGACCATTCC CTAA
 
Protein sequence
MKLDRLPPVR TSLHPTNHPY MTGAWTPLHE EVDAVDLEVL EGAIPADLDG VYLRNTENPV 
HQPLGRYHPF DGDGMVHQIE FKNGAASYRN RFIRTRGFEA EQEAGESLWG GLADGPGTSR
RPGFGAHGGL KDTASTDIVV HAGEAIATFY QCGEAYRLDP LTLENLGVAA WAPLDGVSAH
AKVDEATGEL LFFNYSKHAP YMHYGVVDAN GKRTVYQPID LPGPRLPHDM AFTANYSILN
DLPVFWDQTL LERDIHAVRL HKGVPSRFGI VPRHGGEVRW FEAAPTYVLH WLNAYEDGDE
IVLDGYFQEN PTPRPLEDAP EGHGHLMAYL DEHSFRPKLH RWRFNLATGE TTEQHLDERI
LEFGMFNQAY AGRPYRYAYS TTAKPGWFLF NGFVKHDLET GESWSLALEP GRYASEAPFA
PRVGAVDEDD GYLVSFIIDE NRGTSECLVV DAKTMTPTCR IALPHKISSG THAVWAGRKM
LAPSTIP