Gene Caul_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0020 
Symbol 
ID5897732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp25534 
End bp26802 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content64% 
IMG OID641560503 
Productcytochrome P450 
Protein accessionYP_001681656 
Protein GI167643993 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0467456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG GCTCTATCGA TTTTGGCGAC GACGCGCGCG CCAAGGCTTG GTCCATTCCG 
CTGGAGGACT ACCATGTCGC CGATCCGGCC CTGTTCCAGG CCGACGCGAT GTGGCCCTAT
TTCGAGCGCC TGCGGAAGGA AGATCCGGTC CACTGGTCCA GGGGCATCGA GGAGACCGGT
CCCTACTGGT CGATCACCAA GTACAACGAC ATCATGGCGG TCGACACCAA CCATCAGGTG
TTCTCCAGCG ATGCGCATCT GGGCGGCATC ACCATCCGCG ACTTCGACGA GGACTTCGTC
CTGCCGATGT TCATCGCCAT GGACCCGCCC AAGCACGATA TCCAGCGCAA GACCGTCAGC
CCGATCGTCT CGCCGCAGAA CCTGGCCCGG CTGGAGGGGA TCATCCGCGA GCGGGTCTGC
ACGATCCTGG ACGGCCTGCC GATCGGCGAG ACCTTCGACT GGGTCGACAA GGTCTCGATC
GAGCTGACCA CCCAGATGCT GGCCACGCTG TTCGACTTCC CTTGGGAAGA GCGCCGCAAG
CTGACCCGCT GGTCGGACGT GGCCACCGCC TCGCCGGAAA GCGGCATCAT CGAGAGCGAG
GAGGCGCGCC GCGCCGAACT GCTGGAATGC CTGGCCTATT TCACCAACCT GTGGAACGAG
CGGGTCAACG CCACCGAGCC CGGCGATGAC CTGATCTCGA TGCTGGCCCA TGGCGAGGCC
ACCCGCGACA TGCCGCCCAT GGAGTATCTG GGCAACATCA TCCTGCTGAT CGTCGGGGGC
AACGACACGA CCCGCAACTC CCTGACCGGC GGCCTCTACG CGCTCTCCAA GAACCCGGAG
CAGGAAGCCA AGCTGCGGGC CGATCCCGAG CTGATCCCGT CGATGGTCTC GGAGATCATC
CGCTGGCAGA CGCCCCTGGC CCACATGCGT CGCACGGCGC TGGCCGATAT CGAACTGGGC
GGCAAGCAGA TCCGCAAGGG CGACAAGGTC GTCATGTGGT ACGTGTCGGG CAACCGCGAC
GATACGGTGA TCGAGAACCC CGACGCCTTC ATCATCGACC GCGAGAACCC CCGCCGCCAC
CTGTCGTTCG GCTTCGGCAT CCACCGCTGC GTCGGCAACC GCCTGGCCGA GATGCAGCTG
AAGATCGTCT GGGAGGAGAT CCTCAAGCGC TTCCCGAAGA TCGAGGTCCT GGGCGAGCCC
AAGCGGGTCT ATTCCAGCTT CGTGAAGGGC TATGAGAGCT TGCCGGTTCG GATCCCGACG
CGGCTTTGA
 
Protein sequence
MSDGSIDFGD DARAKAWSIP LEDYHVADPA LFQADAMWPY FERLRKEDPV HWSRGIEETG 
PYWSITKYND IMAVDTNHQV FSSDAHLGGI TIRDFDEDFV LPMFIAMDPP KHDIQRKTVS
PIVSPQNLAR LEGIIRERVC TILDGLPIGE TFDWVDKVSI ELTTQMLATL FDFPWEERRK
LTRWSDVATA SPESGIIESE EARRAELLEC LAYFTNLWNE RVNATEPGDD LISMLAHGEA
TRDMPPMEYL GNIILLIVGG NDTTRNSLTG GLYALSKNPE QEAKLRADPE LIPSMVSEII
RWQTPLAHMR RTALADIELG GKQIRKGDKV VMWYVSGNRD DTVIENPDAF IIDRENPRRH
LSFGFGIHRC VGNRLAEMQL KIVWEEILKR FPKIEVLGEP KRVYSSFVKG YESLPVRIPT
RL