Gene Caul_1912 details

Gene Information

Locus tag: Caul_1912
Symbol:
ID: 5899367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2054206
End bp: 2055486
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 65%
IMG OID: 641562402
Product: cytochrome P450
Protein accession: YP_001683539
Protein GI: 167645876
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
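
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention); the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record; variable names are illustrative only.
start_bp = 2054206
end_bp = 2055486
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# The gene is an unspliced CDS, so its length should be a whole number of codons.
codons = gene_length_bp // 3

# The final codon is the stop codon and does not encode a residue.
expected_protein_length = codons - 1

print(span_bp, codons, expected_protein_length)
```

Under that assumption the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.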


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.997168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCG CCCAGATGAC CCCCAACAAG GATCTCGCCG AGGGCTTCGC CCAAGTCGGC 
GCCCTGTTCG CGGGCAACGA CAAGAATATC GACGCGATCT ACCGCGACCA CCGCCAGAAC
ATGCCCGTGA TGCGGGGCGA TATCTGCGCC GAACTCGGCG CGGCATCCTT CGCGGGCCAG
ACAGGCCGCC CGATCTATAC GATCTTCAGG CACGCCGACG TCATGAAGGT GCTGCGCGAC
ACCAAGACCT TCACCAGCGG GATCCTGATG GAGACCGGCC TGGGCCAGTT CCTGGACGGC
CTGATGATCA CCGGGCTGGA TGGCGATGAG CACCGGCAGC TGCGTGGCAT TCTGCAGCCG
TCCTTCACGC CCGCGGTGAT GGAGGAATGG CGCGAGACCT ACATCCGTCC GCTGATCCAG
CGCTCCTTCG TCGAACCCCT GGTCGCGCTG GGCAAGACGG AGCTGATCGG CAGCGTCGGC
GTGATGTTCC CCATCCACGT CGTCTACGCC GTCCTGGGCT TCCAGGATAA CGATCCGGCG
GCGCTTGAGA CCTTCGCCAC CAAGGCCCTC AAGGTTCTGG GCGGCATGGC CGACGACCCG
GACGCCAAGC GCGCCGCCTT CCAGGCCTTC CAGGAACTCT ACGATCCGAC CCTCGCCGCC
GTCCAGGCGC GCCGGGCCTC CGGCGCTGAA GGCGCCGACC TGATCAGCCG CCTGATCCGC
GCCGAGTTCG AGGGCCGGAC CCTGAACGAT CATCAGATCA CCAATTTCGT GCGGATGATG
CTGCCTGCCG CGTCGGAGAC TACCTCCAGA ACCTTCGCGA CCATGCTGAC CCACCTGTTC
GATCACCCTG AAGTGCTTGA GCGCCTGCGC GCGGATCGGA GCCTGATGCG CAAGGTTCTG
GACGAAAGCG TGCGCCACGA CGCCGTGGCC ACGTTCAAGG TCCGGGAATG CCAGGCGGAC
GTCACGCTCC AGGACGTGAC CATTCCCAAG GGCTCGATCA TCTCGGCCTG CGTCGCCTCG
GCGAACCGTG ATGAGCTGGT GTTCGACAAA CCCGAAGTGT TCGACATCGA CCGCAAACAG
ATGCCGGCCT TCGGATTCGG GTTCGGAGCT CACATGTGCG TTGGAATGTG GCTAGCCAAG
GTGGAGATCG AAGAGGCCGT CGGCCTGCTG CTCGACATGC TGCCCAACCT GCGCCTCGAC
CCCGACCATC CTCGCCCGGA AGTGCGGGGC GTTTCGCTGC GCGGTCCGGA TGCGGTCCAT
GTGATCTGGG ATATCCCCTA G
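
The GC content reported for this gene is the fraction of G and C bases in the sequence above. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the example input is only the first row of the gene sequence, not the full 1281 bp.

```python
def gc_content(dna):
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied with its 10-base grouping intact.
first_row = "ATGAGCACCG CCCAGATGAC CCCCAACAAG GATCTCGCCG AGGGCTTCGC CCAAGTCGGC"
print(f"{gc_content(first_row):.0%}")
```

Passing the whole sequence block, spaces and line breaks included, gives the figure to compare against the GC content listed in the gene information.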
 
Protein sequence
MSTAQMTPNK DLAEGFAQVG ALFAGNDKNI DAIYRDHRQN MPVMRGDICA ELGAASFAGQ 
TGRPIYTIFR HADVMKVLRD TKTFTSGILM ETGLGQFLDG LMITGLDGDE HRQLRGILQP
SFTPAVMEEW RETYIRPLIQ RSFVEPLVAL GKTELIGSVG VMFPIHVVYA VLGFQDNDPA
ALETFATKAL KVLGGMADDP DAKRAAFQAF QELYDPTLAA VQARRASGAE GADLISRLIR
AEFEGRTLND HQITNFVRMM LPAASETTSR TFATMLTHLF DHPEVLERLR ADRSLMRKVL
DESVRHDAVA TFKVRECQAD VTLQDVTIPK GSIISACVAS ANRDELVFDK PEVFDIDRKQ
MPAFGFGFGA HMCVGMWLAK VEIEEAVGLL LDMLPNLRLD PDHPRPEVRG VSLRGPDAVH
VIWDIP
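
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. A minimal sketch of that translation follows, assuming the standard NCBI amino-acid assignment for table 11 in TCAG codon order; the names `CODON_TABLE` and `translate` are hypothetical. The sketch does not handle alternative start codons, which table 11 reads as Met at initiation; this gene starts with ATG, so that case does not arise here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1 until the first stop codon.

    Spaces and line breaks are ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGAGCACCG CCCAGATGAC CCCCAACAAG"))  # MSTAQMTPNK
print(translate("GTGATCTGGG ATATCCCCTA G"))           # VIWDIP
```

The two fragments reproduce the first ten and last six residues of the protein sequence; the final TAG codon is the stop and contributes no residue.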