Gene Caul_0297 details

Gene Information

Locus tag: Caul_0297
Symbol:
ID: 5897571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 333249
End bp: 334847
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 68%
IMG OID: 641560781
Product: tryptophan halogenase
Protein accession: YP_001681932
Protein GI: 167644269
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID:
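
The coordinate and length fields above are related by simple arithmetic, which a short Python sketch can check. The function names are illustrative and do not come from the source database. The sketch assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(333249, 334847)
print(length_bp, protein_length(length_bp))
```

With the values listed for this gene, the sketch gives 1599 bp and 532 aa, matching the Gene Length and Protein Length fields.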


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.427874
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG AGCCCACGAT TGACCGTATC CTGATCGTCG GCGGCGGCAC GGCCGGCTGG 
ATGACCGCCG CCTACCTGGC CCGCCGCCTG GGGGCGATGC GGCCGGACGG GGTCCAGATC
ACCCTGATCG AGTCCAGCGA GATCGGCATC ATCGGCGTGG GCGAGGGGAC CTTCCCGACC
ATCCAGAACA CCATGCGGAC GATCGGCGTC GACGAGGCGC GGTTCATGCG CGAGGCCGGG
GCGGCCTTCA AGCAAGGCAT CAAGTTCGTC GACTGGAAGA CCGCGCCCAA GGACGGCGTC
CACAGTCACT ACTACCACCC CTTCGCCCCT CCCCGGTTGC TGAACGGCGG CATGGACCTG
GCGCCCTACT GGCTGATGGG CGAGGCGGGA AACATCCCGT TCTCAGATGC GGTGACGTTG
CAGGACAAGG TCTGCGACGC CATGCGCGGC CCCAAGCGGC GCGACGATCC GCAGTACGGC
GGGCCGATGG CCTATGCCTA CCATTTCGAC GCCGGCAAGC TGGCCAACCT CCTGCGCGAC
GTCGGCAAGG CCACGGGCGT TAAGCACCTG CTGGGCAACG TCCAGGCGGT CAACAAGACG
GAAGACGGAT CGATCGCCTC GGTCACCATC CGCGAGCACG GCGACCTGAC CGCCGACCTC
TATATCGACT GCACCGGTTT CGCCGGAGCG CTGATCGGCG AGGCCATGGG CTCGGCCTGG
ATCGACAAGA ACGATGTGCT GTTCGTCGAC CGCGCCCTGG CCCTGCAGGT CCCCTATGAC
CGGCCGGACG CTCCGGTGGC CTCCACCACG CTCTCGACCG CCCACGAGGC CGGCTGGACC
TGGGACATCG GCCTGCCCGA CCGCCGGGGC ACGGGCTATG TTTATTCCAG TCGTCATACT
ACGGACGATC GGGCCGAGCA GATCCTTCTC GGCTATGTCG GCAAGGCGGG CGAAGGCTTG
AACCCGCGCC TGCTGAAGCT GAAGGTCGGC CACCGGGCCC AGCACTGGGT CAAGAACTGC
GTGGCCGTGG GCCTGTCGGG CGGCTTCCTG GAGCCGCTGG AATCCACCGG CATCGTGCTG
ATCGAGGCGG CCGCCTATAT GCTGGCCCGC AACCTGCCCC GCCGGGGCGG CATGGCGGCG
GCGGCGCGCC AGTTCAATAC CGCGATGACC GACCGCTACC TGCGGGCCAT CGACTTCATC
AAGCTGCACT ACTGCCTCAG CCAGCGCGCC GACAACAGCT TCTGGACCGA CAACGCCGAC
CCCGCCTCGA TCCCCCAGAC GCTGCAGGAT CACCTGGCGA TGTGGAAACA TCGCCCGCCC
AACGTCTTCG ACTTCCCGAA CCTCCACGAG TCGTTCAAGT CCTTCAATTA CCAGTACATC
CTGTACGGCA TGGGGTACGA GACGAAGGTC GATCCGGCCG CCCACGTCCA TGGCGACCTG
GCCCGGGCCG ACTTCGCGCG CGTGCGGGAG GCCGGCGTCC GCGCCGCCGC CAGCCTGCCC
GACCATCGCG CCCTGCTGAC CGAGGTCTAC GCCCACGGCT TCAAGACCAA GACCCCCGAC
GCCGCCTCGG CGGAGGCCGC CGAGGGGCTG CGCCGGTGA
 
Protein sequence
MSDEPTIDRI LIVGGGTAGW MTAAYLARRL GAMRPDGVQI TLIESSEIGI IGVGEGTFPT 
IQNTMRTIGV DEARFMREAG AAFKQGIKFV DWKTAPKDGV HSHYYHPFAP PRLLNGGMDL
APYWLMGEAG NIPFSDAVTL QDKVCDAMRG PKRRDDPQYG GPMAYAYHFD AGKLANLLRD
VGKATGVKHL LGNVQAVNKT EDGSIASVTI REHGDLTADL YIDCTGFAGA LIGEAMGSAW
IDKNDVLFVD RALALQVPYD RPDAPVASTT LSTAHEAGWT WDIGLPDRRG TGYVYSSRHT
TDDRAEQILL GYVGKAGEGL NPRLLKLKVG HRAQHWVKNC VAVGLSGGFL EPLESTGIVL
IEAAAYMLAR NLPRRGGMAA AARQFNTAMT DRYLRAIDFI KLHYCLSQRA DNSFWTDNAD
PASIPQTLQD HLAMWKHRPP NVFDFPNLHE SFKSFNYQYI LYGMGYETKV DPAAHVHGDL
ARADFARVRE AGVRAAASLP DHRALLTEVY AHGFKTKTPD AASAEAAEGL RR
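
The gene and protein sequences above can be cross-checked against each other, and a GC percentage recomputed, with a small Python sketch. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and are not part of any database tooling. The codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code. Alternative start codons are not handled.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 is assumed
# to share these codon-to-amino-acid assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGAGCGACG AGCCCACGAT TGACCGTATC CTGATCGTCG GCGGCGGCAC GGCCGGCTGG"
print(translate(first_line))  # MSDEPTIDRILIVGGGTAGW
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.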