Gene Caci_0530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0530 
Symbol 
ID8331857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp614461 
End bp615936 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content68% 
IMG OID644953687 
Producttryptophan halogenase 
Protein accessionYP_003111314 
Protein GI256389750 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.680136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.208224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAGC AGAACACAGT GAAAACTGCG AACGACGTAG ACAACCCGGA CAACGCCGAG 
AAGACCCGCG TGCTGGTCAT CGGCGGCGGC CCCGGCGGCT CGACCGCCGC CACCCTGCTG
GCCCGGCAGG GCATCGAGGT GACGCTGCTG GAGAGCGCCA TCTTCCCCCG CTACCACATC
GGCGAGTCGA TCCTGCCTTC CGTACTGCCG GTGCTGGACC TGCTCGGAGT CCGCGAGGAG
GTCGACAACC ACGGCTTCGT GCGCAAGGAC GGCGCCTATT TCGAGTGGGG CCCGGAGAAC
TGGGACCTGA ACTTCGACCA CCTGTCCGGC GCCAGCCGGC ACAGCTACCA GGTGATCCGG
TCCGAGTTCG ACCACATGCT GCTGAAGAAC GCCCAGGCCA AGGGCGTTGA CGTGCGCGAG
GGCGTCAAGG TCACGGAGAT CCTGTTCCAC GGCGACCGCC CCGTCGCGGC CCGCTGGTCA
GCCTCGGACA ACTCCGGGGC CGCCGGCACC ATCGCGTTCG ACTTCCTGGT CGACGCCTCG
GGCCGGGCCG GGGTGATGGC GACCAAGTAC CTGAAGAACC GCCGATACCA CGAGGCCTTC
AAGAACGTCG CGGTCTGGTC CTACTGGCGC GACGTCAAGC CGCTGGAGGT CGGGCCGAAG
GGCGCCATCG CGGTCTGCTC GGTGCCCTAC GGCTGGTTCT GGGCCATTCC GCTGCACGAC
GGCACGACCT CGATCGGCCT GGTGGCCAAG CGCACCACAT TCTCCGACGA GCGCGAGCGG
CTCGGCAGCA TCGAGGCGGT CTACGCCGAC GCGATGACCC AGGCCCCGCG GATCCTGGAG
ATGACCCAGG GCGCGAACAA GATCGAGGGT TACAAGGTCG AGCAGGACTA CTCCTACGTC
TCCGAGCGCA AGTCCGGCCC CGGCTACGTC CTGGTCGGCG ACGCGGCCGC CTTCCTGGAC
CCGCTGCTGT CCACCGGGGT GCACCTGTCG ACGTTCAGCG CGCTGCTCGC GGCCGCCTCG
GTCTCCGCGG TGCTGGACGG CGAGCTGGCC GAGCAGGAGG CGGTGGACTT CTACGAGCGG
GCCTACCACC AGGCCTACGA GCGGCTGCTG GTGGTCGTCT CCTTCTTCTA CAACAGCTAC
AACCGGCAGA CCCAGTTCTT CGAGGCCGAC AAGCTGACCC GGCGCGAGAG GCACATGCTG
AACCTCTACG AGTCTTTCCT GCACATCGTC ACCGGCATCG AGGACCTGGA CGACTCGATC
GATGGCGGCG AGGCGCTGGA GGAGGTCGCG CAGCAGATCG CGACCCAGAA GAAGATCGAC
GCCGGGCACA ACGAGGCGAT GAACTCGCTG CCGGACTCGC CGCGGCAGGC CGTCGGCGGG
TTGTACCTGG AACTGGAGCC CCGGCTACGC ATCCGCCGCA CATCCGAGGC GCCCGCCGGC
CCGGAGCCGG TGCGGGAAGC GAGGGCCGGG TTGTGA
 
Protein sequence
MTEQNTVKTA NDVDNPDNAE KTRVLVIGGG PGGSTAATLL ARQGIEVTLL ESAIFPRYHI 
GESILPSVLP VLDLLGVREE VDNHGFVRKD GAYFEWGPEN WDLNFDHLSG ASRHSYQVIR
SEFDHMLLKN AQAKGVDVRE GVKVTEILFH GDRPVAARWS ASDNSGAAGT IAFDFLVDAS
GRAGVMATKY LKNRRYHEAF KNVAVWSYWR DVKPLEVGPK GAIAVCSVPY GWFWAIPLHD
GTTSIGLVAK RTTFSDERER LGSIEAVYAD AMTQAPRILE MTQGANKIEG YKVEQDYSYV
SERKSGPGYV LVGDAAAFLD PLLSTGVHLS TFSALLAAAS VSAVLDGELA EQEAVDFYER
AYHQAYERLL VVVSFFYNSY NRQTQFFEAD KLTRRERHML NLYESFLHIV TGIEDLDDSI
DGGEALEEVA QQIATQKKID AGHNEAMNSL PDSPRQAVGG LYLELEPRLR IRRTSEAPAG
PEPVREARAG L