Gene Caci_2724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2724 
Symbol 
ID8334073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3119401 
End bp3120921 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content70% 
IMG OID644955874 
Producttryptophan halogenase 
Protein accessionYP_003113480 
Protein GI256391916 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.656683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGCT CTGACAGAAG GATCCTGACA AAAAGGATGG TGTTTTCGGA GATGACCGAG 
GTAATCGTCG TCGGCGGCGG TCCGGCCGGA TCGACCGCAG CCGCCCTACT GGCCAAGAAC
GGAGTTTCGG TCACGCTCCT GGAACGCGAG GCGTTCCCCC GCTACCACGT CGGCGAGTCG
ATCACGTACT CGTGCCGGGG CGTGCTGGAC TACATCGGCG CGCTGGAGAA GATCGAAGCC
CGCGGCTACA CCCGCAAGAC CGGCGTGCTG CTGCGCTGGG GACAGGAGCT GGACTGGGCG
ATCGACTGGA CCGCGCAGTT CGGGCCGGAC GTGCGCTCCT GGCAGGTGGA CCGGGAGGAC
TTCGACCAGG TGCTGCTGCA GCACGCGGCC GAGTGCGGCG CCCAGGTCCT GGAGCAGGCG
CAGGTCAAGC GGGTGGTGTT CGAGGACGGC CGCGCCGTCG GCGTCGAGTG GACGCCCCAG
GGCCACAGCG AGCCGCAGAT CACCCGCGCG GACCTGGTCC TCGACGCCTC GGGCCGGGCC
GGTCTGATCA GCGCCCAGCA CTTCCGCGAC CGCCGCGCCA CCGAGATCTT CCGCAACGTC
GCGATCTGGG GCTACTGGGA CGGCGGCGAG CTGCTGCCGG ACAGTCCGTC CGGGAGCATC
AACGTCATCT CCTCGCCCGA GGGCTGGTAC TGGGTCATCC CGCTGAGCGG GAACCGGTTC
AGCGTCGGCT ACGTCACGCA CAAATCAGTG TTCGTCGAGC GGCGCAAGGA CTACGACACC
CTGGACGACA TGCTCGCGGC GGTGGTTGCC GAGTCGCCGA CGGTGAGCGA GGCGATGGCC
AAGGGCACGG TCCGCCCGGG CGCCCGGGTC GAGCAGGACT TCTCCTACGC GGCCGACAGC
TTCTGCGGCC CGGGCCACTT CCTGGTCGGC GACGCGGCCT GCTTCCTGGA CCCGCTGCTG
TCCACCGGCG TGCACCTGGC CATCTACAGC GGACTGCTCG CGGCCGCCTC GGTGCTCTCG
ATCGAGAACG GCGACGTCAC CGAGACCGAG GCCTACGCGT TCTACGAGTC CCGCTACCGC
AACTCCTACG AGCGGCTGTT CACGCTGGTC GCGGGCTTCT ACCAGAAGCA CGCCGGCAAG
GACCGCTACT TCGAGCTGGC CAAGGCCCTG ACCCGGGAGC ACAAGGGGCT GGAGGGCAGC
GCCGACCTGG CGTTCGGCGA GATCACCTCC GGCATCACCG ACCTGCGCGA GGCCAAGGAC
GACAGCGGGC TCGGCGACCG GCCGATCCGC GAGTCCGTCG CCGAGGCGGC CTCCCGGCGC
TCGAAGGTCC AGGACCTGCT CAGCGCGACC GAGCAGGCGC AGCAGCGGGC CGAGGCCGGG
CTGCCCAACA CCGGCGGCGA CCGCGGCCGC TCGCGGGTGC AGATCGACGC CGACGACCTC
TACGACGCCG CGACCGGGCT GCACCTGGTC ATGGAACCGC GGCTGGGGAT CCAGCGGGCG
GCGGTGCCGG CGGCCGGCTG A
 
Protein sequence
MSRSDRRILT KRMVFSEMTE VIVVGGGPAG STAAALLAKN GVSVTLLERE AFPRYHVGES 
ITYSCRGVLD YIGALEKIEA RGYTRKTGVL LRWGQELDWA IDWTAQFGPD VRSWQVDRED
FDQVLLQHAA ECGAQVLEQA QVKRVVFEDG RAVGVEWTPQ GHSEPQITRA DLVLDASGRA
GLISAQHFRD RRATEIFRNV AIWGYWDGGE LLPDSPSGSI NVISSPEGWY WVIPLSGNRF
SVGYVTHKSV FVERRKDYDT LDDMLAAVVA ESPTVSEAMA KGTVRPGARV EQDFSYAADS
FCGPGHFLVG DAACFLDPLL STGVHLAIYS GLLAAASVLS IENGDVTETE AYAFYESRYR
NSYERLFTLV AGFYQKHAGK DRYFELAKAL TREHKGLEGS ADLAFGEITS GITDLREAKD
DSGLGDRPIR ESVAEAASRR SKVQDLLSAT EQAQQRAEAG LPNTGGDRGR SRVQIDADDL
YDAATGLHLV MEPRLGIQRA AVPAAG